dc.contributor.author | Morais, Edmilson | en |
dc.contributor.author | Taylor, Paul | en |
dc.contributor.author | Violaro, Fabio | en |
dc.coverage.spatial | 4 | en |
dc.date.accessioned | 2006-05-11T13:09:49Z | |
dc.date.available | 2006-05-11T13:09:49Z | |
dc.date.issued | 2000-10 | |
dc.identifier.citation | In ICSLP-2000, vol.2, 387-390. | |
dc.identifier.uri | http://www.isca-speech.org/archive/icslp_2000/ | |
dc.identifier.uri | http://hdl.handle.net/1842/983 | |
dc.description.abstract | This paper presents some preliminary methods to apply the Time- Frequency Interpolation technique - TFI [3] to concatenative text-to-speech synthesis. The TFI technique described here is a pitch-synchronous time-frequency approach of the well known Prototype-Waveform Interpolation technique - PWI [2]. The basic concepts of representing the speech signal in the Time-Frequency Domain as well as techniques to perform Time-Scale and Pitch- Scale modifications are described. Using the flexibility of TFI technique to perform spectral smothing, a method was developed to minimize the spectral mismatch at the boundaries of the Synthesis-Units - SUs. The proposed system was evaluated using SUs (Diphones) and prosodic modifications generated by the Festival system [1]. An informal subjective test was performed, between the proposed TFI system and the standard TD-PSOLA system, highligthing the superior quality of the proposed system in comparasion with TD-PSOLA. | en |
dc.format.extent | 172143 bytes | en |
dc.format.mimetype | application/pdf | en |
dc.language.iso | en | |
dc.publisher | International Speech Communication Association | en |
dc.title | Concatenative Text-to-Speech Synthesis Based on Prototype Waveform Interpolation (A Time Frequency Approach) | en |
dc.type | Conference Paper | en |