Show simple item record

dc.contributor.authorMorais, Edmilsonen
dc.contributor.authorTaylor, Paulen
dc.contributor.authorViolaro, Fabioen
dc.coverage.spatial4en
dc.date.accessioned2006-05-11T13:09:49Z
dc.date.available2006-05-11T13:09:49Z
dc.date.issued2000-10
dc.identifier.citationIn ICSLP-2000, vol.2, 387-390.
dc.identifier.urihttp://www.isca-speech.org/archive/icslp_2000/
dc.identifier.urihttp://hdl.handle.net/1842/983
dc.description.abstractThis paper presents some preliminary methods to apply the Time- Frequency Interpolation technique - TFI [3] to concatenative text-to-speech synthesis. The TFI technique described here is a pitch-synchronous time-frequency approach of the well known Prototype-Waveform Interpolation technique - PWI [2]. The basic concepts of representing the speech signal in the Time-Frequency Domain as well as techniques to perform Time-Scale and Pitch- Scale modifications are described. Using the flexibility of TFI technique to perform spectral smothing, a method was developed to minimize the spectral mismatch at the boundaries of the Synthesis-Units - SUs. The proposed system was evaluated using SUs (Diphones) and prosodic modifications generated by the Festival system [1]. An informal subjective test was performed, between the proposed TFI system and the standard TD-PSOLA system, highligthing the superior quality of the proposed system in comparasion with TD-PSOLA.en
dc.format.extent172143 bytesen
dc.format.mimetypeapplication/pdfen
dc.language.isoen
dc.publisherInternational Speech Communication Associationen
dc.titleConcatenative Text-to-Speech Synthesis Based on Prototype Waveform Interpolation (A Time Frequency Approach)en
dc.typeConference Paperen


Files in this item

This item appears in the following Collection(s)

Show simple item record