Show simple item record

dc.contributor.authorGillett, Ben
dc.contributor.authorKing, Simon
dc.date.accessioned2006-05-15T16:38:00Z
dc.date.available2006-05-15T16:38:00Z
dc.date.issued2003
dc.identifier.citationGillett, Ben / King, Simon (2003): "Transforming voice quality", In EUROSPEECH-2003, 1713-1716.en
dc.identifier.urihttp://www.isca-speech.org/archive/eurospeech_2003/index.html
dc.identifier.urihttp://hdl.handle.net/1842/1086
dc.description.abstractVoice transformation is the process of transforming the characteristics of speech uttered by a source speaker, such that a listener would believe the speech was uttered by a target speaker. In this paper we address the problem of transforming voice quality. We do not attempt to transform prosody. Our system has two main parts corresponding to the two components of the source-filter model of speech production. The first component transforms the spectral envelope as represented by a linear prediction model. The transformation is achieved using a Gaussian mixture model, which is trained on aligned speech from source and target speakers. The second part of the system predicts the spectral detail from the transformed linear prediction coefficients. A novel approach is proposed, which is based on a classifier and residual codebooks. On the basis of a number of performance metrics it outperforms existing systems.en
dc.format.extent136177 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherInternational Speech Communication Associationen
dc.subjectspeech processingen
dc.subjecttransformationen
dc.titleTransforming Voice Qualityen
dc.typeConference Paperen


Files in this item

This item appears in the following Collection(s)

Show simple item record