Show simple item record

dc.contributor.authorVepa, Jithendra
dc.contributor.authorKing, Simon
dc.contributor.authorTaylor, Paul
dc.coverage.spatial4en
dc.date.accessioned2006-05-10T17:32:59Z
dc.date.available2006-05-10T17:32:59Z
dc.date.issued2002-09
dc.identifier.citationIn Speech Synthesis, 2002. Proceedings of 2002 IEEE Workshopen
dc.identifier.otherDigital Object Identifier: 10.1109/WSS.2002.1224414
dc.identifier.urihttp://ieeexplore.ieee.org/servlet/opac?punumber=8670
dc.identifier.urihttp://hdl.handle.net/1842/959
dc.description.abstractThe quality of unit selection based concatenative speech synthesis mainly depends on how well two successive units can be joined together to minimise the audible discontinuities. The objective measure of discontinuity used when selecting units is known as the join cost. The ideal join cost measures perceived discontinuity, based on easily measurable spectral properties of the units being joined, in order to ensure smooth and natural-sounding synthetic speech. In this paper we describe a perceptual experiment conducted to measure the correlation between subjective human perception and various objective spectrally-based measures proposed in the literature. Also we report new objective distance measures derived from various distance metrics based on these spectral features, which have good correlation with human perception to concatenation discontinuities. Our experiments used a state-of-the art unit-selection text-to-speech system: rVoice from Rhetorical Systems Limited.en
dc.format.extent384376 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherIEEEen
dc.titleNew objective distance measures for spectral discontinuities in concatenative speech synthesisen
dc.typeConference Paperen


Files in this item

This item appears in the following Collection(s)

Show simple item record