Automatically clustering similar units for unit selection in speech synthesis.
dc.contributor.author
Black, Alan W
en
dc.contributor.author
Taylor, Paul A
en
dc.date.accessioned
2006-06-14T15:41:53Z
dc.date.available
2006-06-14T15:41:53Z
dc.date.issued
1997
dc.description.abstract
This paper describes a new method for synthesizing
speech by concatenating sub-word units from a
database of labelled speech. A large unit inventory is
created by automatically clustering units of the same
phone class based on their phonetic and prosodic context.
The appropriate cluster is then selected for a target
unit offering a small set of candidate units. An optimal
path is found through the candidate units based on
their distance from the cluster center and an acoustically
based join cost. Details of the method and justification
are presented. The results of experiments using
two different databases are given, optimising various
parameters within the system. Also a comparison
with other existing selection based synthesis techniques
is given showing the advantages this method has over
existing ones. The method is implemented within a full
text-to-speech system offering efficient natural sounding
speech synthesis.
en
dc.format.extent
73786 bytes
en
dc.format.extent
54422 bytes
en
dc.format.mimetype
application/postscript
en
dc.format.mimetype
application/pdf
en
dc.identifier.citation
In Eurospeech97, volume 2, pages 601-604, Rhodes, Greece, 1997.
dc.identifier.uri
http://hdl.handle.net/1842/1236
dc.language.iso
en
dc.publisher
International Speech Communication Association
en
dc.title
Automatically clustering similar units for unit selection in speech synthesis.
en
dc.type
Conference Paper
en
This item appears in the following Collection(s)

