Show simple item record

Interspeech

dc.contributor.authorAylett, Matthew
dc.contributor.authorKing, Simon
dc.contributor.authorYamagishi, Junichi
dc.date.accessioned2010-10-12T13:01:28Z
dc.date.available2010-10-12T13:01:28Z
dc.date.issued2009en
dc.identifier.urihttp://hdl.handle.net/1842/3909
dc.description.abstractIn speech synthesis the unit inventory is decided using phonological and phonetic expertise. This process is resource intensive and potentially sub-optimal. In this paper we investigate how acoustic clustering, together with lexicon constraints, can be used to build a self-organised inventory. Six English speech synthesis systems were built using two frameworks, unit selection and parametric HTS for three inventory conditions: 1) a traditional phone set, 2) a system using orthographic units, and 3) a self-organised inventory. A listening test showed a strong preference for the classic system, and for the orthographic system over the self-organised system. Results also varied by letter to sound complexity and database coverage. This suggests the self-organised approach failed to generalise pronunciation as well as introducing noise above and beyond that caused by orthographic sound mismatch.en
dc.titleSpeech Synthesis Without a Phone Inventoryen
dc.typeConference Paperen
rps.titleInterspeechen
dc.date.updated2010-10-12T13:01:29Z
dc.date.openingDate2009


Files in this item

This item appears in the following Collection(s)

Show simple item record