Show simple item record

dc.contributor.authorWester, Mirjam
dc.date.accessioned2006-05-19T10:26:39Z
dc.date.available2006-05-19T10:26:39Z
dc.date.issued2003
dc.identifier.citationComputer Speech and Language, 17:69-85, 2003.en
dc.identifier.uridoi:10.1016 /S0885-2308 (02 )00 03 0-X
dc.identifier.urihttp://hdl.handle.net/1842/1133
dc.description.abstractThis article focuses on modeling pronunciation variation in two different ways: data-derived and knowledge-based. The knowledge-based approach consists of using phonological rules to generate variants. The data-derived approach consists of performing phone recognition, followed by smoothing using decision trees (D-trees) to alleviate some of the errors in the phone recognition. Using phonological rules led to a small improvement in WER; a data-derived approach in which the phone recognition was smoothed using D-trees prior to lexicon generation led to larger improvements compared to the baseline. The lexicon was employed in two different recognition systems: a hybrid HMM/ANN system and a HMM-based system, to ascertain whether pronunciation variation was truly being modeled. This proved to be the case as no significant differences were found between the results obtained with the two systems. Furthermore, we found that 10% of variants generated by the phonological rules were also found using phone recognition, and this increased to 28% when the phone recognition output was smoothed by using D-trees. This indicates that the D-trees generalize beyond what has been seen in the training material, whereas when the phone recognition approach is employed directly, unseen pronunciations cannot be predicted. In addition, we propose a metric to measure confusability in the lexicon. Using this confusion metric to prune variants results in roughly the same improvement as using the D-tree method.en
dc.format.extent409114 bytes
dc.format.mimetypeapplication/pdf
dc.language.isoen
dc.publisherElsevieren
dc.subjectspeechen
dc.subjectautomatic speech recognitionen
dc.subjectasren
dc.titlePronunciation modeling for ASR - knowledge-based and data-derived methods.en
dc.typeArticleen


Files in this item

This item appears in the following Collection(s)

Show simple item record