Edinburgh Research Archive

Multiple acoustic cues for Korean stops and automatic speech recognition

dc.contributor.author
Yun, Weonhee.
en
dc.date.accessioned
2018-01-31T11:40:55Z
dc.date.available
2018-01-31T11:40:55Z
dc.date.issued
2003
dc.description.abstract
The purpose of this thesis is to analyse acoustic characteristics of Korean stops by way of multivariate statistical tests, and to apply the results of the analysis in Automatic Speech Recognition (ASR) of Korean. Three acoustic cues that differentiate three types of Ko¬ rean oral stops are closure duration, Voice Onset Time (VOT) and fundamental frequency (FO) of a vowel after a stop. We review the characteristics of these parameters previously reported in various phonetic studies and test their usefulness for differentiating the three types of stops on two databases, one with controlled contexts, as in other phonetic stud¬ ies, and the other a continuous speech database designed for ASR. Statistical tests on both databases confirm that the three types of stops can be differentiated by the three acoustic parameters. In order to exploit these parameters for ASR, a context dependent Hidden Markov Model (HMM) based baseline system with a short pause model is built, which results in great improvement of performance compared to other systems. For mod¬ elling of the three acoustic parameters, an automatic segmentation technique for closure and VOT is developed. Samples of each acoustic parameter are modelled with univariate and multivariate probability distribution functions. Stop probability from these models is integrated by a post-processing technique. Our results show that integration of stop prob¬ ability does not make much improvement over the results of a baseline system. However, the results suggest that stop probabilities will be useful in determining the correct hy¬ pothesis with a larger lexicon containing more minimal pairs of words that differ by the identity of just one stop.
en
dc.identifier.uri
http://hdl.handle.net/1842/27736
dc.publisher
The University of Edinburgh
en
dc.relation.ispartof
Annexe Thesis Digitisation Project 2017 Block 16
en
dc.relation.isreferencedby
en
dc.title
Multiple acoustic cues for Korean stops and automatic speech recognition
en
dc.type
Thesis or Dissertation
en
dc.type.qualificationlevel
en
dc.type.qualificationname
PhD Doctor of Philosophy
en

Files

Original bundle

Now showing 1 - 1 of 1
Name:
YunW_2003redux.pdf
Size:
24.92 MB
Format:
Adobe Portable Document Format

This item appears in the following Collection(s)