Multiple acoustic cues for Korean stops and automatic speech recognition

Yun, Weonhee.

Multiple acoustic cues for Korean stops and automatic speech recognition

Simple item page

dc.contributor.author

Yun, Weonhee.

en

dc.date.accessioned

2018-01-31T11:40:55Z

dc.date.available

2018-01-31T11:40:55Z

dc.date.issued

2003

dc.description.abstract

The purpose of this thesis is to analyse acoustic characteristics of Korean stops by way of multivariate statistical tests, and to apply the results of the analysis in Automatic Speech Recognition (ASR) of Korean. Three acoustic cues that differentiate three types of Ko¬ rean oral stops are closure duration, Voice Onset Time (VOT) and fundamental frequency (FO) of a vowel after a stop. We review the characteristics of these parameters previously reported in various phonetic studies and test their usefulness for differentiating the three types of stops on two databases, one with controlled contexts, as in other phonetic stud¬ ies, and the other a continuous speech database designed for ASR. Statistical tests on both databases confirm that the three types of stops can be differentiated by the three acoustic parameters. In order to exploit these parameters for ASR, a context dependent Hidden Markov Model (HMM) based baseline system with a short pause model is built, which results in great improvement of performance compared to other systems. For mod¬ elling of the three acoustic parameters, an automatic segmentation technique for closure and VOT is developed. Samples of each acoustic parameter are modelled with univariate and multivariate probability distribution functions. Stop probability from these models is integrated by a post-processing technique. Our results show that integration of stop prob¬ ability does not make much improvement over the results of a baseline system. However, the results suggest that stop probabilities will be useful in determining the correct hy¬ pothesis with a larger lexicon containing more minimal pairs of words that differ by the identity of just one stop.

en

dc.identifier.uri

http://hdl.handle.net/1842/27736

dc.publisher

The University of Edinburgh

en

dc.relation.ispartof

Annexe Thesis Digitisation Project 2017 Block 16

en

dc.relation.isreferencedby

en

dc.title

Multiple acoustic cues for Korean stops and automatic speech recognition

en

dc.type

Thesis or Dissertation

en

dc.type.qualificationlevel

en

dc.type.qualificationname

PhD Doctor of Philosophy

en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: YunW_2003redux.pdf
Size:: 24.92 MB
Format:: Adobe Portable Document Format

Download

This item appears in the following Collection(s)

Informatics thesis and dissertation collection