Articulatory feature classifiers trained on 2000 hours of telephone speech
dc.contributor.author
Frankel, Joe
en
dc.contributor.author
Magimai-Doss, Mathew
en
dc.contributor.author
King, Simon
en
dc.contributor.author
Livescu, Karen
en
dc.contributor.author
Çetin, Ozgur
en
dc.date.accessioned
2007-09-18T10:02:10Z
dc.date.available
2007-09-18T10:02:10Z
dc.date.issued
2007
dc.description.abstract
The so-called tandem approach, where the posteriors of a multilayer perceptron (MLP) classifier are used as features in an automatic speech recognition (ASR) system has proven to be a very effective method. Most tandem approaches up to date have relied on MLPs trained for phone classification, and appended the posterior features to some standard feature hidden Markov model (HMM). In this paper, we develop an alternative tandem approach based on MLPs trained for articulatory feature (AF) classification. We also develop a factored observation model for characterizing the posterior and standard features at the HMM outputs, allowing for separate hidden mixture and state-tying structures for each factor. In experiments on a subset of Switchboard, we show that the AFbased tandem approach is as effective as the phone-based approach, and that the factored observation model significantly outperforms the simple feature concatenation approach while using fewer parameters.
en
dc.format.extent
100851 bytes
en
dc.format.mimetype
application/pdf
en
dc.identifier.citation
J. Frankel, M. Magimai-Doss, S. King, K. Livescu, and O. Çetin. Articulatory feature classifiers trained on 2000 hours of telephone speech. In Proc. Interspeech, Antwerp, Belgium, August 2007.
dc.identifier.uri
http://hdl.handle.net/1842/2000
dc.language.iso
en
dc.subject
speech technology
en
dc.title
Articulatory feature classifiers trained on 2000 hours of telephone speech
en
dc.type
Conference Paper
en
Files
Original bundle
1 - 1 of 1
- Name:
- frankel_AF-MLP.pdf
- Size:
- 98.49 KB
- Format:
- Adobe Portable Document Format
This item appears in the following Collection(s)

