Show simple item record

Proc. Interspeech

dc.contributor.authorWang, Dong
dc.contributor.authorHimawan, Ivan
dc.contributor.authorFrankel, Joe
dc.contributor.authorKing, Simon
dc.date.accessioned2010-10-12T09:23:35Z
dc.date.available2010-10-12T09:23:35Z
dc.date.issued2008en
dc.identifier.urihttp://hdl.handle.net/1842/3907
dc.description.abstractAutomatic speech recognition (ASR) becomes rather difficult in meetings domains because of the adverse acoustic conditions, including more background noise, more echo and reverberation and frequent cross-talking. Microphone arrays have been demonstrated able to boost ASR performance dramatically in such noisy and reverberant environments, with various beamforming algorithms. However, almost all existing beamforming measures work in the acoustic domain, resorting to signal processing theories and geometric explanation. This limits their application, and induces significant performance degradation when the geometric property is unavailable or hard to estimate, or if heterogenous channels exist in the audio system. In this paper, we preset a new posterior-based approach for array-based speech recognition. The main idea is, instead of enhancing speech signals, we try to enhance the posterior probabilities that frames belonging to recognition units, e.g., phones. These enhanced posteriors are then transferred to posterior probability based features and are modeled by HMMs, leading to a tandem ANN-HMM hybrid system presented by Hermansky et al.. Experimental results demonstrated the validity of this posterior approach. With the posterior accumulation or enhancement, significant improvement was achieved over the single channel baseline. Moreover, we can combine the acoustic enhancement and posterior enhancement together, leading to a hybrid acoustic-posterior beamforming approach, which works significantly better than just the acoustic beamforming, especially in the scenario with moving-speakers.en
dc.titleA Posterior Approach for Microphone Array Based Speech Recognitionen
dc.typeConference Paperen
rps.titleProc. Interspeechen
dc.date.updated2010-10-12T09:23:36Z
dc.date.openingDate2008


Files in this item

This item appears in the following Collection(s)

Show simple item record