Edinburgh Research Archive

Hierarchical Bayesian Language Models for Conversational Speech Recognition

dc.contributor.author
Huang, Songfang
en
dc.contributor.author
Renals, Steve
en
dc.date.accessioned
2010-12-15T13:57:33Z
dc.date.available
2010-12-15T13:57:33Z
dc.date.issued
2010
dc.date.updated
2010-12-15T13:57:33Z
dc.description.abstract
Traditional n-gram language models are widely used in state-of-the-art large vocabulary speech recognition systems. This simple model suffers from some limitations, such as overfitting of maximum-likelihood estimation and the lack of rich contextual knowledge sources. In this paper, we exploit a hierarchical Bayesian interpretation for language modeling, based on a nonparametric prior called the Pitman--Yor process. This offers a principled approach to language model smoothing, embedding the power-law distribution for natural language. Experiments on the recognition of conversational speech in multiparty meetings demonstrate that by using hierarchical Bayesian language models, we are able to achieve significant reductions in perplexity and word error rate.
en
dc.extent.pageNumbers
1941--1954
en
dc.identifier.doi
10.1109/TASL.2010.2040782
dc.identifier.uri
http://dx.doi.org/10.1109/TASL.2010.2040782
dc.identifier.uri
http://hdl.handle.net/1842/4528
dc.publisher
IEEE
en
dc.title
Hierarchical Bayesian Language Models for Conversational Speech Recognition
en
dc.type
Article
en
rps.issue
8
en
rps.title
IEEE Transactions on Audio, Speech and Language Processing
en
rps.volume
18
en

Files

Original bundle

Now showing 1 - 1 of 1
Name:
huang-taslp10.pdf
Size:
986.9 KB
Format:
Adobe Portable Document Format

This item appears in the following Collection(s)