Hierarchical Bayesian Language Models for Conversational Speech Recognition

Huang, Songfang; Renals, Steve

doi:10.1109/TASL.2010.2040782

Hierarchical Bayesian Language Models for Conversational Speech Recognition

Simple item page

dc.contributor.author

Huang, Songfang

en

dc.contributor.author

Renals, Steve

en

dc.date.accessioned

2010-12-15T13:57:33Z

dc.date.available

2010-12-15T13:57:33Z

dc.date.issued

2010

dc.date.updated

2010-12-15T13:57:33Z

dc.description.abstract

Traditional n-gram language models are widely used in state-of-the-art large vocabulary speech recognition systems. This simple model suffers from some limitations, such as overfitting of maximum-likelihood estimation and the lack of rich contextual knowledge sources. In this paper, we exploit a hierarchical Bayesian interpretation for language modeling, based on a nonparametric prior called the Pitman--Yor process. This offers a principled approach to language model smoothing, embedding the power-law distribution for natural language. Experiments on the recognition of conversational speech in multiparty meetings demonstrate that by using hierarchical Bayesian language models, we are able to achieve significant reductions in perplexity and word error rate.

en

dc.extent.pageNumbers

1941--1954

en

dc.identifier.doi

10.1109/TASL.2010.2040782

dc.identifier.uri

http://dx.doi.org/10.1109/TASL.2010.2040782

dc.identifier.uri

http://hdl.handle.net/1842/4528

dc.publisher

IEEE

en

dc.title

Hierarchical Bayesian Language Models for Conversational Speech Recognition

en

dc.type

Article

en

rps.issue

8

en

rps.title

IEEE Transactions on Audio, Speech and Language Processing

en

rps.volume

18

en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: huang-taslp10.pdf
Size:: 986.9 KB
Format:: Adobe Portable Document Format

Download

This item appears in the following Collection(s)

CSTR publications