The THISL broadcast news retrieval system.

Abberley, Dave; Kirby, David; Renals, Steve; Robinson, Tony

The THISL broadcast news retrieval system.

Simple item page

dc.contributor.author

Abberley, Dave

en

dc.contributor.author

Kirby, David

en

dc.contributor.author

Renals, Steve

en

dc.contributor.author

Robinson, Tony

en

dc.date.accessioned

2006-05-26T16:23:49Z

dc.date.available

2006-05-26T16:23:49Z

dc.date.issued

1999

dc.description.abstract

In this paper we introduce a set of related confidence measures for large vocabulary continuous speech recognition (LVCSR) based on local phone posterior probability estimates output by an acceptor HMM acoustic model. In addition to their computational efficiency, these confidence measures are attractive as they may be applied at the state-, phone-, word- or utterance-levels, potentially enabling discrimination between different causes of low confidence recognizer output, such as unclear acoustics or mismatched pronunciation models. We have evaluated these confidence measures for utterance verification using a number of different metrics. Experiments reveal several trends in `profitability of rejection', as measured by the unconditional error rate of a hypothesis test. These trends suggest that crude pronunciation models can mask the relatively subtle reductions in confidence caused by out-of-vocabulary (OOV) words and disfluencies, but not the gross model mismatches elicited by non-speech sounds. The observation that a purely acoustic confidence measure can provide improved performance over a measure based upon both acoustic and language model information for data drawn from the Broadcast News corpus, but not for data drawn from the North American Business News corpus suggests that the quality of model fit offered by a trigram language model is reduced for Broadcast News data. We also argue that acoustic confidence measures may be used to inform the search for improved pronunciation models.

en

dc.format.extent

48491 bytes

en

dc.format.extent

62511 bytes

en

dc.format.mimetype

application/octet-stream

en

dc.format.mimetype

application/pdf

en

dc.identifier.citation

In Proc. ESCA Workshop on Accessing Information In Spoken Audio, pages 19-24, Cambridge, 1999.

dc.identifier.uri

http://homepages.inf.ed.ac.uk/srenals/pubs/1999/esca99-thisl/

dc.identifier.uri

http://hdl.handle.net/1842/1193

dc.language.iso

en

dc.title

The THISL broadcast news retrieval system.

en

dc.type

Conference Paper

en

Files

Original bundle

Now showing 1 - 2 of 2

Name:: esca99-thisl.pdf
Size:: 61.05 KB
Format:: Adobe Portable Document Format
Description:

Download

Name:: esca99-thisl.ps.gz
Size:: 47.35 KB
Format:: Unknown data format
Description:

Download

This item appears in the following Collection(s)

CSTR publications