dc.contributor.author | Abberley, Dave | en |
dc.contributor.author | Renals, Steve | en |
dc.contributor.author | Ellis, Dan | en |
dc.contributor.author | Robinson, Tony | en |
dc.coverage.spatial | 8 | en |
dc.date.accessioned | 2006-05-11T13:09:18Z | |
dc.date.available | 2006-05-11T13:09:18Z | |
dc.date.issued | 2000 | |
dc.identifier.citation | NIST Special Publication 500-246: The Eighth Text REtrieval Conference (TREC 8), held in Gaithersburg, Maryland, November 17-19, 1999. pp.699- | |
dc.identifier.other | stock number SN003-003-03664-1 | |
dc.identifier.uri | http://trec.nist.gov/pubs/trec8/papers/shef-proc-trec8.pdf | |
dc.identifier.uri | http://hdl.handle.net/1842/980 | |
dc.description.abstract | This paper describes the participation of the THISL group at the TREC-8 Spoken Document Retrieval (SDR) track. The THISL SDR system consists of the realtime version of the ABBOT large vocabulary speech recognition system and the THISLIR text retrieval system. The TREC-8 evaluation assessed SDR performance on a corpus of 500 hours of broadcast news material collected over a five month period. The main test condition involved retrieval of stories defined by manual segmentation of the corpus in which non-news material, such as commercials, were excluded. An optional test condition required required retrieval of the same stories
from the unsegmented audio stream. The THISL SDR system participated at both test conditions. The results show that a system such as THISL can produce respectable information retrieval performance on a realistically-sized corpus of unsegmented audio material. | en |
dc.format.extent | 65022 bytes | en |
dc.format.mimetype | application/pdf | en |
dc.language.iso | en | |
dc.publisher | Department of Commerce, National Institute of Standards and Technology | en |
dc.title | The THISL SDR system at TREC-8 | en |
dc.type | Conference Paper | en |