Edinburgh Research Archive

A Cascaded Broadcast News Highlighter

dc.contributor.author
Christensen, Heidi
en
dc.contributor.author
Gotoh, Yoshihiko
en
dc.contributor.author
Renals, Steve
en
dc.date.accessioned
2010-10-12T08:57:19Z
dc.date.available
2010-10-12T08:57:19Z
dc.date.issued
2008
dc.date.updated
2010-10-12T08:57:20Z
dc.description.abstract
This paper presents a fully automatic news skimming system which takes a broadcast news audio stream and provides the user with the segmented, structured and highlighted transcript. This constitutes a system with three different, cascading stages: converting the audio stream to text using an automatic speech recogniser, segmenting into utterances and stories and finally determining which utterance should be highlighted using a saliency score. Each stage must operate on the erroneous output from the previous stage in the system; an effect which is naturally amplified as the data progresses through the processing stages. We present a large corpus of transcribed broadcast news data enabling us to investigate to which degree information worth highlighting survives this cascading of processes. Both extrinsic and intrinsic experimental results indicate that mistakes in the story boundary detection has a strong impact on the quality of highlights, whereas erroneous utterance boundaries cause only minor problems. Further, the difference in transcription quality does not affect the overall performance greatly.
en
dc.extent.pageNumbers
151--161
en
dc.identifier.doi
10.1109/TASL.2007.910746
dc.identifier.eIssn
1558-7916
dc.identifier.uri
http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4407525&arnumber=4383075&count=28&index=16
dc.identifier.uri
http://hdl.handle.net/1842/3906
dc.publisher
IEEE
en
dc.title
A Cascaded Broadcast News Highlighter
en
dc.type
Article
en
rps.issue
1
en
rps.title
IEEE Transactions on Audio, Speech and Language Processing
en
rps.volume
16
en

Files

Original bundle

Now showing 1 - 1 of 1
Name:
christensen-tasl08.pdf
Size:
523.72 KB
Format:
Adobe Portable Document Format

This item appears in the following Collection(s)