Edinburgh Research Archive

Speech and crosstalk detection in multi-channel audio

dc.contributor.author
Wrigley, Stuart N
en
dc.contributor.author
Brown, Guy J
en
dc.contributor.author
Wan, Vincent
en
dc.contributor.author
Renals, Steve
en
dc.date.accessioned
2006-05-09T14:36:57Z
dc.date.available
2006-05-09T14:36:57Z
dc.date.issued
2005
dc.description.abstract
The analysis of scenarios in which a number of microphones record the activity of speakers, such as in a roundtable meeting, presents a number of computational challenges. For example, if each participant wears a microphone, it can receive speech from both the microphone's wearer (local speech) and from other participants (crosstalk). The recorded audio can be broadly classified in four ways: local speech, crosstalk plus local speech, crosstalk alone and silence. We describe two experiments related to the automatic classification of audio into these four classes. The first experiment attempted to optimise a set of acoustic features for use with a Gaussian mixture model (GMM) classifier. A large set of potential acoustic features were considered, some of which have been employed in previous studies. The best-performing features were found to be kurtosis, fundamentalness and cross-correlation metrics. The second experiment used these features to train an ergodic hidden Markov model classifier. Tests performed on a large corpus of recorded meetings show classification accuracies of up to 96%, and automatic speech recognition performance close to that obtained using ground truth segmentation.
en
dc.format.extent
200352 bytes
en
dc.format.mimetype
application/pdf
en
dc.identifier.citation
IEEE Trans. on Speech and Audio Processing, 13:84-91, 2005.
dc.identifier.uri
http://hdl.handle.net/1842/949
dc.language.iso
en
dc.publisher
IEEE Signal Processing Society Press
en
dc.subject
speech
en
dc.subject
Gaussian mixture model
en
dc.subject
Markov model
en
dc.title
Speech and crosstalk detection in multi-channel audio
en
dc.type
Article
en

Files

Original bundle

Now showing 1 - 1 of 1
Name:
sap04-xtalk.pdf
Size:
195.66 KB
Format:
Adobe Portable Document Format

This item appears in the following Collection(s)