Edinburgh Research Archive

Applying Vocal Tract Length Normalization to Meeting Recordings

dc.contributor.author
Garau, Giulia
en
dc.contributor.author
Renals, Steve
en
dc.contributor.author
Hain, Thomas
en
dc.date.accessioned
2006-05-08T16:33:38Z
dc.date.available
2006-05-08T16:33:38Z
dc.date.issued
2005
dc.description.abstract
Vocal Tract Length Normalisation (VTLN) is a commonly used technique to normalise for inter-speaker variability. It is based on the speaker-specific warping of the frequency axis, parameterised by a scalar warp factor. This factor is typically estimated using maximum likelihood. We discuss how VTLN may be applied to multiparty conversations, reporting a substantial decrease in word error rate in experiments using the ICSI meetings corpus. We investigate the behaviour of the VTLN warping factor and show that a stable estimate is not obtained. Instead it appears to be influenced by the context of the meeting, in particular the current conversational partner. These results are consistent with predictions made by the psycholinguistic interactive alignment account of dialogue, when applied at the acoustic and phonological levels.
en
dc.format.extent
87732 bytes
en
dc.format.mimetype
application/pdf
en
dc.identifier.citation
In Proceedings, Interspeech'2005 - Eurospeech, 9th European Conference on Speech Communication and Technology, Lisbon, Portugal, September 4-8, 2005
dc.identifier.uri
http://www.isca-speech.org/archive/interspeech_2005
dc.identifier.uri
http://hdl.handle.net/1842/918
dc.language.iso
en
dc.publisher
International Speech Communication Association
en
dc.subject
Vocal Tract Length Normalisation
en
dc.subject
inter-speaker variability
en
dc.title
Applying Vocal Tract Length Normalization to Meeting Recordings
en
dc.type
Conference Paper
en

Files

Original bundle

Now showing 1 - 1 of 1
Name:
giuliagarau_eurospeech05.pdf
Size:
85.68 KB
Format:
Adobe Portable Document Format

This item appears in the following Collection(s)