The AMI System for the Transcription of Speech in Meetings
View/ Open
Date
2007Author
Hain, Thomas
Burget, Lukas
Dines, John
Garau, Giulia
Wan, Vincent
Karafiat, Martin
Vepa, Jithendra
Lincoln, Michael
Metadata
Abstract
This paper describes the AMI transcription system for speech in
meetings developed in collaboration by five research groups. The
system includes generic techniques such as discriminative and speaker
adaptive training, vocal tract length normalisation, heteroscedastic
linear discriminant analysis, maximum likelihood linear regression,
and phone posterior based features, as well as techniques specifically
designed for meeting data. These include segmentation and
cross-talk suppression, beam-forming, domain adaptation, web-data
collection, and channel adaptive training. The system was improved
by more than 20% relative in word error rate compared to our previous
system and was used in the NIST RT’06 evaluations where it was
found to yield competitive performance.