Recent development of the HMM-based speech synthesis system (HTS)
Proc. 2009 Asia-Pacific Signal and Information Processing Association (APSIPA)
View/ Open
Date
2009Author
Zen, Heiga
Oura, Keiichiro
Nose, Takashi
Yamagishi, Junichi
Sako, Shinji
Toda, Tomoki
Masuko, Takashi
Black, Alan W
Tokuda, Keiichi
Metadata
Abstract
A statistical parametric approach to speech synthesis based on hidden Markov models (HMMs) has grown in popularity over the last few years. In this approach, spectrum, excitation, and duration of speech are simultaneously modeled by context-dependent HMMs, and speech waveforms are generate from the HMMs themselves. Since December 2002, we have publicly released an open-source software toolkit named “HMM-based speech synthesis system (HTS)” to provide a research and development toolkit for statistical parametric speech synthesis. This paper describes recent developments of HTS in detail, as well as future release plans.