Linear dynamical models in speech synthesis

Tsiaras Vasileios, Ranniery Maia, Diakoloukas Vasilis, Stylianou, Yannis, Digalakis Vasilis

URI	http://purl.tuc.gr/dl/dias/42E91AFE-C479-4B0E-9797-C9300FB9FF56	-
Identifier	https://doi.org/10.1109/ICASSP.2014.6853606	-
Identifier	http://ieeexplore.ieee.org/document/6853606/	-
Language	en	-
Title	Linear dynamical models in speech synthesis	en
Creator	Tsiaras Vasileios	en
Creator	Τσιαρας Βασιλειος	el
Creator	Ranniery Maia	en
Creator	Diakoloukas Vasilis	en
Creator	Διακολουκας Βασιλειος	el
Creator	Stylianou, Yannis	en
Creator	Digalakis Vasilis	en
Creator	Διγαλακης Βασιλης	el
Publisher	Institute of Electrical and Electronics Engineers	en
Content Summary	Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech synthesis (TTS). HMMs provide an attractive acoustic modeling scheme which has been exhaustively investigated and developed for many years. Modern HMM-based speech synthesizers have approached the quality of the best state-of-the-art unit selection systems. However, we believe that statistical parametric speech synthesis has not reached its potential, since HMMs are limited by several assumptions which do not apply to the properties of speech. We, therefore, propose in this paper to use Linear Dynamical Models (LDMs) instead of HMMs. LDMs can better model the dynamics of speech and can produce a naturally smoother trajectory of the synthesized speech. We perform a series of experiments using different system configurations to check on the performance of LDMs for speech synthesis. We show that LDM-based synthesizers can outperform HMM-based ones in terms of cepstral distance and are a very promising acoustic modeling alternative for statistical parametric TTS.	en
Type of Item	Πλήρης Δημοσίευση σε Συνέδριο	el
Type of Item	Conference Full Paper	en
License	http://creativecommons.org/licenses/by/4.0/	en
Date of Item	2015-11-08	-
Date of Publication	2014	-
Subject	HMMs (Hidden Markov models)	en
Subject	hidden markov models	en
Subject	hmms hidden markov models	en
Bibliographic Citation	V. Tsiaras, R. Maia, V. Diakoloukas, Y. Stylianou and V. Digalakis, "Linear dynamical models in speech synthesis", in 2014 IEEE Int. Conf. on Acoust., Speech and Sign. Process. (ICASSP) doi: 10.1109/ICASSP.2014.6853606	en
