Speaker adaptation using constrained estimation of Gaussian mixtures

Digalakis Vasilis, Rtischev D., Neumeyer Leonardo

URI	http://purl.tuc.gr/dl/dias/00ED0C02-D3BE-44AA-B964-C0F7FE113B99	-
Αναγνωριστικό	http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=466659&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel4%2F89%2F9789%2F00466659.pdf%3Farnumber%3D466659	-
Αναγνωριστικό	https://doi.org/10.1109/89.466659	-
Γλώσσα	en	-
Μέγεθος	10 pages	en
Τίτλος	Speaker adaptation using constrained estimation of Gaussian mixtures	en
Δημιουργός	Digalakis Vasilis	en
Δημιουργός	Διγαλακης Βασιλης	el
Δημιουργός	Rtischev D.	en
Δημιουργός	Neumeyer Leonardo	en
Εκδότης	Institute of Electrical and Electronics Engineers	en
Περίληψη	A trend in automatic speech recognition systems is the use of continuous mixture-density hidden Markov models (HMMs). Despite the good recognition performance that these systems achieve on average in large vocabulary applications, there is a large variability in performance across speakers. Performance degrades dramatically when the user is radically different from the training population. A popular technique that can improve the performance and robustness of a speech recognition system is adapting speech models to the speaker, and more generally to the channel and the task. In continuous mixture-density HMMs the number of component densities is typically very large, and it may not be feasible to acquire a sufficient amount of adaptation data for robust maximum-likelihood estimates. To solve this problem, the authors propose a constrained estimation technique for Gaussian mixture densities. The algorithm is evaluated on the large-vocabulary Wall Street Journal corpus for both native and nonnative speakers of American English. For nonnative speakers, the recognition error rate is approximately halved with only a small amount of adaptation data, and it approaches the speaker-independent accuracy achieved for native speakers. For native speakers, the recognition performance after adaptation improves to the accuracy of speaker-dependent systems that use six times as much training data	en
Τύπος	Peer-Reviewed Journal Publication	en
Τύπος	Δημοσίευση σε Περιοδικό με Κριτές	el
Άδεια Χρήσης	http://creativecommons.org/licenses/by/4.0/	en
Ημερομηνία	2015-11-02	-
Ημερομηνία Δημοσίευσης	1995	-
Θεματική Κατηγορία	HMM	en
Θεματική Κατηγορία	Hidden Markov Models	en
Θεματική Κατηγορία	Speech recognition	en
Βιβλιογραφική Αναφορά	V. Digalakis, D. Rtischev and L. Neumeyer, "Speaker adaptation using constrained estimation of Gaussian mixtures," IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 357-366, Sep. 1995. doi:10.1109/89.466659	en

Αναζήτηση

Πλοήγηση

Ο Χώρος μου

Speaker adaptation using constrained estimation of Gaussian mixtures

Digalakis Vasilis, Rtischev D., Neumeyer Leonardo

Υπηρεσίες

Εξαγωγή

Κοινοποίηση

Στατιστικά

Μεταδεδομένων & Περιεχομένου σε METS:

Μεταδεδομένων σε Μορφότυπο: