Estimation of general identifiable linear dynamic models with an application in speech recognition

Tsontzos Georgios , Diakoloukas Vasilis, Koniaris, Christos, 1979-, Digalakis Vasilis

Full record

URI:

http://purl.tuc.gr/dl/dias/DD5D0269-6830-4FB8-8778-977FC068EAEE

Year

2007

Type of Item

Conference Publication

License

Details

Bibliographic Citation

G. Tsontzos, V. Diakoloukas, C. Koniaris and V. Digalakis, "Estimation of general identifiable linear dynamic models with an application in speech recognition," in 2007 IEEE Int. Conf. on Acoust., Speech and Sign. Process. (ICASSP) doi: 10.1109/ICASSP.2007.366947 https://doi.org/10.1109/ICASSP.2007.366947

Appears in Collections

Conference Publications in Community School of Electrical and Computer Engineering

Summary

Although hidden Markov models (HMMs) provide a relatively efficient modeling framework for speech recognition, they suffer from several shortcomings which set upper bounds in the performance that can be achieved. Alternatively, linear dynamic models (LDM) can be used to model speech segments. Several implementations of LDM have been proposed in the literature. However, all had a restricted structure to satisfy identifiability constraints. In this paper, we relax all these constraints and use a general, canonical form for a linear state-space system that guarantees identifiability for arbitrary state and observation vector dimensions. For this system, we present a novel, element-wise maximum likelihood (ML) estimation method. Classification experiments on the AURORA2 speech database show performance gains compared to HMMs, particularly on highly noisy conditions.

Search

Browse

My Space

Estimation of general identifiable linear dynamic models with an application in speech recognition

Tsontzos Georgios , Diakoloukas Vasilis, Koniaris, Christos, 1979-, Digalakis Vasilis

Summary

Services

Export

Share

Statistics

Metadata & Content in a METS Package:

Metadata in Format: