Το έργο με τίτλο From HMM's to segment models: a unified view of stochastic modeling for speech recognition από τον/τους δημιουργό/ούς Ostendorf M., Digalakis Vasilis, Kimball O. διατίθεται με την άδεια Creative Commons Αναφορά Δημιουργού 4.0 Διεθνές
Βιβλιογραφική Αναφορά
M. Ostendorf, V. Digalakis and O. Kimball, "From HMM's to segment models: a unified view of stochastic modeling for speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 360-378, Sep. 1996. doi:10.1109/89.536930
https://doi.org/10.1109/89.536930
Many alternative models have been proposed to address some of the shortcomings of the hidden Markov model (HMM), which is currently the most popular approach to speech recognition. In particular, a variety of models that could be broadly classified as segment models have been described for representing a variable-length sequence of observation vectors in speech recognition applications. Since there are many aspects in common between these approaches, including the general recognition and training problems, it is useful to consider them in a unified framework. The paper describes a general stochastic model that encompasses most of the models proposed in the literature, pointing out similarities of the models in terms of correlation and parameter tying assumptions, and drawing analogies between segment models and HMMs. In addition, we summarize experimental results assessing different modeling assumptions and point out remaining open questions