URI | http://purl.tuc.gr/dl/dias/C60F51E7-5E0D-416D-9C68-02F155B96CF0 | - |
Αναγνωριστικό | http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=743698&url=http%3A%2F%2Fieeexplore.ieee.org%2Fiel4%2F49%2F16030%2F00743698.pdf%3Farnumber%3D743698 | - |
Αναγνωριστικό | https://doi.org/10.1109/49.743698 | - |
Γλώσσα | en | - |
Μέγεθος | 9 pages | en |
Τίτλος | Quantization of cepstral parameters for speech recognition over the World Wide Web | en |
Δημιουργός | Digalakis Vasilis | en |
Δημιουργός | Διγαλακης Βασιλης | el |
Δημιουργός | Neumeyer Leonardo | en |
Δημιουργός | Perakakis M. | en |
Εκδότης | Institute of Electrical and Electronics Engineers | en |
Περίληψη | We examine alternative architectures for a client-server model of speech-enabled applications over the World Wide Web (WWW). We compare a server-only processing model where the client encodes and transmits the speech signal to the server, to a model where the recognition front end runs locally at the client and encodes and transmits the cepstral coefficients to the recognition server over the Internet. We follow a novel encoding paradigm, trying to maximize recognition performance instead of perceptual reproduction, and we find that by transmitting the cepstral coefficients we can achieve significantly higher recognition performance at a fraction of the bit rate required when encoding the speech signal directly. We find that the required bit rate to achieve the recognition performance of high-quality unquantized speech is just 2000 bits per second | en |
Τύπος | Peer-Reviewed Journal Publication | en |
Τύπος | Δημοσίευση σε Περιοδικό με Κριτές | el |
Άδεια Χρήσης | http://creativecommons.org/licenses/by/4.0/ | en |
Ημερομηνία | 2015-11-02 | - |
Ημερομηνία Δημοσίευσης | 1999 | - |
Θεματική Κατηγορία | Speech recognition | en |
Βιβλιογραφική Αναφορά | V. Digalakis, L. Neumeyer and M. Perakakis, "Quantization of cepstral parameters for speech recognition over the World Wide Web," IEEE J. Sel. Areas Commun., vol. 17, no. 1, pp. 82-90, Jan. 1999. doi:0.1109/49.743698 | en |