Quantization of cepstral parameters for speech recognition over the World Wide Web

Digalakis Vasilis, Neumeyer Leonardo, Perakakis M.

Full Record

URI:

http://purl.tuc.gr/dl/dias/C60F51E7-5E0D-416D-9C68-02F155B96CF0

Year

1999

Type

Peer-Reviewed Journal Publication

Bibliographic Citation

V. Digalakis, L. Neumeyer and M. Perakakis, "Quantization of cepstral parameters for speech recognition over the World Wide Web," IEEE J. Sel. Areas Commun., vol. 17, no. 1, pp. 82-90, Jan. 1999. doi:10.1109/49.743698 https://doi.org/10.1109/49.743698

Appears in Collections

Journal Publications in the Community School of Mineral Resources Engineering

Abstract

We examine alternative architectures for a client-server model of speech-enabled applications over the World Wide Web (WWW). We compare a server-only processing model, in which the client encodes and transmits the speech signal to the server, with a model in which the recognition front end runs locally at the client and encodes and transmits the cepstral coefficients to the recognition server over the Internet. We follow a novel encoding paradigm, trying to maximize recognition performance instead of perceptual reproduction, and we find that by transmitting the cepstral coefficients we can achieve significantly higher recognition performance at a fraction of the bit rate required when encoding the speech signal directly. We find that the bit rate required to achieve the recognition performance of high-quality unquantized speech is just 2000 bits per second.
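To make the 2000 bits per second figure concrete, the following is a minimal Python sketch of client-side quantization of one cepstral vector per frame, together with the resulting bit-rate arithmetic. The frame rate (100 frames per second), the number of coefficients, the bit allocation, and the value range are all illustrative assumptions and do not come from the abstract. The uniform scalar quantizer is a simple stand-in and should not be read as the encoding scheme used in the paper.

```python
# Illustrative sketch only. The frame rate, coefficient count, bit allocation
# and value range are assumptions, not values taken from the paper.

FRAMES_PER_SECOND = 100  # assumed: one cepstral vector every 10 ms


def bit_rate(bits_per_coeff, frames_per_second=FRAMES_PER_SECOND):
    """Bits per second needed to send one quantized vector per frame."""
    return sum(bits_per_coeff) * frames_per_second


def quantize(value, bits, lo, hi):
    """Uniform scalar quantizer: map a value in [lo, hi] to an integer index."""
    levels = 2 ** bits
    clipped = min(max(value, lo), hi)  # out-of-range values saturate
    index = int((clipped - lo) / (hi - lo) * levels)
    return min(index, levels - 1)  # value == hi falls in the top cell


def dequantize(index, bits, lo, hi):
    """Reconstruct the midpoint of the quantization cell for an index."""
    step = (hi - lo) / 2 ** bits
    return lo + (index + 0.5) * step


def encode_frame(cepstra, bits_per_coeff, lo=-1.0, hi=1.0):
    """Client side: turn one cepstral vector into a list of small integers."""
    return [quantize(c, b, lo, hi) for c, b in zip(cepstra, bits_per_coeff)]


def decode_frame(indices, bits_per_coeff, lo=-1.0, hi=1.0):
    """Server side: recover approximate cepstral values from the indices."""
    return [dequantize(i, b, lo, hi) for i, b in zip(indices, bits_per_coeff)]


if __name__ == "__main__":
    allocation = [2] * 10  # hypothetical: 10 coefficients, 2 bits each
    frame = [0.0, 0.3, -0.7, 1.0, -1.0, 0.1, 0.5, -0.2, 0.9, -0.4]
    indices = encode_frame(frame, allocation)
    print("bits per frame:", sum(allocation))
    print("bit rate (bit/s):", bit_rate(allocation))
    print("indices:", indices)
    print("decoded:", decode_frame(indices, allocation))
```

Under these assumptions, 20 bits per frame at 100 frames per second gives 2000 bits per second, which shows how small the per-frame budget is at that rate.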
