Discriminative training of language models

Fytopoulos Nikolaos

URI	http://purl.tuc.gr/dl/dias/9E3E6133-EC63-4B6A-B43A-BCDAB53B3774	-
Αναγνωριστικό	https://doi.org/10.26233/heallink.tuc.23008	-
Γλώσσα	en	-
Μέγεθος	54 pages	en
Τίτλος	Discriminative training of language models	en
Δημιουργός	Fytopoulos Nikolaos	en
Δημιουργός	Φυτοπουλος Νικολαος	el
Συντελεστής [Επιβλέπων Καθηγητής]	Digalakis Vasilis	en
Συντελεστής [Επιβλέπων Καθηγητής]	Διγαλακης Βασιλης	el
Συντελεστής [Μέλος Εξεταστικής Επιτροπής]	Lagoudakis Michael	en
Συντελεστής [Μέλος Εξεταστικής Επιτροπής]	Λαγουδακης Μιχαηλ	el
Συντελεστής [Μέλος Εξεταστικής Επιτροπής]	Diakoloukas Vasilis	en
Συντελεστής [Μέλος Εξεταστικής Επιτροπής]	Διακολουκας Βασιλeioς	el
Εκδότης	Technical University of Crete	en
Εκδότης	Πολυτεχνείο Κρήτης	el
Ακαδημαϊκή Μονάδα	Πολυτεχνείο Κρήτης::Σχολή Ηλεκτρονικών Μηχανικών και Μηχανικών Υπολογιστών	el
Περίληψη	The present thesis investigates the use of discriminative training on continuous Language Models. The main motivation for dealing with continuous language models was that by construction they overcome the limits of N-gram based models. N-gram models have been widely used in Language Modeling, but suffer from lack of generalizability and contain a very large number of parameters that are hard to adapt. Another flaw of N-gram models is the need for a large amount of training data, in order to cover as many N-grams as possible. Continuous Gaussian Mixture Language Models (GMLMs) for Speech Recognition have proven to be effective in terms of smoothing unseen events and adapting efficiently while using relatively small amount of data when compared to N-gram models. The training and testing data were extracted from the Wall Street Journal Corpus. Although the size of the vocabulary used in the corpus is large, the actual number of words being used in the present thesis is resticted. Data has the form of a continuous-space vector and consists of the history of each word in the corpus. The dimensions of these vectors were reduced by using SVD and LDA techniques. As far as the main objective of the thesis is concerned, attempts focus on improving the performance of GMLMs that have been previously trained by using the ML criterion on Language Models by adapting and using the Maximum Mutual Information(MMI) Estimation Method previously deployed in training HMMs for acoustic models. MMI acoustic models have proven to perform better than ML models, therefore MMI training gave a strong incentive in order to apply it on continuous language models. In addition, other discriminative criteria such as Minimum Phone Error (MPE) or Minimum Classification Error(MCE) are also theoretically investigated. Perplexity is the metric being used to measure the effectiveness of the presented method. The experiments of the thesis focus on testing MMI models that are smoothed with their correspondent baseline ML model and MMI models that are unsmoothed, with mixed results. The desired improvement is achieved in the case of unsmoothed MMI models against ML models.	en
Τύπος	Διπλωματική Εργασία	el
Τύπος	Diploma Work	en
Άδεια Χρήσης	http://creativecommons.org/licenses/by-nc/4.0/	en
Ημερομηνία	2014-10-22	-
Ημερομηνία Δημοσίευσης	2014	-
Θεματική Κατηγορία	Language modeling	en
Θεματική Κατηγορία	Pattern classification systems	en
Θεματική Κατηγορία	Pattern recognition computers	en
Θεματική Κατηγορία	pattern recognition systems	en
Θεματική Κατηγορία	pattern classification systems	en
Θεματική Κατηγορία	pattern recognition computers	en
Βιβλιογραφική Αναφορά	Νικόλαος Φυτόπουλος, "Discriminative training of language models", Διπλωματική Εργασία, Σχολή Ηλεκτρονικών Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Χανιά, Ελλάς, 2014	el
Βιβλιογραφική Αναφορά	Nikolaos Fytopoulos, "Discriminative training of language models", Diploma Work, Σχολή Ηλεκτρονικών Μηχανικών και Μηχανικών Υπολογιστών, Πολυτεχνείο Κρήτης, Chania, Greece, 2014	en

Αναζήτηση

Πλοήγηση

Ο Χώρος μου

Discriminative training of language models

Fytopoulos Nikolaos

Διαθέσιμα αρχεία

Υπηρεσίες

Εξαγωγή

Κοινοποίηση

Στατιστικά

Μεταδεδομένων & Περιεχομένου σε METS:

Μεταδεδομένων σε Μορφότυπο: