Approximate policy iteration using large-margin classifiers

URI	http://purl.tuc.gr/dl/dias/B95FD666-3683-44DB-8681-8CB3C2DFEC7B	-
Γλώσσα	en	-
Μέγεθος	3 pages	en
Τίτλος	Approximate policy iteration using large-margin classifiers	en
Δημιουργός	Lagoudakis Michael	en
Δημιουργός	Λαγουδακης Μιχαηλ	el
Περίληψη	We present an approximate policy iteration algorithm that uses rollouts to estimate the value of each action under a given policy in a subset of states and a classifier to generalize and learn the improved policy over the entire state space. Using a multiclass support vector machine as the classifier, we obtained successful results on the inverted pendulum and the bicycle balancing and riding domains.	en
Τύπος	Πλήρης Δημοσίευση σε Συνέδριο	el
Τύπος	Conference Full Paper	en
Άδεια Χρήσης	http://creativecommons.org/licenses/by/4.0/	en
Ημερομηνία	2015-11-13	-
Ημερομηνία Δημοσίευσης	2003	-
Θεματική Κατηγορία	Artificial Intelligence	en
Βιβλιογραφική Αναφορά	M.G. Lagoudakis and R. Parr, “Approximate policy iteration using large-margin classifiers,” in Proceedings of the 18th International Joint Conference on Artificial Intelligence (IJCAI), 2003, pp. 1432–1434.	en

Εξαγωγή

Μεταδεδομένων & Περιεχομένου σε METS:

Μεταδεδομένων σε Μορφότυπο: