Institutional Repository [SANDBOX]
Technical University of Crete

Reinforcement learning as classification: leveraging modern classifiers

Lagoudakis Michael, Parr, R.

Simple Record

URI	http://purl.tuc.gr/dl/dias/78C8B833-D841-436A-82B4-676C1B860269	-
Identifier	http://www.aaai.org/Papers/ICML/2003/ICML03-057.pdf	-
Language	en	-
Size	8 pages	en
Title	Reinforcement learning as classification: leveraging modern classifiers	en
Creator	Lagoudakis Michael	en
Creator	Λαγουδακης Μιχαηλ	el
Creator	Parr, R.	en
Abstract	The basic tools of machine learning appear in the inner loop of most reinforcement learning algorithms, typically in the form of Monte Carlo methods or function approximation techniques. To a large extent, however, current reinforcement learning algorithms draw upon machine learning techniques that are at least ten years old and, with a few exceptions, very little has been done to exploit recent advances in classification learning for the purposes of reinforcement learning. We use a variant of approximate policy iteration based on rollouts that allows us to use a pure classification learner, such as a support vector machine (SVM), in the inner loop of the algorithm. We argue that the use of SVMs, particularly in combination with the kernel trick, can make it easier to apply reinforcement learning as an "out-of-the-box" technique, without extensive feature engineering. Our approach opens the door to modern classification methods, but does not preclude the use of classical methods. We present experimental results in the pendulum balancing and bicycle riding domains using both SVMs and neural networks for classifiers.	en
Type	Πλήρης Δημοσίευση σε Συνέδριο	el
Type	Conference Full Paper	en
License	http://creativecommons.org/licenses/by/4.0/	en
Date	2015-11-13	-
Publication Date	2003	-
Subject	machine learning	en
Bibliographic Citation	M.G. Lagoudakis and R. Parr. (2003, Aug.). Reinforcement learning as classification: leveraging modern classifiers. [Online]. Available: http://www.aaai.org/Papers/ICML/2003/ICML03-057.pdf	en
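
The abstract describes approximate policy iteration in which rollouts label states with their best action and a classifier then represents the improved policy. The following is a minimal sketch of that idea, not the authors' implementation. Everything in it is an assumption made for illustration: the seven-state chain world, the function names (`step`, `rollout`, `fit_nearest_neighbour`, `improve`), the discount factor, and the horizon. A 1-nearest-neighbour classifier stands in for the SVM so that the example needs no external library.

```python
GAMMA = 0.9
N_STATES = 7          # hypothetical chain: states 0..6, state 6 is the terminal goal
ACTIONS = (-1, +1)    # move left / move right

def step(state, action):
    """Deterministic toy dynamics: reward 1.0 on reaching the goal, else 0.0."""
    nxt = min(max(state + action, 0), N_STATES - 1)
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done

def rollout(state, action, policy, horizon=20):
    """Discounted return of taking `action` in `state`, then following `policy`."""
    total, discount = 0.0, 1.0
    for _ in range(horizon):
        state, reward, done = step(state, action)
        total += discount * reward
        if done:
            break
        discount *= GAMMA
        action = policy(state)
    return total

def fit_nearest_neighbour(examples):
    """Stand-in classifier: predict the label of the closest training state."""
    def classify(state):
        return min(examples, key=lambda ex: abs(ex[0] - state))[1]
    return classify

def improve(policy):
    """One iteration: label states via rollouts, then fit a classifier to the labels."""
    examples = []
    for s in range(N_STATES - 1):          # non-terminal states only
        q = {a: rollout(s, a, policy) for a in ACTIONS}
        best = max(q, key=q.get)
        # Keep a state only if one action clearly dominates the others.
        if all(q[best] > q[a] + 1e-9 for a in ACTIONS if a != best):
            examples.append((s, best))
    return fit_nearest_neighbour(examples) if examples else policy

def initial_policy(state):
    """Deliberately poor starting policy: always move left."""
    return -1

policy = initial_policy
for _ in range(5):
    policy = improve(policy)

learned = [policy(s) for s in range(N_STATES - 1)]
```

In this toy setting `learned` holds the action chosen in each non-terminal state after five iterations. Swapping `fit_nearest_neighbour` for another classifier with the same state-to-action interface leaves the rest of the loop unchanged, which is the point the abstract makes about using a pure classification learner in the inner loop.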

Copyright © DIAS 2013