Ιδρυματικό Αποθετήριο [SANDBOX]
Πολυτεχνείο Κρήτης

EN | EL

Αναζήτηση

Πλοήγηση

Ο Χώρος μου

Είσοδος

Learning in zero–sum team Markov games using factored value functions

Lagoudakis Michael, Parr, R.

Απλή Εγγραφή

URI	http://purl.tuc.gr/dl/dias/FBB6EA9E-B181-4D39-8F6C-4DDB3B0278DA	-
Αναγνωριστικό	http://machinelearning.wustl.edu/mlpapers/paper_files/CN15.pdf	-
Γλώσσα	en	-
Μέγεθος	8 pages	en
Τίτλος	Learning in zero–sum team Markov games using factored value functions	en
Δημιουργός	Lagoudakis Michael	en
Δημιουργός	Λαγουδακης Μιχαηλ	el
Δημιουργός	Parr, R.	en
Περίληψη	We present a new method for learning good strategies in zero-sum Markov games in which each side is composed of multiple agents collaborating against an opposing team of agents. Our method requires full observability and communication during learning, but the learned policies can be executed in a distributed manner. The value function is represented as a factored linear architecture and its structure determines the necessary computational resources and communication bandwidth. This approach permits a tradeoff between simple representations with little or no communication between agents and complex, computationally intensive representations with extensive coordination between agents. Thus, we provide a principled means of using approximation to combat the exponential blowup in the joint action space of the participants. The approach is demonstrated with an example that shows the efficiency gains over naive enumeration.	en
Τύπος	Πλήρης Δημοσίευση σε Συνέδριο	el
Τύπος	Conference Full Paper	en
Άδεια Χρήσης	http://creativecommons.org/licenses/by/4.0/	en
Ημερομηνία	2015-11-13	-
Ημερομηνία Δημοσίευσης	2002	-
Θεματική Κατηγορία	HMMs (Hidden Markov models)	en
Θεματική Κατηγορία	hidden markov models	en
Θεματική Κατηγορία	hmms hidden markov models	en
Βιβλιογραφική Αναφορά	M.G. Lagoudakis and R.Parr. (2002, Dec.).Learning in zero–sum team Markov games using factored value functions. [Online]. Available: http://machinelearning.wustl.edu/mlpapers/paper_files/CN15.pdf	en

Υπηρεσίες

Στατιστικά

Copyright © DIAS 2013