URI | http://purl.tuc.gr/dl/dias/FBB6EA9E-B181-4D39-8F6C-4DDB3B0278DA | - |
Αναγνωριστικό | http://machinelearning.wustl.edu/mlpapers/paper_files/CN15.pdf | - |
Γλώσσα | en | - |
Μέγεθος | 8 pages | en |
Τίτλος | Learning in zero–sum team Markov games using factored value functions | en |
Δημιουργός | Lagoudakis Michael | en |
Δημιουργός | Λαγουδακης Μιχαηλ | el |
Δημιουργός | Parr, R. | en |
Περίληψη | We present a new method for learning good strategies in zero-sum
Markov games in which each side is composed of multiple agents collaborating
against an opposing team of agents. Our method requires full
observability and communication during learning, but the learned policies
can be executed in a distributed manner. The value function is represented
as a factored linear architecture and its structure determines the
necessary computational resources and communication bandwidth. This
approach permits a tradeoff between simple representations with little or
no communication between agents and complex, computationally intensive
representations with extensive coordination between agents. Thus,
we provide a principled means of using approximation to combat the
exponential blowup in the joint action space of the participants. The approach
is demonstrated with an example that shows the efficiency gains
over naive enumeration.
| en |
Τύπος | Πλήρης Δημοσίευση σε Συνέδριο | el |
Τύπος | Conference Full Paper | en |
Άδεια Χρήσης | http://creativecommons.org/licenses/by/4.0/ | en |
Ημερομηνία | 2015-11-13 | - |
Ημερομηνία Δημοσίευσης | 2002 | - |
Θεματική Κατηγορία | HMMs (Hidden Markov models) | en |
Θεματική Κατηγορία | hidden markov models | en |
Θεματική Κατηγορία | hmms hidden markov models | en |
Βιβλιογραφική Αναφορά | M.G. Lagoudakis and R.Parr. (2002, Dec.).Learning in zero–sum team Markov games using factored value functions. [Online]. Available: http://machinelearning.wustl.edu/mlpapers/paper_files/CN15.pdf | en |