URI | http://purl.tuc.gr/dl/dias/FBB6EA9E-B181-4D39-8F6C-4DDB3B0278DA | - |
Identifier | http://machinelearning.wustl.edu/mlpapers/paper_files/CN15.pdf | - |
Language | en | - |
Extent | 8 pages | en |
Title | Learning in zero–sum team Markov games using factored value functions | en |
Creator | Lagoudakis Michael | en |
Creator | Λαγουδακης Μιχαηλ | el |
Creator | Parr, R. | en |
Content Summary | We present a new method for learning good strategies in zero-sum
Markov games in which each side is composed of multiple agents collaborating
against an opposing team of agents. Our method requires full
observability and communication during learning, but the learned policies
can be executed in a distributed manner. The value function is represented
as a factored linear architecture and its structure determines the
necessary computational resources and communication bandwidth. This
approach permits a tradeoff between simple representations with little or
no communication between agents and complex, computationally intensive
representations with extensive coordination between agents. Thus,
we provide a principled means of using approximation to combat the
exponential blowup in the joint action space of the participants. The approach
is demonstrated with an example that shows the efficiency gains
over naive enumeration.
| en |
Type of Item | Πλήρης Δημοσίευση σε Συνέδριο | el |
Type of Item | Conference Full Paper | en |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date of Item | 2015-11-13 | - |
Date of Publication | 2002 | - |
Subject | HMMs (Hidden Markov models) | en |
Subject | hidden markov models | en |
Subject | hmms hidden markov models | en |
Bibliographic Citation | M.G. Lagoudakis and R.Parr. (2002, Dec.).Learning in zero–sum team Markov games using factored value functions. [Online]. Available: http://machinelearning.wustl.edu/mlpapers/paper_files/CN15.pdf | en |