Institutional Repository
Technical University of Crete

Value function approximation in zero-sum Markov games

Lagoudakis, Michael; Parr, R.

Simple record


URI: http://purl.tuc.gr/dl/dias/2F95F669-B215-44BD-90AF-6176BD490AA9
Identifier: http://arxiv.org/ftp/arxiv/papers/1301/1301.0580.pdf
Language: en
Extent: 10 pages
Title: Value function approximation in zero-sum Markov games
Creator: Lagoudakis, Michael
Creator: Λαγουδακης Μιχαηλ
Creator: Parr, R.
Content Summary: This paper investigates value function approximation in the context of zero-sum Markov games, which can be viewed as a generalization of the Markov decision process (MDP) framework to the two-agent case. We generalize error bounds from MDPs to Markov games and describe generalizations of reinforcement learning algorithms to Markov games. We present a generalization of the optimal stopping problem to a two-player simultaneous-move Markov game. For this special problem, we provide stronger bounds and can guarantee convergence for LSTD and temporal difference learning with linear value function approximation. We demonstrate the viability of value function approximation for Markov games by using the least-squares policy iteration (LSPI) algorithm to learn good policies for a soccer domain and a flow control problem.
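
As context for the summary above (this is not the authors' code): in the zero-sum Markov-game setting, the greedy step of algorithms such as LSPI replaces the usual maximization over actions with the minimax value of a one-stage matrix game, which is typically computed by linear programming. The following minimal Python sketch shows that core step; the function name matrix_game_value and the toy payoff matrix are illustrative assumptions, and scipy is assumed to be available.

# Minimal sketch (not the authors' implementation) of the minimax step used
# in Markov-game generalizations of value iteration / LSPI: compute the value
# of the one-stage zero-sum matrix game with payoff matrix Q via an LP.
import numpy as np
from scipy.optimize import linprog

def matrix_game_value(Q):
    """Value and maximizer's mixed strategy for the zero-sum matrix game Q,
    where Q[a, o] is the maximizer's payoff for action pair (a, o).
    Solves max_pi min_o sum_a pi[a] * Q[a, o] as a linear program."""
    n_a, n_o = Q.shape
    # Decision variables: x = [pi_1, ..., pi_{n_a}, v]; linprog minimizes,
    # so we minimize -v to maximize the game value v.
    c = np.zeros(n_a + 1)
    c[-1] = -1.0
    # For every opponent action o: v - sum_a pi[a] * Q[a, o] <= 0.
    A_ub = np.hstack([-Q.T, np.ones((n_o, 1))])
    b_ub = np.zeros(n_o)
    # The strategy pi must be a probability distribution: sum_a pi[a] = 1.
    A_eq = np.hstack([np.ones((1, n_a)), np.zeros((1, 1))])
    b_eq = np.array([1.0])
    bounds = [(0.0, None)] * n_a + [(None, None)]  # pi >= 0, v free
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq, bounds=bounds)
    return res.x[-1], res.x[:n_a]

# Toy check: matching pennies has value 0 and the uniform mixed strategy.
Q = np.array([[1.0, -1.0], [-1.0, 1.0]])
print(matrix_game_value(Q))  # ~ (0.0, [0.5, 0.5])

In an LSPI-style policy iteration for Markov games, a solve of this kind would be performed per state over the approximate Q-values to obtain the minimax policy used in the next evaluation step.
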
Type of Item: Conference Full Paper
License: http://creativecommons.org/licenses/by/4.0/
Date of Item: 2015-11-13
Date of Publication: 2002
Subject: Artificial Intelligence
Bibliographic Citation: M. G. Lagoudakis and R. Parr. (2002, Aug.). Value function approximation in zero-sum Markov games. [Online]. Available: http://arxiv.org/ftp/arxiv/papers/1301/1301.0580.pdf
