Institutional Repository [SANDBOX]
Technical University of Crete

Coordinated reinforcement learning

Lagoudakis, M. G., Guestrin, C., Parr, R.


URI: http://purl.tuc.gr/dl/dias/15CFBFB5-CCCD-4BC0-ABAF-DAAE65C69CBD
Year: 2002
Type of Item: Conference Full Paper
Bibliographic Citation: C. Guestrin, M. G. Lagoudakis, and R. Parr. (2002, July). Coordinated reinforcement learning. [Online]. Available: http://www.cs.berkeley.edu/~russell/classes/cs294/f05/papers/guestrin+al-2002.pdf
Summary

We present several new algorithms for multiagent reinforcement learning. A common feature of these algorithms is a parameterized, structured representation of a policy or value function. This structure is leveraged in an approach we call coordinated reinforcement learning, by which agents coordinate both their action selection activities and their parameter updates. Within the limits of our parametric representations, the agents will determine a jointly optimal action without explicitly considering every possible action in their exponentially large joint action space. Our methods differ from many previous reinforcement learning approaches to multiagent coordination in that structured communication and coordination between agents appears at the core of both the learning algorithm and the execution architecture. Our experimental results, comparing our approach to other RL methods, illustrate both the quality of the policies obtained and the additional benefits of coordination.
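
To illustrate how a jointly optimal action can be found without enumerating the joint action space, the following is a minimal sketch, not the authors' implementation. It assumes the joint value function decomposes into a sum of local terms, each depending on a few agents, and it uses max-sum variable elimination as the coordination step; the summary itself does not name a specific procedure. The function name `eliminate_max`, the factor tables, and the elimination order are all hypothetical.

```python
from itertools import product


def eliminate_max(factors, order, n_actions):
    """Maximise a sum of local factors by variable elimination (illustrative).

    factors:   list of (scope, table); scope is a tuple of agent ids and
               table maps a tuple of their actions to a local value.
    order:     elimination order that covers every agent.
    n_actions: number of actions per agent (assumed equal for all agents).
    Returns (best_value, joint_action) with joint_action as {agent: action}.
    """
    factors = list(factors)
    best_response = []  # (agent, neighbour scope, {neighbour actions: best action})
    for agent in order:
        involved = [f for f in factors if agent in f[0]]
        factors = [f for f in factors if agent not in f[0]]
        # Agents that share at least one local term with the eliminated agent.
        scope = tuple(sorted({i for s, _ in involved for i in s if i != agent}))
        table, choice = {}, {}
        for context in product(range(n_actions), repeat=len(scope)):
            assignment = dict(zip(scope, context))
            values = []
            for a in range(n_actions):
                assignment[agent] = a
                values.append(
                    sum(t[tuple(assignment[i] for i in s)] for s, t in involved)
                )
            best = max(range(n_actions), key=values.__getitem__)
            table[context] = values[best]
            choice[context] = best
        # The maximised-out agent is replaced by a new term over its neighbours.
        factors.append((scope, table))
        best_response.append((agent, scope, choice))
    # Every remaining factor now has an empty scope.
    value = sum(t[()] for _, t in factors)
    # Recover the maximising actions in reverse elimination order.
    joint = {}
    for agent, scope, choice in reversed(best_response):
        joint[agent] = choice[tuple(joint[i] for i in scope)]
    return value, joint


# Hypothetical three-agent chain: agents 0-1 and 1-2 each share one local term.
factors = [
    ((0, 1), {(0, 0): 1, (0, 1): 0, (1, 0): 0, (1, 1): 2}),
    ((1, 2), {(0, 0): 3, (0, 1): 0, (1, 0): 0, (1, 1): 1}),
]
value, joint = eliminate_max(factors, order=[0, 1, 2], n_actions=2)
print(value, sorted(joint.items()))  # 4 [(0, 0), (1, 0), (2, 0)]
```

In this sketch each agent only reasons over the actions of the agents it shares a local term with, so the cost grows with the size of the largest intermediate scope rather than with the full joint action space.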
