Institutional Repository [SANDBOX]
Technical University of Crete

On the locality of action domination in sequential decision making

Rachelson, Emmanuel; Lagoudakis, Michael

URI	http://purl.tuc.gr/dl/dias/E0292307-A486-42F6-A1D4-8BF6498753E2	-
Identifier	http://www.researchgate.net/profile/Emmanuel_Rachelson/publication/221186156_On_the_locality_of_action_domination_in_sequential_decision_making/links/0fcfd5051c4eaad94f000000.pdf	-
Language	en	-
Extent	8 pages	en
Title	On the locality of action domination in sequential decision making	en
Creator	Rachelson, Emmanuel	en
Creator	Lagoudakis Michael	en
Creator	Λαγουδακης Μιχαηλ	el
Content Summary	In the field of sequential decision making and reinforcement learning, it has been observed that good policies for most problems exhibit a significant amount of structure. In practice, this implies that when a learning agent discovers that an action is better than any other in a given state, this action also dominates in a certain neighbourhood around that state. This paper presents new results proving that this notion of locality in action domination can be linked to the smoothness of the environment’s underlying stochastic model. Namely, we link the Lipschitz continuity of a Markov Decision Process to the Lipschitz continuity of its policies’ value functions and introduce the key concept of influence radius to describe the neighbourhood of states where the dominating action is guaranteed to be constant. These ideas are directly exploited in the proposed Localized Policy Iteration (LPI) algorithm, which is an active learning version of Rollout-based Policy Iteration. Preliminary results on the Inverted Pendulum domain demonstrate the viability and the potential of the proposed approach.	en
Type of Item	Πλήρης Δημοσίευση σε Συνέδριο	el
Type of Item	Conference Full Paper	en
License	http://creativecommons.org/licenses/by/4.0/	en
Date of Item	2015-11-13	-
Date of Publication	2010	-
Subject	Artificial Intelligence	en
Bibliographic Citation	E. Rachelson and Michail G. Lagoudakis. (2010, Jan.). On the locality of action domination in sequential decision making. Presented at 11th International Symposium on Artificial Intelligence and Mathematics (ISAIM). [Online]. Available: http://www.researchgate.net/profile/Emmanuel_Rachelson/publication/221186156_On_the_locality_of_action_domination_in_sequential_decision_making/links/0fcfd5051c4eaad94f000000.pdf	en

Copyright © DIAS 2013