URI | http://purl.tuc.gr/dl/dias/E0292307-A486-42F6-A1D4-8BF6498753E2 | - |
Identifier | http://www.researchgate.net/profile/Emmanuel_Rachelson/publication/221186156_On_the_locality_of_action_domination_in_sequential_decision_making/links/0fcfd5051c4eaad94f000000.pdf | - |
Language | en | - |
Extent | 8 pages | en |
Title | On the locality of action domination in sequential decision making | en |
Creator | Rachelson, Emmanuel | en |
Creator | Lagoudakis Michael | en |
Creator | Λαγουδακης Μιχαηλ | el |
Content Summary | In the field of sequential decision making and reinforcement
learning, it has been observed that good policies for most
problems exhibit a significant amount of structure. In practice,
this implies that when a learning agent discovers an action
is better than any other in a given state, this action actually
happens to also dominate in a certain neighbourhood
around that state. This paper presents new results proving
that this notion of locality in action domination can be linked
to the smoothness of the environment’s underlying stochastic
model. Namely, we link the Lipschitz continuity of a Markov
Decision Process to the Lispchitz continuity of its policies’
value functions and introduce the key concept of influence radius
to describe the neighbourhood of states where the dominating
action is guaranteed to be constant. These ideas are
directly exploited into the proposed Localized Policy Iteration
(LPI) algorithm, which is an active learning version of
Rollout-based Policy Iteration. Preliminary results on the Inverted
Pendulum domain demonstrate the viability and the
potential of the proposed approach. | en |
Type of Item | Πλήρης Δημοσίευση σε Συνέδριο | el |
Type of Item | Conference Full Paper | en |
License | http://creativecommons.org/licenses/by/4.0/ | en |
Date of Item | 2015-11-13 | - |
Date of Publication | 2010 | - |
Subject | Artificial Intelligence | en |
Bibliographic Citation | E. Rachelson and Michail G. Lagoudakis. (2010, Jan.). On the locality of action domination in sequential decision making. Presented at 11th International Symposium on Artificial Intelligence and Mathematics (ISAIM). [Online]. Available: http://www.researchgate.net/profile/Emmanuel_Rachelson/publication/221186156_On_the_locality_of_action_domination_in_sequential_decision_making/links/0fcfd5051c4eaad94f000000.pdf | en |