Institutional Repository [SANDBOX]
Technical University of Crete
EN  |  EL

Search

Browse

My Space

Algorithm selection using reinforcement learning

Lagoudakis Michael, Littman, M.

Simple record


URIhttp://purl.tuc.gr/dl/dias/75E77769-957E-4070-8DCA-33D273034342-
Identifierhttp://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.472.7494&rep=rep1&type=pdf-
Languageen-
Extent8 pagesen
TitleAlgorithm selection using reinforcement learningen
CreatorLagoudakis Michaelen
CreatorΛαγουδακης Μιχαηλel
CreatorLittman, M.en
Content SummaryMany computational problems can be solved by multiple algorithms, with different algorithms fastest for different problem sizes, input distributions, and hardware characteristics. We consider the problem of algorithm selection: dynamically choose an algorithm to attack an instance of a problem with the goal of minimizing the overall execution time. We formulate the problem as a kind of Markov decision process (MDP), and use ideas from reinforcement learning to solve it. This paper introduces a kind of MDP that models the algorithm selection problem by allowing multiple state transitions. The well known Q-learning algorithm is adapted for this case in a way that combines both Monte-Carlo and Temporal Difference methods. Also, this work uses, and extends in a way to control problems, the Least-Squares Temporal Difference algorithm (LSTD(0)) of Boyan. The experimental study focuses on the classic problems of order statistic selection and sorting. The encouraging results reveal the potential of applying learning methods to traditional computational problems.en
Type of ItemΠλήρης Δημοσίευση σε Συνέδριοel
Type of ItemConference Full Paperen
Licensehttp://creativecommons.org/licenses/by/4.0/en
Date of Item2015-11-14-
Date of Publication2000-
SubjectIntelligence, Computationalen
Subjectcomputational intelligenceen
Subjectintelligence computationalen
Bibliographic CitationMichail G. Lagoudakis and Michael L. Littman. (2000, June). Algorithm selection using reinforcement learning. [Online]. Available: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.472.7494&rep=rep1&type=pdfen

Services

Statistics