<efrbr:recordSet xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:efrbr="http://vfrbr.info/efrbr/1.1" xmlns:efrbr-work="http://vfrbr.info/efrbr/1.1/work" xmlns:efrbr-expression="http://vfrbr.info/efrbr/1.1/expression" xmlns:efrbr-manifestation="http://vfrbr.info/efrbr/1.1/manifestation" xmlns:efrbr-person="http://vfrbr.info/efrbr/1.1/person" xmlns:efrbr-corporateBody="http://vfrbr.info/efrbr/1.1/corporateBody" xmlns:efrbr-concept="http://vfrbr.info/efrbr/1.1/concept" xmlns:efrbr-structure="http://vfrbr.info/efrbr/1.1/structure" xmlns:efrbr-responsible="http://vfrbr.info/efrbr/1.1/responsible" xmlns:efrbr-subject="http://vfrbr.info/efrbr/1.1/subject" xmlns:efrbr-other="http://vfrbr.info/efrbr/1.1/other" xsi:schemaLocation="http://vfrbr.info/efrbr/1.1 http://vfrbr.info/schemas/1.1/efrbr.xsd"><efrbr:entities><efrbr-work:work identifier="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A"><efrbr-work:titleOfTheWork>Optimization of heuristic search using recursive algorithm selection and reinforcement learning</efrbr-work:titleOfTheWork></efrbr-work:work><efrbr-expression:expression identifier="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A"><efrbr-expression:titleOfTheExpression>Optimization of heuristic search using recursive algorithm selection and reinforcement learning</efrbr-expression:titleOfTheExpression><efrbr-expression:formOfExpression vocabulary="DIAS:TYPES">
            Peer-Reviewed Journal Publication
            Δημοσίευση σε Περιοδικό με Κριτές
         </efrbr-expression:formOfExpression><efrbr-expression:dateOfExpression type="issued">2015-10-27</efrbr-expression:dateOfExpression><efrbr-expression:dateOfExpression type="published">2010</efrbr-expression:dateOfExpression><efrbr-expression:languageOfExpression vocabulary="iso639-1">en</efrbr-expression:languageOfExpression><efrbr-expression:summarizationOfContent>The traditional approach to computational problem solving is to use one of the available algorithms to obtain solutions for all given instances of a problem. However, typically not all instances are the same, nor a single algorithm performs best on all instances. Our work investigates a more sophisticated approach to problem solving, called Recursive Algorithm Selection, whereby several algorithms for a problem (including some recursive ones) are available to an agent that makes an informed decision on which algorithm to select for handling each sub-instance of a problem at each recursive call made while solving an instance. Reinforcement learning methods are used for learning decision policies that optimize any given performance criterion (time, memory, or a combination thereof) from actual execution and profiling experience. This paper focuses on the well-known problem of state-space heuristic search and combines the A* and RBFS algorithms to yield a hybrid search algorithm, whose decision policy is learned using the Least-Squares Policy Iteration (LSPI) algorithm. Our benchmark problem domain involves shortest path finding problems in a real-world dataset encoding the entire street network of the District of Columbia (DC), USA. The derived hybrid algorithm exhibits better performance results than the individual algorithms in the majority of cases according to a variety of performance criteria balancing time and memory. It is noted that the proposed methodology is generic, can be applied to a variety of other problems, and requires no prior knowledge about the individual algorithms used or the properties of the underlying problem instances being solved.</efrbr-expression:summarizationOfContent><efrbr-expression:contextForTheExpression>Δημοσίευση σε επιστημονικό περιοδικό </efrbr-expression:contextForTheExpression><efrbr-expression:useRestrictionsOnTheExpression type="creative-commons">http://creativecommons.org/licenses/by/4.0/</efrbr-expression:useRestrictionsOnTheExpression><efrbr-expression:note type="journal name">Annals of Mathematics and Artificial Intelligence</efrbr-expression:note><efrbr-expression:note type="journal volume">1</efrbr-expression:note><efrbr-expression:note type="journal number">60</efrbr-expression:note><efrbr-expression:note type="page range">119-151</efrbr-expression:note></efrbr-expression:expression><efrbr-person:person identifier="8FE55CF9-9759-4BF4-84D8-BB11C58C040F"><efrbr-person:nameOfPerson vocabulary="">
             Vasilikos Vasileios
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~lagoudakis"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Lagoudakis Michael
            Λαγουδακης Μιχαηλ
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-corporateBody:corporateBody identifier="http://www.springerlink.com/?MUD=MP"><efrbr-corporateBody:nameOfTheCorporateBody vocabulary="S/R:PUBLISHERS">
            Springer Verlag
         </efrbr-corporateBody:nameOfTheCorporateBody></efrbr-corporateBody:corporateBody><efrbr-concept:concept identifier="39185D2B-31B5-4C5A-908D-56A10C8DD831"><efrbr-concept:termForTheConcept>
            Heuristic search 
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="05648625-6A6F-4CD7-96D0-3896A67FDCE7"><efrbr-concept:termForTheConcept>
            Algorithm selection
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="4964983A-C583-4A74-8CDD-A2D4A5D5E2F5"><efrbr-concept:termForTheConcept>
            Reinforcement learning 
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="DB269E06-9343-40A0-8AFF-777D46598FC6"><efrbr-concept:termForTheConcept>
            Software optimization 
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="7D1B7ACD-862A-42F4-B564-D482020FFA4A"><efrbr-concept:termForTheConcept>
            Hybrid algorithms
         </efrbr-concept:termForTheConcept></efrbr-concept:concept></efrbr:entities><efrbr:relationships><efrbr-structure:structureRelations><efrbr-structure:realizedThrough sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="expression" targetURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A"/></efrbr-structure:structureRelations><efrbr-responsible:responsibleRelations><efrbr-responsible:createdBy sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="person" targetURI="8FE55CF9-9759-4BF4-84D8-BB11C58C040F"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="person" targetURI="8FE55CF9-9759-4BF4-84D8-BB11C58C040F" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="person" targetURI="http://users.isc.tuc.gr/~lagoudakis" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="person" targetURI="http://www.springerlink.com/?MUD=MP" role="publisher"/></efrbr-responsible:responsibleRelations><efrbr-subject:subjectRelations><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="concept" targetURI="39185D2B-31B5-4C5A-908D-56A10C8DD831"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="concept" targetURI="05648625-6A6F-4CD7-96D0-3896A67FDCE7"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="concept" targetURI="4964983A-C583-4A74-8CDD-A2D4A5D5E2F5"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="concept" targetURI="DB269E06-9343-40A0-8AFF-777D46598FC6"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/A2EA3E49-6766-4040-9483-FAC140B6805A" targetEntity="concept" targetURI="7D1B7ACD-862A-42F4-B564-D482020FFA4A"/></efrbr-subject:subjectRelations><efrbr-other:otherRelations/></efrbr:relationships></efrbr:recordSet>