<efrbr:recordSet xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:efrbr="http://vfrbr.info/efrbr/1.1" xmlns:efrbr-work="http://vfrbr.info/efrbr/1.1/work" xmlns:efrbr-expression="http://vfrbr.info/efrbr/1.1/expression" xmlns:efrbr-manifestation="http://vfrbr.info/efrbr/1.1/manifestation" xmlns:efrbr-person="http://vfrbr.info/efrbr/1.1/person" xmlns:efrbr-corporateBody="http://vfrbr.info/efrbr/1.1/corporateBody" xmlns:efrbr-concept="http://vfrbr.info/efrbr/1.1/concept" xmlns:efrbr-structure="http://vfrbr.info/efrbr/1.1/structure" xmlns:efrbr-responsible="http://vfrbr.info/efrbr/1.1/responsible" xmlns:efrbr-subject="http://vfrbr.info/efrbr/1.1/subject" xmlns:efrbr-other="http://vfrbr.info/efrbr/1.1/other" xsi:schemaLocation="http://vfrbr.info/efrbr/1.1 http://vfrbr.info/schemas/1.1/efrbr.xsd"><efrbr:entities><efrbr-work:work identifier="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC"><efrbr-work:titleOfTheWork>The best of many worlds: scheduling machine learning inference on CPU-GPU integrated architectures</efrbr-work:titleOfTheWork></efrbr-work:work><efrbr-expression:expression identifier="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC"><efrbr-expression:titleOfTheExpression>The best of many worlds: scheduling machine learning inference on CPU-GPU integrated architectures</efrbr-expression:titleOfTheExpression><efrbr-expression:formOfExpression vocabulary="DIAS:TYPES">
            Πλήρης Δημοσίευση σε Συνέδριο
            Conference Full Paper
         </efrbr-expression:formOfExpression><efrbr-expression:dateOfExpression type="issued">2024-07-31</efrbr-expression:dateOfExpression><efrbr-expression:dateOfExpression type="published">2022</efrbr-expression:dateOfExpression><efrbr-expression:languageOfExpression vocabulary="iso639-1">en</efrbr-expression:languageOfExpression><efrbr-expression:summarizationOfContent>A plethora of applications are using machine learning, the operations of which are becoming more complex and require additional computing power. At the same time, typical commodity system setups (including desktops, servers, and embedded
devices) are now offering different processing devices, the most often of which are multi-core CPUs, integrated GPUs, and discrete GPUs. In this paper, we follow a data-driven approach, where we first show the performance of different processing
devices when executing a diversified set of inference engines; some processing devices perform better for different performance metrics (e.g., throughput, latency, and power consumption), while at the same time, these metrics may also deviate significantly among different applications. Based on these findings, we propose
an adaptive scheduling approach, tailored for machine learning inference operations, that enables the use of the most efficient processing device available. Our scheduler is device-agnostic and can respond quickly to dynamic fluctuations that occur at real-time, such as data bursts, application overloads and system changes. The experimental results show that it is able to match the peak throughput, by predicting correctly the optimal processing device with an accuracy of 92.5%, with energy savings up to 10%.</efrbr-expression:summarizationOfContent><efrbr-expression:contextForTheExpression>This work was supported by the projects CONCORDIA, C4IIoT, COLLABS, and MARVEL funded by the European Commission under Grant Agreements No. 830927, No. 833828, No. 871518, and No. 957337.</efrbr-expression:contextForTheExpression><efrbr-expression:useRestrictionsOnTheExpression type="creative-commons">http://creativecommons.org/licenses/by-nc-nd/4.0/</efrbr-expression:useRestrictionsOnTheExpression><efrbr-expression:note type="page range">55-64</efrbr-expression:note><efrbr-expression:note type="conference name">2022 IEEE International Parallel and Distributed Processing Symposium Workshops</efrbr-expression:note><efrbr-expression:note type="proceedings title">Proceedings of the 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW 2022)</efrbr-expression:note></efrbr-expression:expression><efrbr-manifestation:manifestation identifier="https://dias.library.tuc.gr/view/100591"><efrbr-manifestation:titleOfTheManifestation>Vasiliadis_et_al_IPDPSW_2022_preprint.pdf</efrbr-manifestation:titleOfTheManifestation><efrbr-manifestation:publicationDistribution><efrbr-manifestation:placeOfPublicationDistribution type="distribution">Chania [Greece]</efrbr-manifestation:placeOfPublicationDistribution><efrbr-manifestation:publisherDistributor type="distributor">Library of TUC</efrbr-manifestation:publisherDistributor><efrbr-manifestation:dateOfPublicationDistribution>2024-07-31</efrbr-manifestation:dateOfPublicationDistribution></efrbr-manifestation:publicationDistribution><efrbr-manifestation:formOfCarrier>application/pdf</efrbr-manifestation:formOfCarrier><efrbr-manifestation:extentOfTheCarrier>828.3 kB</efrbr-manifestation:extentOfTheCarrier><efrbr-manifestation:accessRestrictionsOnTheManifestation>free</efrbr-manifestation:accessRestrictionsOnTheManifestation></efrbr-manifestation:manifestation><efrbr-person:person identifier="D48097DC-833D-42CD-9D72-5CEFAB5A8F3B"><efrbr-person:nameOfPerson vocabulary="">
            Vasiliadis Giorgos
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="2F384856-A9DD-46E3-8CE9-892F9782694D"><efrbr-person:nameOfPerson vocabulary="">
            Tsirbas Rafail
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~sioannidis"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Ioannidis Sotirios
            Ιωαννιδης Σωτηριος
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-corporateBody:corporateBody identifier="https://v2.sherpa.ac.uk/id/publisher/38"><efrbr-corporateBody:nameOfTheCorporateBody vocabulary="S/R:PUBLISHERS">
            Institute of Electrical and Electronics Engineers
         </efrbr-corporateBody:nameOfTheCorporateBody></efrbr-corporateBody:corporateBody><efrbr-concept:concept identifier="FACF820D-161E-4E3F-A788-4BEC7ECC9E20"><efrbr-concept:termForTheConcept>
            Machine learning algorithms
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="FBDCB28B-2565-458A-ABE8-B5459F3AAD00"><efrbr-concept:termForTheConcept>
            Computer architecture
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="F1DB686A-81DB-4746-BC91-DA5929703AFD"><efrbr-concept:termForTheConcept>
            Performance evaluation
         </efrbr-concept:termForTheConcept></efrbr-concept:concept></efrbr:entities><efrbr:relationships><efrbr-structure:structureRelations><efrbr-structure:realizedThrough sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="expression" targetURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC"/><efrbr-structure:embodiedIn sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="manifestation" targetURI="http://purl.tuc.gr/dl/dias/217CBC55-CDB8-4B0B-ADA3-F27604DBBAC2"/></efrbr-structure:structureRelations><efrbr-responsible:responsibleRelations><efrbr-responsible:createdBy sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="person" targetURI="D48097DC-833D-42CD-9D72-5CEFAB5A8F3B"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="person" targetURI="D48097DC-833D-42CD-9D72-5CEFAB5A8F3B" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="person" targetURI="2F384856-A9DD-46E3-8CE9-892F9782694D" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="person" targetURI="http://users.isc.tuc.gr/~sioannidis" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="person" targetURI="https://v2.sherpa.ac.uk/id/publisher/38" role="publisher"/></efrbr-responsible:responsibleRelations><efrbr-subject:subjectRelations><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="concept" targetURI="FACF820D-161E-4E3F-A788-4BEC7ECC9E20"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="concept" targetURI="FBDCB28B-2565-458A-ABE8-B5459F3AAD00"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/30E0BFB7-D2EE-4943-B7A2-0F2D1C5DA2EC" targetEntity="concept" targetURI="F1DB686A-81DB-4746-BC91-DA5929703AFD"/></efrbr-subject:subjectRelations><efrbr-other:otherRelations/></efrbr:relationships></efrbr:recordSet>