<efrbr:recordSet xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:efrbr="http://vfrbr.info/efrbr/1.1" xmlns:efrbr-work="http://vfrbr.info/efrbr/1.1/work" xmlns:efrbr-expression="http://vfrbr.info/efrbr/1.1/expression" xmlns:efrbr-manifestation="http://vfrbr.info/efrbr/1.1/manifestation" xmlns:efrbr-person="http://vfrbr.info/efrbr/1.1/person" xmlns:efrbr-corporateBody="http://vfrbr.info/efrbr/1.1/corporateBody" xmlns:efrbr-concept="http://vfrbr.info/efrbr/1.1/concept" xmlns:efrbr-structure="http://vfrbr.info/efrbr/1.1/structure" xmlns:efrbr-responsible="http://vfrbr.info/efrbr/1.1/responsible" xmlns:efrbr-subject="http://vfrbr.info/efrbr/1.1/subject" xmlns:efrbr-other="http://vfrbr.info/efrbr/1.1/other" xsi:schemaLocation="http://vfrbr.info/efrbr/1.1 http://vfrbr.info/schemas/1.1/efrbr.xsd"><efrbr:entities><efrbr-work:work identifier="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701"><efrbr-work:titleOfTheWork>Implementation of decision trees for data streams in the Spark Streaming platform</efrbr-work:titleOfTheWork></efrbr-work:work><efrbr-expression:expression identifier="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701"><efrbr-expression:titleOfTheExpression>Implementation of decision trees for data streams in the Spark Streaming platform</efrbr-expression:titleOfTheExpression><efrbr-expression:titleOfTheExpression>Υλοποίηση δέντρων αποφάσεων για ροές δεδομένων στην πλατφόρμα Spark Streaming   </efrbr-expression:titleOfTheExpression><efrbr-expression:formOfExpression vocabulary="DIAS:TYPES">
            Διπλωματική Εργασία
            Diploma Work
         </efrbr-expression:formOfExpression><efrbr-expression:dateOfExpression type="issued">2018-09-17</efrbr-expression:dateOfExpression><efrbr-expression:dateOfExpression type="published">2018</efrbr-expression:dateOfExpression><efrbr-expression:languageOfExpression vocabulary="iso639-1">en</efrbr-expression:languageOfExpression><efrbr-expression:summarizationOfContent>In the era of big data, enormous amounts of data are created, replicated
and transferred every day. The current technology for handling and analyzing
vast amounts of data allows us to develop applications for various problems
(e.g., DNA sequence analysis, medical imaging, traffic control) that could not
previously be solved efficiently. More precisely, the time required to process
large volumes of data can be minimized by using distributed computing platforms
such as Apache Spark. The Apache Spark framework includes various
implementations for large-scale machine learning, distributed data streaming
processing and parallel graph analytics. The Spark Streaming platform provides
scalable and fault-tolerant data streaming processing. However, there is
only a limited number of implemented distributed incremental machine learning
algorithms available in the Spark Streaming platform.

In this thesis, we propose a parallel implementation of an incremental and
scalable tree learning method for classification in Spark Streaming, the Hoeffding
decision tree. Our proposed implementation performs horizontal data
parallelism in the shared-nothing architecture of Spark. The Hoeffding bound
guarantees with high confidence that the Hoeffding decision tree is asymptotically
identical to a batch-learning one. The high dimensional statistics, required
for evaluating splits, are stored as sparse matrices in main memory
across the Spark cluster. These statistics are instantly updated, when new
training instances are available. Furthermore, distributed computations are
performed in order to identify the optimal split and assess whether the splitting
criterion is satisfied. The generated model is used in order to make color
classification based on the spectral signature of each color. Each color has a
different chemical composition, and as a consequence a different spectral signature.</efrbr-expression:summarizationOfContent><efrbr-expression:useRestrictionsOnTheExpression type="creative-commons">http://creativecommons.org/licenses/by/4.0/</efrbr-expression:useRestrictionsOnTheExpression><efrbr-expression:note type="academic unit">Πολυτεχνείο Κρήτης::Σχολή Ηλεκτρολόγων Μηχανικών και Μηχανικών Υπολογιστών</efrbr-expression:note></efrbr-expression:expression><efrbr-manifestation:manifestation identifier="http://purl.tuc.gr/dl/dias/90DFEF93-14A9-4E78-B3EB-78BEDDC25B65"><efrbr-manifestation:titleOfTheManifestation>Ziakas_Christos_Dip_2018.pdf</efrbr-manifestation:titleOfTheManifestation><efrbr-manifestation:publicationDistribution><efrbr-manifestation:placeOfPublicationDistribution type="distribution">Chania [Greece]</efrbr-manifestation:placeOfPublicationDistribution><efrbr-manifestation:publisherDistributor type="distributor">Library of TUC</efrbr-manifestation:publisherDistributor><efrbr-manifestation:dateOfPublicationDistribution>2018-09-17</efrbr-manifestation:dateOfPublicationDistribution></efrbr-manifestation:publicationDistribution><efrbr-manifestation:formOfCarrier>application/pdf</efrbr-manifestation:formOfCarrier><efrbr-manifestation:extentOfTheCarrier>1.0 MB</efrbr-manifestation:extentOfTheCarrier><efrbr-manifestation:accessRestrictionsOnTheManifestation>free</efrbr-manifestation:accessRestrictionsOnTheManifestation></efrbr-manifestation:manifestation><efrbr-person:person identifier="http://users.isc.tuc.gr/~cziakas"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Ziakas Christos
            Ζιακας Χρηστος
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~mgarofalakis"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Garofalakis Minos
            Γαροφαλακης Μινως
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~adeligiannakis"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Deligiannakis Antonios
            Δεληγιαννακης Αντωνιος
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~vsamoladas"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Samoladas Vasilis
            Σαμολαδας Βασιλης
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-corporateBody:corporateBody identifier="B73FF5E5-749A-467D-827F-83F45348F918"><efrbr-corporateBody:nameOfTheCorporateBody vocabulary="">
            Πολυτεχνείο Κρήτης
            Technical University of Crete
         </efrbr-corporateBody:nameOfTheCorporateBody></efrbr-corporateBody:corporateBody><efrbr-concept:concept identifier="F9EA3AF8-DB56-449B-A56D-6CA9873D3AE2"><efrbr-concept:termForTheConcept>
            Spark
         </efrbr-concept:termForTheConcept></efrbr-concept:concept></efrbr:entities><efrbr:relationships><efrbr-structure:structureRelations><efrbr-structure:realizedThrough sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701" targetEntity="expression" targetURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701"/><efrbr-structure:embodiedIn sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701" targetEntity="manifestation" targetURI="http://purl.tuc.gr/dl/dias/90DFEF93-14A9-4E78-B3EB-78BEDDC25B65"/></efrbr-structure:structureRelations><efrbr-responsible:responsibleRelations><efrbr-responsible:createdBy sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701" targetEntity="person" targetURI="http://users.isc.tuc.gr/~cziakas"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701" targetEntity="person" targetURI="http://users.isc.tuc.gr/~cziakas" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701" targetEntity="person" targetURI="http://users.isc.tuc.gr/~mgarofalakis" role="http://purl.tuc.gr/dl/dias/vocabs/contributor-roles/1"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701" targetEntity="person" targetURI="http://users.isc.tuc.gr/~adeligiannakis" role="http://purl.tuc.gr/dl/dias/vocabs/contributor-roles/2"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701" targetEntity="person" targetURI="http://users.isc.tuc.gr/~vsamoladas" role="http://purl.tuc.gr/dl/dias/vocabs/contributor-roles/2"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701" targetEntity="person" targetURI="B73FF5E5-749A-467D-827F-83F45348F918" role="publisher"/></efrbr-responsible:responsibleRelations><efrbr-subject:subjectRelations><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/E1E86B11-FFC0-4334-AE22-A90F29608701" targetEntity="concept" targetURI="F9EA3AF8-DB56-449B-A56D-6CA9873D3AE2"/></efrbr-subject:subjectRelations><efrbr-other:otherRelations/></efrbr:relationships></efrbr:recordSet>