<efrbr:recordSet xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:efrbr="http://vfrbr.info/efrbr/1.1" xmlns:efrbr-work="http://vfrbr.info/efrbr/1.1/work" xmlns:efrbr-expression="http://vfrbr.info/efrbr/1.1/expression" xmlns:efrbr-manifestation="http://vfrbr.info/efrbr/1.1/manifestation" xmlns:efrbr-person="http://vfrbr.info/efrbr/1.1/person" xmlns:efrbr-corporateBody="http://vfrbr.info/efrbr/1.1/corporateBody" xmlns:efrbr-concept="http://vfrbr.info/efrbr/1.1/concept" xmlns:efrbr-structure="http://vfrbr.info/efrbr/1.1/structure" xmlns:efrbr-responsible="http://vfrbr.info/efrbr/1.1/responsible" xmlns:efrbr-subject="http://vfrbr.info/efrbr/1.1/subject" xmlns:efrbr-other="http://vfrbr.info/efrbr/1.1/other" xsi:schemaLocation="http://vfrbr.info/efrbr/1.1 http://vfrbr.info/schemas/1.1/efrbr.xsd"><efrbr:entities><efrbr-work:work identifier="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998"><efrbr-work:titleOfTheWork>An FPGA-based data pre-processing architecture to accelerate De-novo genome assembly</efrbr-work:titleOfTheWork></efrbr-work:work><efrbr-expression:expression identifier="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998"><efrbr-expression:titleOfTheExpression>An FPGA-based data pre-processing architecture to accelerate De-novo genome assembly</efrbr-expression:titleOfTheExpression><efrbr-expression:formOfExpression vocabulary="DIAS:TYPES">
            Πλήρης Δημοσίευση σε Συνέδριο
            Conference Full Paper
         </efrbr-expression:formOfExpression><efrbr-expression:dateOfExpression type="issued">2023-05-12</efrbr-expression:dateOfExpression><efrbr-expression:dateOfExpression type="published">2021</efrbr-expression:dateOfExpression><efrbr-expression:languageOfExpression vocabulary="iso639-1">en</efrbr-expression:languageOfExpression><efrbr-expression:otherDistinguishingCharacteristic>This work has been partially funded by the EU FP7 Project “QualiMaster - A Configurable Real-time Data Processing Infrastructure Mastering Autonomous Quality Adaptation”, Reference Number 619525, funded under the call FP7-ICT- 2013-1.</efrbr-expression:otherDistinguishingCharacteristic><efrbr-expression:summarizationOfContent>Genome assembly is a field of bioinformatics which refers to the process of taking small fragments of genetic material and putting them back together in order to reconstruct the original DNA sequence from which the fragments originated. As the DNA genome assembly input datasets in most cases have a very large amount of data, it is important to develop custom architectures in order to speed up these processes and gain significant execution time reduction. In this paper we present the Reads Matching Filter (RMF), an input dataset prefiltering process, based on string matching and implemented on Field Programmable Gate Array (FPGA) technology, in order to reduce the genome assembly execution time. The outputs of the RMF running on the FPGA as well as the original input dataset are given as input to the Velvet genome assembler which produces the assembly of the input sequences. The Velvet genome assembler is based on the manipulation of de Bruijn graphs, and produces its output via the removal of errors and the simplication of repeated regions. The FPGA-based RMF pre-filtering process manages to speedup the entire genome assembly processing, including I/O, by up to 6 times, while maintaining the quality of the output sequence contigs (i.e. the series of overlapping DNA sequences).</efrbr-expression:summarizationOfContent><efrbr-expression:useRestrictionsOnTheExpression type="creative-commons">http://creativecommons.org/licenses/by/4.0/</efrbr-expression:useRestrictionsOnTheExpression><efrbr-expression:note type="conference name">2021 IEEE 21st International Conference on Bioinformatics and Bioengineering</efrbr-expression:note></efrbr-expression:expression><efrbr-person:person identifier="http://users.isc.tuc.gr/~ggalanos"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Galanos Georgios
            Γαλανος Γεωργιος
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~pmalakonakis"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Malakonakis Pavlos
            Μαλακωνακης Παυλος
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-person:person identifier="http://users.isc.tuc.gr/~adollas"><efrbr-person:nameOfPerson vocabulary="TUC:LDAP">
            Dollas Apostolos
            Δολλας Αποστολος
         </efrbr-person:nameOfPerson></efrbr-person:person><efrbr-corporateBody:corporateBody identifier="https://v2.sherpa.ac.uk/id/publisher/38"><efrbr-corporateBody:nameOfTheCorporateBody vocabulary="S/R:PUBLISHERS">
            Institute of Electrical and Electronics Engineers
         </efrbr-corporateBody:nameOfTheCorporateBody></efrbr-corporateBody:corporateBody><efrbr-concept:concept identifier="00D51F8E-12D6-4EAF-97DE-FB56FCB6F5C4"><efrbr-concept:termForTheConcept>
            FPGA accelerator
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="BF41ADDA-1091-43E7-85EE-A0ED9B83D982"><efrbr-concept:termForTheConcept>
            Genome assembly
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="E7714553-E663-4738-889D-FFA9CB51C5E9"><efrbr-concept:termForTheConcept>
            Dataset filtering
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="C9799D63-6BF1-4AAE-8B18-4A37C0460B5E"><efrbr-concept:termForTheConcept>
            Velvet
         </efrbr-concept:termForTheConcept></efrbr-concept:concept><efrbr-concept:concept identifier="0310BAF1-9B44-4E19-8FD3-5EA68A96FF81"><efrbr-concept:termForTheConcept>
            de Bruijn graphs
         </efrbr-concept:termForTheConcept></efrbr-concept:concept></efrbr:entities><efrbr:relationships><efrbr-structure:structureRelations><efrbr-structure:realizedThrough sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="expression" targetURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998"/></efrbr-structure:structureRelations><efrbr-responsible:responsibleRelations><efrbr-responsible:createdBy sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="person" targetURI="http://users.isc.tuc.gr/~ggalanos"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="person" targetURI="http://users.isc.tuc.gr/~ggalanos" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="person" targetURI="http://users.isc.tuc.gr/~pmalakonakis" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="person" targetURI="http://users.isc.tuc.gr/~adollas" role="author"/><efrbr-responsible:realizedBy sourceEntity="expression" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="person" targetURI="https://v2.sherpa.ac.uk/id/publisher/38" role="publisher"/></efrbr-responsible:responsibleRelations><efrbr-subject:subjectRelations><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="concept" targetURI="00D51F8E-12D6-4EAF-97DE-FB56FCB6F5C4"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="concept" targetURI="BF41ADDA-1091-43E7-85EE-A0ED9B83D982"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="concept" targetURI="E7714553-E663-4738-889D-FFA9CB51C5E9"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="concept" targetURI="C9799D63-6BF1-4AAE-8B18-4A37C0460B5E"/><efrbr-subject:hasSubject sourceEntity="work" sourceURI="http://purl.tuc.gr/dl/dias/AE8A9926-209C-4135-92C6-1B1F113BC998" targetEntity="concept" targetURI="0310BAF1-9B44-4E19-8FD3-5EA68A96FF81"/></efrbr-subject:subjectRelations><efrbr-other:otherRelations/></efrbr:relationships></efrbr:recordSet>