Ιδρυματικό Αποθετήριο [SANDBOX]
Πολυτεχνείο Κρήτης
EN  |  EL

Αναζήτηση

Πλοήγηση

Ο Χώρος μου

Extraction and mapping of CIDOC-CRM encodings from texts and other digital formats

Mania Aikaterini, Arnold ,D , Niccolucci, Franco

Απλή Εγγραφή


URIhttp://purl.tuc.gr/dl/dias/48AA2199-404D-41FD-B109-08F575FDD290-
Αναγνωριστικόhttp://lipn.univ-paris13.fr/~genereux/pub_fichiers/short_paper_1074_060920.pdf-
Γλώσσαen-
Μέγεθος6 pagesen
ΤίτλοςExtraction and mapping of CIDOC-CRM encodings from texts and other digital formatsen
ΔημιουργόςMania Aikaterinien
ΔημιουργόςΜανια Αικατερινηel
ΔημιουργόςArnold ,D en
ΔημιουργόςNiccolucci, Francoen
ΕκδότηςThe Eurographics Association en
ΠερίληψηCIDOC-CRM is a new standard for encoding a wide range of information for Cultural Heritage (CH). At present, existing CH collections are stored using all sorts of formats, sometimes proprietary, often defined roughly, which makes it difficult to share or access heterogeneous information among the CH community. There is a need for a tool to map diverse formats into CIDOC-CRM, assisted by another tool using intelligent language technology to help the mapping whenever fields are underspecified or loosely described, both tools being complementary. In some cases, it may even be better to build fragments of a CIDOC database directly from informal descriptions in natural language only, as the CH community may be reluctant to switch to new formats of data entry. Therefore, this paper focus primarily on the mapping of CH data described in natural language into CIDOC-CRM triples, the building blocks of the full CIDOC-CRM ontology. The methods exploits the propositional nature of CIDOC-CRM triples. Using WordNet as a lexical database and the WEB as corpus, we first extract triples from examples provided in the CIDOC-CRM literature, and then from text describing the medieval city of Wolfenbüttel. We show the strong points of the system and suggest where and how it could be improved. Although the triples extracted automatically from texts do not provide a full picture of the CIDOC-CRM structure buried in the textual description, our results indicate that it provides a sound initial working basis for the mapping/translation process, saving time on what would otherwise have to be done by hand.en
ΤύποςΣύντομη Δημοσίευση σε Συνέδριοel
ΤύποςConference Short Paperen
Άδεια Χρήσηςhttp://creativecommons.org/licenses/by/4.0/en
Ημερομηνία2015-11-01-
Ημερομηνία Δημοσίευσης2006-
Θεματική ΚατηγορίαDigital libraries--Administrationen
Θεματική Κατηγορίαdigital libraries managementen
Θεματική Κατηγορίαdigital libraries administrationen
Βιβλιογραφική ΑναφοράM. Ioannides, D. Arnold, F. Niccolucci, K. Mania .(2006).Extraction and mapping of CIDOC-CRM encodings from texts and other digital formats .Presented at 7th International Symposium on Virtual Reality, Archaeology and Cultural Heritage VAST (2006).[online].Available:http://lipn.univ-paris13.fr/~genereux/pub_fichiers/short_paper_1074_060920.pdfen

Υπηρεσίες

Στατιστικά