SDM-RDFizer: An RML Interpreter for the Efficient Creation of RDF Knowledge Graphs

dc.bibliographicCitation.firstPage3039eng
dc.bibliographicCitation.lastPage3046eng
dc.contributor.authorIglesias, Enrique
dc.contributor.authorJozashoori, Samaneh
dc.contributor.authorChaves-Fraga, David
dc.contributor.authorCollarana, Diego
dc.contributor.authorVidal, Maria-Esther
dc.date.accessioned2021-04-28T13:59:07Z
dc.date.available2021-04-28T13:59:07Z
dc.date.issued2020
dc.description.abstractIn recent years, the amount of data has increased exponentially, and knowledge graphs have gained attention as data structures to integrate data and knowledge harvested from myriad data sources. However, data complexity issues like large volume, high-duplicate rate, and heterogeneity usually characterize these data sources, being required data management tools able to address the negative impact of these issues on the knowledge graph creation process. In this paper, we propose the SDM-RDFizer, an interpreter of the RDF Mapping Language (RML), to transform raw data in various formats into an RDF knowledge graph. SDM-RDFizer implements novel algorithms to execute the logical operators between mappings in RML, allowing thus to scale up to complex scenarios where data is not only broad but has a high-duplication rate. We empirically evaluate the SDM-RDFizer performance against diverse testbeds with diverse configurations of data volume, duplicates, and heterogeneity. The observed results indicate that SDM-RDFizer is two orders of magnitude faster than state of the art, thus, meaning that SDM-RDFizer an interoperable and scalable solution for knowledge graph creation. SDM-RDFizer is publicly available as a resource through a Github repository and a DOI.eng
dc.description.versionacceptedVersioneng
dc.identifier.urihttps://oa.tib.eu/renate/handle/123456789/6162
dc.identifier.urihttps://doi.org/10.34657/5210
dc.language.isoengeng
dc.publisherNew York City, NY : Association for Computing Machineryeng
dc.relation.doihttps://doi.org/10.1145/3340531.3412881
dc.relation.isbn978-1-4503-6859-9
dc.relation.ispartofCIKM '20: Proceedings of the 29th ACM International Conference on Information & Knowledge Managementeng
dc.rights.licenseEs gilt deutsches Urheberrecht. Das Dokument darf zum eigenen Gebrauch kostenfrei genutzt, aber nicht im Internet bereitgestellt oder an Außenstehende weitergegeben werden.eng
dc.subjectKnowledge Grapheng
dc.subjectRDFeng
dc.subjectRMLeng
dc.subject.classificationKonferenzschriftger
dc.subject.ddc004eng
dc.titleSDM-RDFizer: An RML Interpreter for the Efficient Creation of RDF Knowledge Graphseng
dc.typebookParteng
dc.typeTexteng
tib.accessRightsopenAccesseng
tib.relation.conferenceCIKM '20: The 29th ACM International Conference on Information and Knowledge Management, October 2020, onlineeng
wgl.contributorTIBeng
wgl.subjectInformatikeng
wgl.typeBuchkapitel / Sammelwerksbeitrageng
wgl.typeKonferenzbeitrageng
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Iglesias2020, Postprint.pdf
Size:
3.9 MB
Format:
Adobe Portable Document Format
Description: