Context-Based Entity Matching for Big Data

dc.bibliographicCitation.firstPage122eng
dc.bibliographicCitation.lastPage146eng
dc.bibliographicCitation.volume12072eng
dc.contributor.authorTasnim, Mayesha
dc.contributor.authorCollarana, Diego
dc.contributor.authorGraux, Damien
dc.contributor.authorVidal, Maria-Esther
dc.contributor.editorJanev, Valentina
dc.contributor.editorGraux, Damien
dc.contributor.editorJabeen, Hajira
dc.contributor.editorSallinger, Emanuel
dc.date.accessioned2021-03-18T15:46:51Z
dc.date.available2021-03-18T15:46:51Z
dc.date.issued2020
dc.description.abstractIn the Big Data era, where variety is the most dominant dimension, the RDF data model enables the creation and integration of actionable knowledge from heterogeneous data sources. However, the RDF data model allows for describing entities under various contexts, e.g., people can be described from its demographic context, but as well from their professional contexts. Context-aware description poses challenges during entity matching of RDF datasets—the match might not be valid in every context. To perform a contextually relevant entity matching, the specific context under which a data-driven task, e.g., data integration is performed, must be taken into account. However, existing approaches only consider inter-schema and properties mapping of different data sources and prevent users from selecting contexts and conditions during a data integration process. We devise COMET, an entity matching technique that relies on both the knowledge stated in RDF vocabularies and a context-based similarity metric to map contextually equivalent RDF graphs. COMET follows a two-fold approach to solve the problem of entity matching in RDF graphs in a context-aware manner. In the first step, COMET computes the similarity measures across RDF entities and resorts to the Formal Concept Analysis algorithm to map contextually equivalent RDF entities. Finally, COMET combines the results of the first step and executes a 1-1 perfect matching algorithm for matching RDF entities based on the combined scores. We empirically evaluate the performance of COMET on testbed from DBpedia. The experimental results suggest that COMET accurately matches equivalent RDF graphs in a context-dependent manner.eng
dc.description.versionpublishedVersioneng
dc.identifier.urihttps://oa.tib.eu/renate/handle/123456789/6098
dc.identifier.urihttps://doi.org/10.34657/5080
dc.language.isoengeng
dc.publisherCham : Springereng
dc.relation.doihttps://doi.org/10.1007/978-3-030-53199-7_8
dc.relation.essn1611-3349
dc.relation.isbn978-3-030-53198-0
dc.relation.ispartofKnowledge Graphs and Big Data Processingeng
dc.relation.ispartofseriesLecture Notes in Computer Scienceeng
dc.relation.issn0302-9743
dc.rights.licenseCC BY 4.0 Unportedeng
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/eng
dc.subjectLAMBDA Projecteng
dc.subjectRDFeng
dc.subjectBig Dataeng
dc.subject.ddc004eng
dc.titleContext-Based Entity Matching for Big Dataeng
dc.typebookParteng
dc.typeTexteng
dcterms.bibliographicCitation.journalTitleLecture Notes in Computer Scienceeng
tib.accessRightsopenAccesseng
wgl.contributorTIBeng
wgl.subjectInformatikeng
wgl.typeBuchkapitel / Sammelwerksbeitrageng
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Tasnim2020_Chapter_Chapter8Context-BasedEntityMat.pdf
Size:
3.57 MB
Format:
Adobe Portable Document Format
Description: