Context-Based Entity Matching for Big Data

Tasnim, Mayesha; Collarana, Diego; Graux, Damien; Vidal, Maria-Esther

doi:https://doi.org/10.34657/5080

dc.bibliographicCitation.bookTitle	Knowledge Graphs and Big Data Processing	eng
dc.bibliographicCitation.firstPage	122	eng
dc.bibliographicCitation.journalTitle	Lecture Notes in Computer Science	eng
dc.bibliographicCitation.lastPage	146	eng
dc.bibliographicCitation.volume	12072	eng
dc.contributor.author	Tasnim, Mayesha
dc.contributor.author	Collarana, Diego
dc.contributor.author	Graux, Damien
dc.contributor.author	Vidal, Maria-Esther
dc.contributor.editor	Janev, Valentina
dc.contributor.editor	Graux, Damien
dc.contributor.editor	Jabeen, Hajira
dc.contributor.editor	Sallinger, Emanuel
dc.date.accessioned	2021-03-18T15:46:51Z
dc.date.available	2021-03-18T15:46:51Z
dc.date.issued	2020
dc.description.abstract	In the Big Data era, where variety is the most dominant dimension, the RDF data model enables the creation and integration of actionable knowledge from heterogeneous data sources. However, the RDF data model allows entities to be described under various contexts, e.g., a person can be described in a demographic context as well as in a professional context. Context-aware description poses challenges during entity matching of RDF datasets, since a match might not be valid in every context. To perform contextually relevant entity matching, the specific context under which a data-driven task, e.g., data integration, is performed must be taken into account. However, existing approaches only consider inter-schema and property mappings across different data sources and prevent users from selecting contexts and conditions during a data integration process. We devise COMET, an entity matching technique that relies on both the knowledge stated in RDF vocabularies and a context-based similarity metric to map contextually equivalent RDF graphs. COMET follows a two-fold approach to solve the problem of entity matching in RDF graphs in a context-aware manner. In the first step, COMET computes the similarity measures across RDF entities and resorts to the Formal Concept Analysis algorithm to map contextually equivalent RDF entities. In the second step, COMET combines the results of the first step and executes a 1-1 perfect matching algorithm to match RDF entities based on the combined scores. We empirically evaluate the performance of COMET on a testbed from DBpedia. The experimental results suggest that COMET accurately matches equivalent RDF graphs in a context-dependent manner.	eng
dc.description.version	publishedVersion	eng
dc.identifier.uri	https://oa.tib.eu/renate/handle/123456789/6098
dc.identifier.uri	https://doi.org/10.34657/5080
dc.language.iso	eng	eng
dc.publisher	Cham : Springer	eng
dc.relation.doi	https://doi.org/10.1007/978-3-030-53199-7_8
dc.relation.essn	1611-3349
dc.relation.isbn	978-3-030-53198-0
dc.relation.issn	0302-9743
dc.rights.license	CC BY 4.0 International	eng
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/	eng
dc.subject.ddc	004	eng
dc.subject.other	LAMBDA Project	eng
dc.subject.other	RDF	eng
dc.subject.other	Big Data	eng
dc.title	Context-Based Entity Matching for Big Data	eng
dc.type	BookPart	eng
dc.type	Text	eng
tib.accessRights	openAccess	eng
wgl.contributor	TIB	eng
wgl.subject	Computer science	eng
wgl.type	Book chapter / Contribution to an edited volume	eng
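
The two steps summarized in the abstract can be illustrated with a minimal Python sketch. It is not the COMET implementation: Jaccard similarity over (property, value) pairs stands in for the chapter's context-based similarity metric, a plain filter on context properties stands in for the Formal Concept Analysis step, and the 1-1 matching is a brute-force search that is only practical for tiny inputs. All entity identifiers, property names, and values are hypothetical.

```python
from itertools import permutations


def jaccard(a, b):
    """Jaccard similarity of two sets; defined as 1.0 when both are empty."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def restrict(entity, context_props):
    """Keep only the (property, value) pairs whose property is in the context."""
    return {(p, v) for (p, v) in entity if p in context_props}


def match_one_to_one(left, right, context_props):
    """Return the 1-1 assignment left -> right with the highest total similarity.

    Assumption: both sides have the same number of entities. The search tries
    every permutation, so it is an illustration, not a scalable algorithm.
    """
    if len(left) != len(right):
        raise ValueError("both datasets must contain the same number of entities")
    l_ids, r_ids = sorted(left), sorted(right)
    sim = {
        (l, r): jaccard(restrict(left[l], context_props),
                        restrict(right[r], context_props))
        for l in l_ids for r in r_ids
    }
    best = max(permutations(r_ids),
               key=lambda perm: sum(sim[(l, r)] for l, r in zip(l_ids, perm)))
    return dict(zip(l_ids, best))


# Hypothetical entities: same demographic description, different professions.
left = {
    "a1": {("name", "Alice"), ("occupation", "Chemist"), ("birthPlace", "Bonn")},
    "a2": {("name", "Alice"), ("occupation", "Politician"), ("birthPlace", "Bonn")},
}
right = {
    "b1": {("name", "Alice"), ("occupation", "Politician"), ("birthPlace", "Bonn")},
    "b2": {("name", "Alice"), ("occupation", "Chemist"), ("birthPlace", "Bonn")},
}

professional = match_one_to_one(left, right, {"occupation"})
demographic = match_one_to_one(left, right, {"name", "birthPlace"})
print(professional)  # {'a1': 'b2', 'a2': 'b1'}
```

Under the professional context only one assignment is supported by the data, whereas under the demographic context every pairing scores equally, which illustrates why a match that holds in one context may not hold in another.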

Files

Original bundle

Name: Tasnim2020_Chapter_Chapter8Context-BasedEntityMat.pdf
Size: 3.57 MB
Format: Adobe Portable Document Format

Collections

Informationswissenschaften