Information extraction pipelines for knowledge graphs

dc.bibliographicCitation.firstPage1989
dc.bibliographicCitation.lastPage2016
dc.bibliographicCitation.volume65
dc.contributor.authorJaradeh, Mohamad Yaser
dc.contributor.authorSingh, Kuldeep
dc.contributor.authorStocker, Markus
dc.contributor.authorBoth, Andreas
dc.contributor.authorAuer, Sören
dc.date.accessioned2023-04-04T08:15:22Z
dc.date.available2023-04-04T08:15:22Z
dc.date.issued2023
dc.description.abstractIn the last decade, a large number of knowledge graph (KG) completion approaches were proposed. Albeit effective, these efforts are disjoint, and their collective strengths and weaknesses in effective KG completion have not been studied in the literature. We extend Plumber, a framework that brings together the research community’s disjoint efforts on KG completion. We include more components into the architecture of Plumber to comprise 40 reusable components for various KG completion subtasks, such as coreference resolution, entity linking, and relation extraction. Using these components, Plumber dynamically generates suitable knowledge extraction pipelines and offers overall 432 distinct pipelines. We study the optimization problem of choosing optimal pipelines based on input sentences. To do so, we train a transformer-based classification model that extracts contextual embeddings from the input and finds an appropriate pipeline. We study the efficacy of Plumber for extracting the KG triples using standard datasets over three KGs: DBpedia, Wikidata, and Open Research Knowledge Graph. Our results demonstrate the effectiveness of Plumber in dynamically generating KG completion pipelines, outperforming all baselines agnostic of the underlying KG. Furthermore, we provide an analysis of collective failure cases, study the similarities and synergies among integrated components and discuss their limitations.eng
dc.description.versionpublishedVersioneng
dc.identifier.urihttps://oa.tib.eu/renate/handle/123456789/11910
dc.identifier.urihttp://dx.doi.org/10.34657/10943
dc.language.isoeng
dc.publisherLondon : Springer
dc.relation.doihttps://doi.org/10.1007/s10115-022-01826-x
dc.relation.essn0219-3116
dc.relation.ispartofseriesKnowledge and Information Systems 65 (2023)eng
dc.relation.issn0219-1377
dc.rights.licenseCC BY 4.0 Unported
dc.rights.urihttps://creativecommons.org/licenses/by/4.0
dc.subjectInformation extractioneng
dc.subjectNLP pipelineseng
dc.subjectSemantic searcheng
dc.subjectSemantic webeng
dc.subjectSoftware reusabilityeng
dc.subject.ddc004
dc.subject.ddc070
dc.titleInformation extraction pipelines for knowledge graphseng
dc.typearticle
dc.typeText
dcterms.bibliographicCitation.journalTitleKnowledge and Information Systems
tib.accessRightsopenAccess
wgl.contributorTIB
wgl.subjectInformatikger
wgl.typeZeitschriftenartikelger
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
s10115-022-01826-x.pdf
Size:
2.1 MB
Format:
Adobe Portable Document Format
Description:
Collections