The SciQA Scientific Question Answering Benchmark for Scholarly Knowledge

dc.bibliographicCitation.firstPage7240
dc.bibliographicCitation.journalTitleScientific reportseng
dc.bibliographicCitation.volume13
dc.contributor.authorAuer, Sören
dc.contributor.authorBarone, Dante A.C.
dc.contributor.authorBartz, Cassiano
dc.contributor.authorCortes, Eduardo G.
dc.contributor.authorJaradeh, Mohamad Yaser
dc.contributor.authorKarras, Oliver
dc.contributor.authorKoubarakis, Manolis
dc.contributor.authorMouromtsev, Dmitry
dc.contributor.authorPliukhin, Dmitrii
dc.contributor.authorRadyush, Daniil
dc.contributor.authorShilin, Ivan
dc.contributor.authorStocker, Markus
dc.contributor.authorTsalapati, Eleni
dc.date.accessioned2023-07-06T07:25:40Z
dc.date.available2023-07-06T07:25:40Z
dc.date.issued2023
dc.description.abstractKnowledge graphs have gained increasing popularity in the last decade in science and technology. However, knowledge graphs are currently relatively simple to moderate semantic structures that are mainly a collection of factual statements. Question answering (QA) benchmarks and systems were so far mainly geared towards encyclopedic knowledge graphs such as DBpedia and Wikidata. We present SciQA a scientific QA benchmark for scholarly knowledge. The benchmark leverages the Open Research Knowledge Graph (ORKG) which includes almost 170,000 resources describing research contributions of almost 15,000 scholarly articles from 709 research fields. Following a bottom-up methodology, we first manually developed a set of 100 complex questions that can be answered using this knowledge graph. Furthermore, we devised eight question templates with which we automatically generated further 2465 questions, that can also be answered with the ORKG. The questions cover a range of research fields and question types and are translated into corresponding SPARQL queries over the ORKG. Based on two preliminary evaluations, we show that the resulting SciQA benchmark represents a challenging task for next-generation QA systems. This task is part of the open competitions at the 22nd International Semantic Web Conference 2023 as the Scholarly Question Answering over Linked Data (QALD) Challenge.eng
dc.description.versionpublishedVersion
dc.identifier.urihttps://oa.tib.eu/renate/handle/123456789/12417
dc.identifier.urihttps://doi.org/10.34657/11447
dc.language.isoeng
dc.publisherLondon : Nature Publishing Group
dc.relation.doihttps://doi.org/10.1038/s41598-023-33607-z
dc.relation.essn2045-2322
dc.rights.licenseCC BY 4.0 Unported
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/
dc.subject.ddc500
dc.subject.ddc600
dc.subject.otherComputer scienceeng
dc.subject.otherInformation technologyeng
dc.subject.otherScientific dataeng
dc.titleThe SciQA Scientific Question Answering Benchmark for Scholarly Knowledge
dc.typeArticleeng
dc.typeTexteng
tib.accessRightsopenAccess
wgl.contributorTIB
wgl.subjectErziehung, Schul-und Bildungswesen
wgl.typeZeitschriftenartikel
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
s41598-023-33607-z.pdf
Size:
1.51 MB
Format:
Adobe Portable Document Format
Description: