The SciQA Scientific Question Answering Benchmark for Scholarly Knowledge

Auer, Sören; Barone, Dante A.C.; Bartz, Cassiano; Cortes, Eduardo G.; Jaradeh, Mohamad Yaser; Karras, Oliver; Koubarakis, Manolis; Mouromtsev, Dmitry; Pliukhin, Dmitrii; Radyush, Daniil; Shilin, Ivan; Stocker, Markus; Tsalapati, Eleni

doi:https://doi.org/10.34657/11447

The SciQA Scientific Question Answering Benchmark for Scholarly Knowledge

dc.bibliographicCitation.firstPage	7240
dc.bibliographicCitation.journalTitle	Scientific reports	eng
dc.bibliographicCitation.volume	13
dc.contributor.author	Auer, Sören
dc.contributor.author	Barone, Dante A.C.
dc.contributor.author	Bartz, Cassiano
dc.contributor.author	Cortes, Eduardo G.
dc.contributor.author	Jaradeh, Mohamad Yaser
dc.contributor.author	Karras, Oliver
dc.contributor.author	Koubarakis, Manolis
dc.contributor.author	Mouromtsev, Dmitry
dc.contributor.author	Pliukhin, Dmitrii
dc.contributor.author	Radyush, Daniil
dc.contributor.author	Shilin, Ivan
dc.contributor.author	Stocker, Markus
dc.contributor.author	Tsalapati, Eleni
dc.date.accessioned	2023-07-06T07:25:40Z
dc.date.available	2023-07-06T07:25:40Z
dc.date.issued	2023
dc.description.abstract	Knowledge graphs have gained increasing popularity in the last decade in science and technology. However, knowledge graphs are currently relatively simple to moderate semantic structures that are mainly a collection of factual statements. Question answering (QA) benchmarks and systems were so far mainly geared towards encyclopedic knowledge graphs such as DBpedia and Wikidata. We present SciQA a scientific QA benchmark for scholarly knowledge. The benchmark leverages the Open Research Knowledge Graph (ORKG) which includes almost 170,000 resources describing research contributions of almost 15,000 scholarly articles from 709 research fields. Following a bottom-up methodology, we first manually developed a set of 100 complex questions that can be answered using this knowledge graph. Furthermore, we devised eight question templates with which we automatically generated further 2465 questions, that can also be answered with the ORKG. The questions cover a range of research fields and question types and are translated into corresponding SPARQL queries over the ORKG. Based on two preliminary evaluations, we show that the resulting SciQA benchmark represents a challenging task for next-generation QA systems. This task is part of the open competitions at the 22nd International Semantic Web Conference 2023 as the Scholarly Question Answering over Linked Data (QALD) Challenge.	eng
dc.description.version	publishedVersion
dc.identifier.uri	https://oa.tib.eu/renate/handle/123456789/12417
dc.identifier.uri	https://doi.org/10.34657/11447
dc.language.iso	eng
dc.publisher	London : Nature Publishing Group
dc.relation.doi	https://doi.org/10.1038/s41598-023-33607-z
dc.relation.essn	2045-2322
dc.rights.license	CC BY 4.0 Unported
dc.rights.uri	https://creativecommons.org/licenses/by/4.0/
dc.subject.ddc	500
dc.subject.ddc	600
dc.subject.other	Computer science	eng
dc.subject.other	Information technology	eng
dc.subject.other	Scientific data	eng
dc.title	The SciQA Scientific Question Answering Benchmark for Scholarly Knowledge
dc.type	Article	eng
dc.type	Text	eng
tib.accessRights	openAccess
wgl.contributor	TIB
wgl.subject	Erziehung, Schul-und Bildungswesen
wgl.type	Zeitschriftenartikel

Files

Original bundle

Now showing 1 - 1 of 1

Name:: s41598-023-33607-z.pdf
Size:: 1.51 MB
Format:: Adobe Portable Document Format
Description:

Download

Collections

Informationswissenschaften