Search Results

SciBERT-based Semantification of Bioassays in the Open Research Knowledge Graph

2020, Anteghini, Marco, D'Souza, Jennifer, Martins dos Santos, Vitor A.P., Auer, Sören

As a novel contribution to the problem of semantifying biological assays, in this paper, we propose a neural-network-based approach to automatically semantify, and thereby structure, unstructured bioassay text descriptions. Experimental evaluations show promise, as the neural-based semantification significantly outperforms a naive frequency-based baseline approach. Specifically, the neural method attains 72% F1 versus 47% F1 from the frequency-based method. The work in this paper aligns with the present cutting-edge trend of the scholarly knowledge digitalization impetus, which aims to convert the long-standing document-based format of scholarly content into knowledge graphs (KGs). To this end, our selected data domain of bioassays is a prime candidate for structuring into KGs.

Easy Semantification of Bioassays

2022, Anteghini, Marco, D’Souza, Jennifer, dos Santos, Vitor A. P. Martins, Auer, Sören

Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. We propose a solution for automatically semantifying biological assays. Our solution contrasts two framings of automated semantification, labeling versus clustering, where the two methods are on opposite ends of the method complexity spectrum. Characteristically modeling our problem, we find that the clustering solution significantly outperforms a state-of-the-art deep neural network labeling approach. This novel contribution is based on two factors: 1) a learning objective closely modeled after the data outperforms an alternative approach with sophisticated semantic modeling; 2) automatically semantifying biological assays achieves a high F1 score of nearly 83%, which to our knowledge is the first reported standardized evaluation of the task, offering a strong benchmark model.