Search Results

Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Item

Easy Semantification of Bioassays

2022, Anteghini, Marco, D’Souza, Jennifer, dos Santos, Vitor A. P. Martins, Auer, Sören

Biological data and knowledge bases increasingly rely on Semantic Web technologies and the use of knowledge graphs for data integration, retrieval and federated queries. We propose a solution for automatically semantifying biological assays. Our solution contrasts the problem of automated semantification as labeling versus clustering where the two methods are on opposite ends of the method complexity spectrum. Characteristically modeling our problem, we find the clustering solution significantly outperforms a deep neural network state-of-the-art labeling approach. This novel contribution is based on two factors: 1) a learning objective closely modeled after the data outperforms an alternative approach with sophisticated semantic modeling; 2) automatically semantifying biological assays achieves a high performance F1 of nearly 83%, which to our knowledge is the first reported standardized evaluation of the task offering a strong benchmark model.

Loading...
Thumbnail Image
Item

Clustering Semantic Predicates in the Open Research Knowledge Graph

2022, Arab Oghli, Omar, D’Souza, Jennifer, Auer, Sören

When semantically describing knowledge graphs (KGs), users have to make a critical choice of a vocabulary (i.e. predicates and resources). The success of KG building is determined by the convergence of shared vocabularies so that meaning can be established. The typical lifecycle for a new KG construction can be defined as follows: nascent phases of graph construction experience terminology divergence, while later phases of graph construction experience terminology convergence and reuse. In this paper, we describe our approach tailoring two AI-based clustering algorithms for recommending predicates (in RDF statements) about resources in the Open Research Knowledge Graph (ORKG) https://orkg.org/. Such a service to recommend existing predicates to semantify new incoming data of scholarly publications is of paramount importance for fostering terminology convergence in the ORKG. Our experiments show very promising results: a high precision with relatively high recall in linear runtime performance. Furthermore, this work offers novel insights into the predicate groups that automatically accrue loosely as generic semantification patterns for semantification of scholarly knowledge spanning 44 research fields.

Loading...
Thumbnail Image
Item

DoMoRe – A recommender system for domain modeling

2018, Agt-Rickauer, Henning, Kutsche, Ralf-Detlef, Sack, Harald, Hammoudi, Slimane, Ferreira Pires, Luis, Selic, Bran

Domain modeling is an important activity in early phases of software projects to achieve a shared understanding of the problem field among project participants. Domain models describe concepts and relations of respective application fields using a modeling language and domain-specific terms. Detailed knowledge of the domain as well as expertise in model-driven development is required for software engineers to create these models. This paper describes DoMoRe, a system for automated modeling recommendations to support the domain modeling process. We describe an approach in which modeling benefits from formalized knowledge sources and information extraction from text. The system incorporates a large network of semantically related terms built from natural language data sets integrated with mediator-based knowledge base querying in a single recommender system to provide context-sensitive suggestions of model elements.

Loading...
Thumbnail Image
Item

Research Information Infrastructure in Ukraine: First steps towards building a national CRIS

2022, Kaliuzhna, Nataliia, Auhunas, Sabina

Development and implementation of Current Research Information Systems (CRIS) is one of the most transparent and practical approaches to curate research information on a national level. The process of building and implementing such systems is a complex and time consuming where successful results heavily depend on the established research information infrastructure of a country, the interoperability of the systems and the quality of the information which reside in them. The purpose of this paper is to analyse the existing Ukrainian Research Information Infrastructure and identify which databases could be reused and integrated with a national Ukrainian Current Research Information System (URIS). The analysis showed that there are functional databases and registries that collect data on research activities and could be used as a data sources for the URIS. In particular, the Unified State Electronic Database on Education is a potential data source on higher educational institutions, the National Repository of Academic Texts - on metadata on research output, internal database of the National Research Foundation of Ukraine and database on research projects maintained by Ukrainian Institute of Scientific Technical and Economic Information - on projects. Secondly, it was identified that Ukrainian research infrastructure lacks complete, up-to-date registry on researchers. Finally, we discussed the challenges and solutions for further steps in building national CRIS.