Search Results

Now showing 1 - 6 of 6
Loading...
Thumbnail Image
Item

Detecting Cross-Language Plagiarism using Open Knowledge Graphs

2021, Stegmüller, Johannes, Bauer-Marquart, Fabian, Meuschke, Norman, Ruas, Terry, Schubotz, Moritz, Gipp, Bela, Zhang, Chengzhi, Mayr, Philipp, Lu, Wie, Zhang, Yi

Identifying cross-language plagiarism is challenging, especially for distant language pairs and sense-for-sense translations. We introduce the new multilingual retrieval model Cross-Language Ontology-Based Similarity Analysis (CL-OSA) for this task. CL-OSA represents documents as entity vectors obtained from the open knowledge graph Wikidata. Opposed to other methods, CL-OSA does not require computationally expensive machine translation, nor pre-training using comparable or parallel corpora. It reliably disambiguates homonyms and scales to allow its application toWebscale document collections. We show that CL-OSA outperforms state-of-the-art methods for retrieving candidate documents from five large, topically diverse test corpora that include distant language pairs like Japanese-English. For identifying cross-language plagiarism at the character level, CL-OSA primarily improves the detection of sense-for-sense translations. For these challenging cases, CL-OSA’s performance in terms of the well-established PlagDet score exceeds that of the best competitor by more than factor two. The code and data of our study are openly available.

Loading...
Thumbnail Image
Item

From Floppy Disks to 5-Star LOD: FAIR Research Infrastructure for NFDI4Culture

2023, Tietz, Tabea, Bruns, Oleksandra, Söhn, Linnaea, Tolksdorf, Julia, Posthumus, Etienne, Steller, Jonatan Jalle, Fliegl, Heike, Norouzi, Ebrahim, Waitelonis, Jörg, Schrade, Torsten, Sack, Harald

NFDI4Culture is establishing an infrastructure for research data on material and immaterial cultural heritage in the context of the German National Research Data Infrastructure (NFDI) in compliance with the FAIR principles. The NFDI4Culture Knowledge Graph is developed and integrated with the Culture Information Portal to aggregate diverse and isolated data from the culture research landscape and thereby increase the discoverability, interoperability and reusability of cultural heritage data. This paper presents the research data management strategy in the long-term project NFDI4Culture, which combines a CMS and a Knowledge Graph-based infrastructure to enable an intuitive and meaningful interaction with research resources in the cultural heritage domain.

Loading...
Thumbnail Image
Item

TRANSRAZ Data Model: Towards a Geosocial Representation of Historical Cities

2023, Bruns, Oleksandra, Tietz, Tabea, Göller, Sandra, Sack, Harald, Acosta, M., Peroni, S., Vahdati, S., Gentile, A.-L., Pellegrini, T., Kalo, J.-C.

Preserving historical city architectures and making them (publicly) available has emerged as an important field of the cultural heritage and digital humanities research domain. In this context, the TRANSRAZ project is creating an interactive 3D environment of the historical city of Nuremberg which spans over different periods of time. Next to the exploration of the city’s historical architecture, TRANSRAZ is also integrating information about its inhabitants, organizations, and important events, which are extracted from historical documents semi-automatically. Knowledge Graphs have proven useful and valuable to integrate and enrich these heterogeneous data. However, this task also comes with versatile data modeling challenges. This paper contributes the TRANSRAZ data model, which integrates agents, architectural objects, events, and historical documents into the 3D research environment by means of ontologies. Goal is to explore Nuremberg’s multifaceted past in different time layers in the context of its architectural, social, economical, and cultural developments.

Loading...
Thumbnail Image
Item

Causal Relationship over Knowledge Graphs

2022, Huang, Hao, Al Hasan, Mohammad, Xiong, Li

Causality has been discussed for centuries, and the theory of causal inference over tabular data has been broadly studied and utilized in multiple disciplines. However, only a few works attempt to infer the causality while exploiting the meaning of the data represented in a data structure like knowledge graph. These works offer a glance at the possibilities of causal inference over knowledge graphs, but do not yet consider the metadata, e.g., cardinalities, class subsumption and overlap, and integrity constraints. We propose CareKG, a new formalism to express causal relationships among concepts, i.e., classes and relations, and enable causal queries over knowledge graphs using semantics of metadata. We empirically evaluate the expressiveness of CareKG in a synthetic knowledge graph concerning cardinalities, class subsumption and overlap, integrity constraints. Our initial results indicate that CareKG can represent and measure causal relations with some semantics which are uncovered by state-of-the-art approaches.

Loading...
Thumbnail Image
Item

Responsible Knowledge Management in Energy Data Ecosystems

2022, Janev, Valentina, Vidal, Maria-Esther, Pujić, Dea, Popadić, Dušan, Iglesias, Enrique, Sakor, Ahmad, Čampa, Andrej

This paper analyzes the challenges and requirements of establishing energy data ecosystems (EDEs) as data-driven infrastructures that overcome the limitations of currently fragmented energy applications. It proposes a new data- and knowledge-driven approach for management and processing. This approach aims to extend the analytics services portfolio of various energy stakeholders and achieve two-way flows of electricity and information for optimized generation, distribution, and electricity consumption. The approach is based on semantic technologies to create knowledge-based systems that will aid machines in integrating and processing resources contextually and intelligently. Thus, a paradigm shift in the energy data value chain is proposed towards transparency and the responsible management of data and knowledge exchanged by the various stakeholders of an energy data space. The approach can contribute to innovative energy management and the adoption of new business models in future energy data spaces.

Loading...
Thumbnail Image
Item

Knowledge Graphs - Working Group Charter (NFDI section-metadata) (1.2)

2023, Stocker, Markus, Rossenova, Lozana, Shigapov, Renat, Betancort, Noemi, Dietze, Stefan, Murphy, Bridget, Bölling, Christian, Schubotz, Moritz, Koepler, Oliver

Knowledge Graphs are a key technology for implementing the FAIR principles in data infrastructures by ensuring interoperability for both humans and machines. The Working Group "Knowledge Graphs" in Section "(Meta)data, Terminologies, Provenance" of the German National Research Data Infrastructure (Nationale Forschungsdateninfrastruktur (NFDI) e.V.) aims to promote the use of knowledge graphs in all NFDI consortia, to facilitate cross-domain data interlinking and federation following the FAIR principles, and to contribute to the joint development of tools and technologies that enable transformation of structured and unstructured data into semantically reusable knowledge across different domains.