Harmonising, Harvesting, and Searching Metadata Across a Repository Federation

Loading...
Thumbnail Image
Date
2023
Volume
1
Issue
Journal
Proceedings of the Conference on Research Data Infrastructure
Series Titel
Book Title
Publisher
Hannover : TIB Open Publishing
Link to publishers version
Abstract

The collection of metadata for research data is an important aspect in the FAIR principles. The schema.org and Bioschemas initiatives created a vocabulary to embed markup for many different types, including BioChemEntity, ChemicalSubstance, Gene, MolecularEntity, Protein, and others relevant in the Natural and Life Sciences with immediate benefits for findability of data packages. To bridge the gap between the worlds of semantic-web-driven JSON+LD metadata on the one hand, and established but separately developed interface services in libraries, we have designed an architecture for harmonising, federating and harvesting metadata from several resources. Our approach is to serve JSON+LD embedded in an XML container through a central OAI-Provider. Several resources in NFDI4Chem provide such domain-specific metadata. The CKAN-based NFDI4Chem search service can harvest this metadata using an OAI-PMH harvester extension that can extract the XML-encapsulated JSON+LD metadata, and has search capabilities relevant in the chemistry domain. We invite the community to collaborate and reach a critical mass of providers and consumers in the NFDI.

Description
Keywords
Metadata; Structured Markup; JSON LD; schema.org; Bioschemas; OAI-PMH; Harvesting
Citation
Neumann, S., Bach, F., Castro, L. J., Fischer, T., Hofmann, S., Huang, P.-C., et al. (2023). Harmonising, Harvesting, and Searching Metadata Across a Repository Federation. 1. https://doi.org//10.52825/CoRDI.v1i.202
Collections
License
CC BY 4.0 Unported