Search Results

Now showing 1 - 4 of 4
  • Item
    A short guide to increase FAIRness of atmospheric model data
    (Stuttgart : E. Schweizerbart Science Publishers, 2020) Ganske, Anette; Heydebreck, Daniel; Höck, Daniel; Kraft, Angelina; Quaas, Johannes; Kaiser, Amandine
    The generation, processing and analysis of atmospheric model data are expensive, as atmospheric model runs are often computationally intensive and the costs of ‘fast’ disk space are rising. Moreover, atmospheric models are mostly developed by groups of scientists over many years and therefore only few appropriate models exist for specific analyses, e.g. for urban climate. Hence, atmospheric model data should be made available for reuse by scientists, the public sector, companies and other stakeholders. Thereby, this leads to an increasing need for swift, user-friendly adaptation of standards.The FAIR data principles (Findable, Accessible, Interoperable, Reusable) were established to foster the reuse of data. Research data become findable and accessible if they are published in public repositories with general metadata and Persistent Identifiers (PIDs), e.g. DataCite DOIs. The use of PIDs should ensure that describing metadata is persistently available. Nevertheless, PIDs and basic metadata do not guarantee that the data are indeed interoperable and reusable without project-specific knowledge. Additionally, the lack of standardised machine-readable metadata reduces the FAIRness of data. Unfortunately, there are no common standards for non-climate models, e.g. for mesoscale models, available. This paper proposes a concept to improve the FAIRness of archived atmospheric model data. This concept was developed within the AtMoDat project (Atmospheric Model Data). The approach consists of several aspects, each of which is easy to implement: requirements for rich metadata with controlled vocabulary, the landing pages, file formats (netCDF) and the structure within the files. The landing pages are a core element of this concept as they should be human- and machine readable, hold discipline-specific metadata and present metadata on simulation and variable level. This guide is meant to help data producers and curators to prepare data for publication. Furthermore, this guide provides information for the choice of keywords, which supports data reusers in their search for data with search engines. © 2020 The authors
  • Item
    Publication of Atmospheric Model Data using the ATMODAT Standard
    (Stuttgart : E. Schweizerbart Science Publishers, 2022) Ganske, Anette; Heil, Angelika; Lammert, Andrea; Kretzschmar, Jan; Quaas, Johannes
    Scientific data should be published in a way so that other scientists can benefit from these data, enabling further research. The FAIR Data Principles are defining the basic prerequisite for a good data publication: data should be Findable, Accessible, Interoperable, and Reusable. Increasingly, research communities are developing discipline-specific data publication standards under consideration of the FAIR Data Principles. A very comprehensive yet strict data standard has been developed for the climate model output within the Climate Model Intercomparison Project (CMIP), which largely builds upon the Climate and Forecast Metadata Conventions (CF conventions). There are, however, many areas of atmospheric modelling where data cannot be standardised according to the CMIP data standard because, e.g., the data contain specific variables which are not covered by the CMIP standard. Furthermore, fulfilling the strict CMIP data standard for smaller Model Intercomparison Projects (MIPs) requires much effort (in time and manpower) and hence the outcome of these MIPs often remains non-standardised. For innovative model diagnostics, preexisting standards are also not flexible enough. For that reason, the ATMODAT standard, a quality guideline for atmospheric model data, was created. The ATMODAT standard defines a set of requirements that aim at ensuring the high reusability of atmospheric model data publications. The requirements include the use of the netCDF file format, the application of the CF conventions, rich and standardised file metadata, and the publication of the data with a DataCite DOI. Additionally, a tool for checking the conformity of data and metadata to this standard, the atmodat data checker, was developed and is available on GitHub under an open licence. By using the more flexible ATMODAT standard, the publication of standardised datasets is simplified for smaller MIPs. This standardisation process is presented as an example using the data of an aerosol-climate model from the AeroCOM MIP. Furthermore, the landing pages of ATMODAT-compliant data publications can be highlighted with the EASYDAB logo. EASYDAB (Earth System Data Branding) is a newly developed quality label for carefully curated and highly standardised data publications. The ATMODAT data standardisation can easily be transferred to data from other disciplines and contribute to their improved reusability.
  • Item
    AtMoDat: Improving the reusability of ATmospheric MOdel DATa with DataCite DOIs paving the path towards FAIR data
    (München : European Geosciences Union, 2020) Neumann, Daniel; Ganske, Anette; Voss, Vivien; Kraft, Angelina; Höck, Heinke; Peters, Karsten; Quaas, Johannes; Schluenzen, Heinke; Thiemann, Hannes
    The generation of high quality research data is expensive. The FAIR principles were established to foster the reuse of such data for the benefit of the scientific community and beyond. Publishing research data with metadata and DataCite DOIs in public repositories makes them findable and accessible (FA of FAIR). However, DOIs and basic metadata do not guarantee the data are actually reusable without discipline-specific knowledge: if data are saved in proprietary or undocumented file formats, if detailed discipline-specific metadata are missing and if quality information on the data and metadata are not provided. In this contribution, we present ongoing work in the AtMoDat project, -a consortium of atmospheric scientists and infrastructure providers, which aims on improving the reusability of atmospheric model data. Consistent standards are necessary to simplify the reuse of research data. Although standardization of file structure and metadata is well established for some subdomains of the earth system modeling community – e.g. CMIP –, several other subdomains are lacking such standardization. Hence, scientists from the Universities of Hamburg and Leipzig and infrastructure operators cooperate in the AtMoDat project in order to advance standardization for model output files in specific subdomains of the atmospheric modeling community. Starting from the demanding CMIP6 standard, the aim is to establish an easy-to-use standard that is at least compliant with the Climate and Forecast (CF) conventions. In parallel, an existing netCDF file convention checker is extended to check for the new standards. This enhanced checker is designed to support the creation of compliant files and thus lower the hurdle for data producers to comply with the new standard. The transfer of this approach to further sub-disciplines of the earth system modeling community will be supported by a best-practice guide and other documentation. A showcase of a standard for the urban atmospheric modeling community will be presented in this session. The standard is based on CF Conventions and adapts several global attributes and controlled vocabularies from the well-established CMIP6 standard. Additionally, the AtMoDat project aims on introducing a generic quality indicator into the DataCite metadata schema to foster further reuse of data. This quality indicator should require a discipline-specific implementation of a quality standard linked to the indicator. We will present the concept of the generic quality indicator in general and in the context of urban atmospheric modeling data.
  • Item
    ATMODAT Standard v3.0
    (Hamburg : DKRZ, 2020) Gasnke, Anette; Kraft, Angelina; Kaiser, Amandine; Heydebreck, Daniel; Lammert, Andrea; Höck, Heinke; Thiemann, Hannes; Voss, Vivien; Grawe, David; Leitl, Bernd; Schlünzen, K. Heinke; Kretzschmar, Jan; Quaas, Johannes
    Within the AtMoDat project (Atmospheric Model Data), a standard has been developed which is meant for improving the FAIRness of atmospheric model data published in repositories. The ATMODAT standard includes concrete recommendations related to the maturity, publication and enhanced FAIRness of atmospheric model data. The suggestions include requirements for rich metadata with controlled vocabularies, structured landing pages, file formats (netCDF) and the structure within files. Human- and machine readable landing pages are a core element of this standard, and should hold and present discipline-specific metadata on simulation and variable level. This standard is an updated and translated version of "Bericht über initialen Kernstandard und Kurationskriterien des AtMoDat Projektes (v2.4)