Search Results

Now showing 1 - 5 of 5
  • Item
    A PDF Test-Set for Well-Formedness Validation in JHOVE - The Good, the Bad and the Ugly
    (Zenodo, 2017) Lindlar, Michelle; Tunnat, Yvonne; Wilson, Carl
    Digital preservation and active software stewardship are both cyclical processes. While digital preservation strategies have to be reevaluated regularly to ensure that they still meet technological and organizational requirements, software needs to be tested with every new release to ensure that it functions correctly. JHOVE is an open source format validation tool which plays a central role in many digital preservation workflows and the PDF module is one of its most important features. Unlike tools such as Adobe PreFlight or veraPDF which check against requirements at profile level, JHOVE’s PDF-module is the only tool that can validate the syntax and structure of PDF files. Despite JHOVE’s widespread and long-standing adoption, the underlying validation rules are not formally or thoroughly tested, leading to bugs going undetected for a long time. Furthermore, there is no ground-truth data set which can be used to understand and test PDF validation at the structural level. The authors present a corpus of light-weight files designed to test the validation criteria of JHOVE’s PDF module against “well-formedness”. We conclude by measuring the code coverage of the test corpus within JHOVE PDF validation and by feeding detected inconsistencies of the PDF-module back into the open source development process.
  • Item
    Functional access to electronic media collections using emulation-as-a-service
    (2014) Bähr, Thomas; Lindlar, Michelle; Rechert, Klaus; Liebetraut, Thomas
    Over the last 30 years the German National Library of Science and Technology (TIB) accumulated a large collection of various electronic media, such as floppies or CD-ROMs. This poster describes both practical workflows as well as technical infrastructure to provide authentic and interactive access to the TIB’s large electronic media collection.
  • Item
    Building information modeling – A game changer for interoperability and a chance for digital preservation of architectural data?
    (2014) Lindlar, Michelle
    Digital data associated with the architectural design-andconstruction process is an essential resource alongside -and even past- the lifecycle of the construction object it describes. Despite this, digital architectural data remains to be largely neglected in digital preservation research – and vice versa, digital preservation is so far neglected in the design-and-construction process. In the last 5 years, Building Information Modeling (BIM) has seen a growing adoption in the architecture and construction domains, marking a large step towards much needed interoperability. The open standard IFC (Industry Foundation Classes) is one way in which data is exchanged in BIM processes. This paper presents a first digital preservation based look at BIM processes, highlighting the history and adoption of the methods as well as the open file format standard IFC (Industry Foundation Classes) as one way to store and preserve BIM data.
  • Item
    The ties that bind - On the impact of losing a consortium member in a cooperatively operated digital preservation system
    (2016) Lindlar, Michelle
    Cooperatively operated digital preservation systems offer institutions of varying size the chance to actively participate in digital preservation. In current times of budget cuts they are also a valuable asset to larger memory institutions. While the benefits of cooperatively operated systems have been discussed before, the risks associated with a consortial solution have not been analyzed in detail. TIB hosts the Goportis Digital Archive which is used by two large national subject libraries as well as by TIB itself. As the host of this comparatively small preservation network, TIB has started to analyze the particular risk which losing a consortium member poses to the overall system operation. This paper presents the current status of this work-in-progress and highlights two areas: risk factors associated with cost and risk factors associated with the content. While the paper is strictly written from the viewpoint of the consortial leader/ host of this specific network, the underlying processes shall be beneficial to other cooperatively operated digital preservation systems.
  • Item
    The DURAARK project – Long-term preservation of architectural 3D-data
    (2014) Lindlar, Michelle; Saemann, Hedda
    [no abstract available]