Recent Submissions
- Item: Final Report on DFG Project "Automatic Transcription of Conversations" (Hannover : Technische Informationsbibliothek, 2025). Häb-Umbach, Reinhold; Schlüter, Ralf.
  Multi-talker conversational speech recognition is concerned with transcribing meetings recorded with distant microphones. The difficulty of the task can be attributed to three factors. First, the recording conditions are challenging: the speech signal captured by microphones from a distance is noisy and reverberant and often contains nonstationary acoustic distortions, which makes it hard to decode. Second, a significant percentage of the time contains overlapped speech, where multiple speakers talk at the same time. Finally, the interaction dynamics of the scenario are challenging, because speakers articulate themselves in an intermittent manner, with alternating segments of speech inactivity, single-talker, and multi-talker speech. This project was concerned with developing a transcription system that can operate on arbitrarily long input, correctly handles segments of overlapped as well as non-overlapped speech, and transcribes the speech of different speakers consistently into separate output streams. Such a multi-talker Automatic Speech Recognition (ASR) system typically consists of three components: a source separation and enhancement block, a diarization stage that attributes segments of input speech to speakers, and an ASR stage (a schematic sketch of such a pipeline follows this entry). Different orders of processing have been proposed; they differ in when diarization is performed. While existing approaches employed separately trained subsystems for diarization, separation, and recognition, our research hypothesis was that a joint approach, optimized under a single training objective, should lead to superior solutions compared to the separate optimization of individual components. Such a coherent formulation, however, does not necessarily mean that the three aforementioned tasks have to be carried out in a single, monolithic (probably neural) integrated system. Indeed, the research carried out showed that it is beneficial to have separate subsystems, albeit with a tight coupling between them. Examples of such systems we developed are:
  • TS-SEP, which carries out diarization and separation/enhancement, with a tight coupling in between.
  • Mixture encoder, which leverages explicit speech separation, but also forwards the not-yet-separated speech to the ASR module to mitigate error propagation from the separator to the recognizer.
  • Joint diarization and separation, realized by a statistical mixture model that integrates a mixture model for diarization and one for separation, which share a common hidden state variable.
  • Transcription-supported diarization, which uses sentence- and word-level boundaries from the ASR module to support speaker turn detection.
  Furthermore, we developed new approaches to the individual subsystems and shared several tools and data sets with the research community.
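  The separation → diarization → ASR decomposition described above can be pictured as a small dataflow. The following is a minimal sketch with placeholder components; all function names and the energy-based activity heuristic are hypothetical stand-ins, not the project's actual TS-SEP or mixture-encoder models:

  ```python
  import numpy as np

  def separate(mixture, num_spk=2):
      """Placeholder separation: returns num_spk copies of the scaled
      mixture. A real system (e.g. TS-SEP) would estimate per-speaker
      masks with a neural network."""
      return [mixture / num_spk for _ in range(num_spk)]

  def diarize(stream, frame_len=400):
      """Placeholder diarization: a frame counts as speech-active when
      its energy exceeds a fixed threshold."""
      n = len(stream) // frame_len * frame_len
      frames = stream[:n].reshape(-1, frame_len)
      return (frames ** 2).mean(axis=1) > 1e-4

  def recognize(stream, mixture):
      """Placeholder recognizer. A mixture-encoder style model would
      encode BOTH the separated stream and the raw mixture, so that
      separation errors do not propagate unchecked."""
      return f"<transcript, {len(stream)} samples>"

  def transcribe_meeting(mixture):
      """One output stream per speaker: separate, then diarize and
      recognize each stream (other processing orders are possible)."""
      streams = separate(mixture)
      return [(int(diarize(s).sum()), recognize(s, mixture)) for s in streams]

  print(transcribe_meeting(np.random.randn(16000)))
  ```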
- Item: DFG Final Report for Automatic Fact Checking for Biomedical Information in Social Media and Scientific Literature (FIBISS), project number 667374 (Hannover : Technische Informationsbibliothek, 2025-04-10). Klinger, Roman; Wührl, Amelie.
  Research into methods for the automatic verification of facts, i.e., computational models that can distinguish correct information from misinformation or disinformation, is largely focused on the news domain and on the analysis of posts in social media. Among other things, texts are checked for their truthfulness. This can be done by analyzing linguistic features that suggest an intention to deceive, or by comparing the texts with other sources that make comparable statements. Most studies focus on politically relevant areas. The biomedical domain is also an area of particular social relevance. In social media, various actors and medical laypersons share reports on treatment methods, successes, and failures, such as the (disproven) method of treating viral infections with deworming agents or disinfectants. There are also reports on (disproven) links between treatments and adverse effects, such as the causation of autism by vaccination. However, unlike other areas relevant for automated fact checking, the biomedical domain benefits from a large resource of reliable scientific articles. The aim of the FIBISS project was therefore to develop and evaluate methods that can extract biomedical claims from social media and compare them with reliable sources. One challenge here is that claims in social media are typically not phrased in technical language, so different vocabularies have to be bridged. The approach in FIBISS was therefore to develop generalizing information extraction methods. In the course of the project, large language models also became prominent as a further methodological approach. The project was therefore adapted to optimize general representations of claims in such a way that they are suitable for comparison by automatic fact-checking procedures. As a result, we contribute text corpora that are used to develop and evaluate automated biomedical fact-checking systems. We propose methods that automatically reformulate claims so that they are suitable for automatic verification. Furthermore, we present approaches that can automatically assess the credibility of claims, even independently of existing evidence.
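  The retrieve-and-compare step at the core of such a system can be pictured with a toy example. The sketch below matches lay claims against scientific evidence using TF-IDF cosine similarity; the claim and evidence strings are invented, and FIBISS itself worked with learned, generalizing claim representations rather than TF-IDF:

  ```python
  from sklearn.feature_extraction.text import TfidfVectorizer
  from sklearn.metrics.pairwise import cosine_similarity

  # Invented toy data: lay claims vs. technical evidence statements.
  claims = ["Deworming agents cure viral infections.",
            "Vaccines cause autism."]
  evidence = ["Randomized trials found no benefit of antiparasitic drugs "
              "against viral infection.",
              "Large cohort studies show no association between vaccination "
              "and autism."]

  # Fit one vocabulary over both collections, then score all pairs.
  vec = TfidfVectorizer().fit(claims + evidence)
  sims = cosine_similarity(vec.transform(claims), vec.transform(evidence))

  for claim, row in zip(claims, sims):
      best = row.argmax()  # retrieve the closest evidence sentence
      print(f"{claim!r} -> {evidence[best]!r} (sim={row[best]:.2f})")
  ```

  Even this toy example shows the vocabulary gap the project addresses: surface mismatches such as "infections" vs. "infection" or "vaccines" vs. "vaccination" depress the similarity scores, which is why generalizing representations are needed.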
- Item: Final Report of the DFG Project "Drawing Graphs: Geometric Aspects Beyond Planarity" (project number 654838) (Hannover : Technische Informationsbibliothek, 2025-04). Wolff, Alexander.
  The aim of our project was to get a better understanding of the mathematical structures that correspond to the different ways of measuring the visual complexity of a drawing of a graph. Examples of such measures are the local crossing number, that is, the maximum number of crossings per edge; the slope number, that is, the number of different slopes in a crossing-free straight-line drawing; and the segment number and the line cover number, that is, the number of straight-line segments or straight lines needed to cover a crossing-free straight-line drawing. For a graph, each measure is defined as the minimum over all drawings (of the corresponding type). The center of our studies became the segment number, which is known to be NP-hard to compute. In particular, we showed that there is a parameterized algorithm for computing the segment number of a given graph with respect to several parameters: the natural parameter, the line cover number, and the vertex cover number. The proof for the last of these was technically the most challenging. In a different work, we showed that computing the segment number of a given graph is ETR-complete, that is, the problem can be expressed in the existential theory of the reals, and it is at least as hard as every problem in the complexity class ETR. Moreover, we extended a result concerning the segment number of triconnected cubic planar graphs by showing that the segment number of every triconnected 4-regular planar graph with n vertices is at most n + 3, which is tight up to the additive constant. We proved the first linear universal lower bounds for the segment number of outerpaths, maximal outerplanar graphs, 2-trees, and planar 3-trees. This shows that the existing algorithms for these graph classes are in fact constant-factor approximation algorithms. For maximal outerpaths, our universal lower bound is best possible.
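  The segment number minimizes, over all crossing-free straight-line drawings of a graph, the number of segments needed to cover the drawing; the hardness lies in the minimization over drawings. For a single fixed drawing, counting its segments is straightforward: merge incident edges that continue through a shared vertex in a straight line. The helper below is a hypothetical illustration of the measure (it assumes at most one collinear edge pair per line through a vertex), not one of the project's algorithms:

  ```python
  from itertools import combinations

  def count_segments(pos, edges):
      """Count the segments of a fixed straight-line drawing.
      pos maps vertices to (x, y); edges is a list of vertex pairs.
      Edges sharing an endpoint v are merged into one segment when
      they continue through v in a straight line."""
      parent = list(range(len(edges)))

      def find(i):                       # union-find over edges
          while parent[i] != i:
              parent[i] = parent[parent[i]]
              i = parent[i]
          return i

      for i, j in combinations(range(len(edges)), 2):
          shared = set(edges[i]) & set(edges[j])
          if len(shared) != 1:
              continue
          v = next(iter(shared))
          (a,) = set(edges[i]) - {v}
          (b,) = set(edges[j]) - {v}
          d1 = (pos[a][0] - pos[v][0], pos[a][1] - pos[v][1])
          d2 = (pos[b][0] - pos[v][0], pos[b][1] - pos[v][1])
          cross = d1[0] * d2[1] - d1[1] * d2[0]
          dot = d1[0] * d2[0] + d1[1] * d2[1]
          if cross == 0 and dot < 0:     # straight continuation at v
              parent[find(i)] = find(j)

      return len({find(i) for i in range(len(edges))})

  # A path on three collinear vertices needs one segment; bending
  # the middle vertex forces two:
  print(count_segments({0: (0, 0), 1: (1, 0), 2: (2, 0)}, [(0, 1), (1, 2)]))  # 1
  print(count_segments({0: (0, 0), 1: (1, 0), 2: (1, 1)}, [(0, 1), (1, 2)]))  # 2
  ```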
- Item: Multiscale phenomena: Green's functions, the Dirichlet-to-Neumann formulation, subgrid scale models, bubbles and the origins of stabilized methods (Amsterdam [u.a.] : Elsevier Science, 1995). Hughes, Thomas J. R.
  An approach is developed for deriving variational methods capable of representing multiscale phenomena. The ideas are first illustrated on the exterior problem for the Helmholtz equation. This leads to the well-known Dirichlet-to-Neumann formulation. Next, a class of subgrid scale models is developed and the relationships to 'bubble function' methods and stabilized methods are established. It is shown that both the latter methods are approximate subgrid scale models. The identification for stabilized methods leads to an analytical formula for τ, the 'intrinsic time scale', whose origins have been a mystery heretofore. © 1995.
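  The subgrid scale construction can be summarized in two formulas; the notation here is a sketch of the standard variational multiscale statement, not quoted verbatim from the paper. The solution is split into a resolved part ū and a fine-scale part u′, the fine scales are represented via a fine-scale Green's function g′, and stabilized methods replace this nonlocal representation by an algebraic one:

  ```latex
  u = \bar{u} + u', \qquad
  u'(x) = -\int_{\Omega} g'(x,y)\,\bigl(\mathcal{L}\bar{u} - f\bigr)(y)\,\mathrm{d}\Omega_y,
  \qquad
  u' \approx -\tau\,\bigl(\mathcal{L}\bar{u} - f\bigr).
  ```

  Comparing the exact and the approximate expression for u′ identifies τ as a local surrogate for the fine-scale Green's function (formally, g′(x, y) ≈ τ δ(x − y)), which is the sense in which an analytical formula for τ emerges.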
- Item: Implementation of an adaptive BDF2 formula and comparison with the MATLAB Ode15s (Amsterdam [u.a.] : Elsevier, 2014). Celaya, E. Alberdi; Aguirrezabala, J. J. Anza; Chatzipantelidis, P.
  After applying the Finite Element Method (FEM) to diffusion-type and wave-type Partial Differential Equations (PDEs), a first-order and a second-order Ordinary Differential Equation (ODE) system are obtained, respectively. These ODE systems usually exhibit high stiffness, so numerical methods with good stability properties are required for their solution. MATLAB offers a set of open-source adaptive-step functions for solving ODEs. One of these functions is ode15s, which is recommended for stiff problems and is based on the Backward Differentiation Formulae (BDF). We describe the error estimation and the step-size control implemented in this function. The ode15s is a variable-order algorithm, and even though it has an adaptive step-size implementation, the advancing formula and the local error estimation that it uses correspond to the constant-step-size formula. We have focused on the second-order accurate and unconditionally stable BDF (BDF2), and we have implemented a truly adaptive step-size BDF2 algorithm using the same strategy as the BDF2 implemented in ode15s; the new algorithm turned out to be more efficient than the one implemented in MATLAB. © The Authors. Published by Elsevier B.V.
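  The distinction the abstract draws can be made concrete. With step ratio r = h_new/h_old, the genuinely variable-step BDF2 advancing formula is (1+2r)/(1+r)·y_{n+1} − (1+r)·y_n + r²/(1+r)·y_{n−1} = h_new·f(t_{n+1}, y_{n+1}), which reduces to the familiar constant-step formula (3/2)y_{n+1} − 2y_n + (1/2)y_{n−1} = h·f_{n+1} for r = 1. The sketch below applies it to the linear test equation y′ = λy; the error estimator and step-size controller are simplified stand-ins, not the authors' implementation:

  ```python
  import numpy as np

  def bdf2_variable(lam, y0, t_end, h0=1e-3, rtol=1e-6):
      """Variable-step BDF2 for the stiff test equation y' = lam * y."""
      t, y = [0.0, h0], [y0, y0 / (1.0 - h0 * lam)]  # bootstrap: one BE step
      h = h0
      while t[-1] < t_end:
          h = min(h, t_end - t[-1])
          r = h / (t[-1] - t[-2])          # step ratio h_new / h_old
          a = (1 + 2 * r) / (1 + r)
          b, c = 1 + r, r * r / (1 + r)
          # f = lam*y makes the implicit equation linear -> solve directly.
          y_new = (b * y[-1] - c * y[-2]) / (a - h * lam)
          # Crude error estimate: distance to the backward Euler solution
          # (a simplified stand-in for the estimator analysed in the paper).
          err = abs(y_new - y[-1] / (1.0 - h * lam))
          tol = rtol * max(abs(y_new), 1.0)
          if err <= tol:                   # accept the step
              t.append(t[-1] + h)
              y.append(y_new)
          # Elementary controller with growth/shrink limits.
          h *= min(2.0, max(0.2, 0.9 * (tol / max(err, 1e-16)) ** 0.5))
      return np.array(t), np.array(y)

  t, y = bdf2_variable(lam=-1000.0, y0=1.0, t_end=1.0)
  print(f"{len(t)} steps, y(1) = {y[-1]:.3e}")  # exact: exp(-1000) ~ 0
  ```

  Note that the coefficients a, b, c are recomputed from the ratio r at every step; an implementation in the style the abstract attributes to ode15s would instead keep the constant-step coefficients 3/2, 2, 1/2 even when h changes.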