Informatik

Permanent URI for this collection: https://oa.tib.eu/renate/handle/123456789/4453

Recent Submissions

Now showing 1 - 20 of 226
  • Digitalisierung in den Gesundheitsberufen
    ([Leverkusen] : Verlag Barbara Budrich, 2024) Weyland, Ulrike; Koschel, Wilhelm; Reiber, Karin; Dorin, Lena; Peters, Miriam; Arndt, Laura; Behr, Dominik; Bergmann, Dana; Buchmann, Ulrike; Ebbighausen, Marc; Engl, Anna-Teresa; Ettl, Katrin; Fathi, Madjid; Fischer, Andreas; Freese, Christiane; Haussmann, Andreas; Hiestand, Stefanie; Hofstetter, Sebastian; Hüttner, Aneli; Jahn, Patrick; Jürgensen, Anke; Kaiser, Sophie; Kaufhold, Marisa; Kismihók, Gábor; Klus, Christina; Kobus, Julia; Köhler, Sonja; Kraft, Bernhard; Makowsky, Katja; Meng, Michael; Michel, Natalie; Nagel, Lisa; Nauerth, Annette; Nerdel, Claudia; Paulicke, Denny; Preißler, Ronja; Rasheed, Hasan A.; Rechl, Friederike; Richter, Katja E.; Richter, Patrick; Schröder, Martina; Schröer, Laura; Schwarz, Karsten; Seltrecht, Astrid; Steindorff, Jenny-Victoria; Stirner, Alexander; Stoevesandt, Dietrich; Völz, Silke; Wagner-Herrbach, Cornelia; Weber, Christian; Wittmann, Eveline; Zepelin, Lyn Anne von; Ziegler, Sven; Zilezinski, Max
    Digital technologies are leading to changed forms of communication, learning, and work. For the health professions, digitalisation brings a wide range of changes and challenges which, viewed positively, can also be understood as opportunities. If the professional groups actively help shape digitalisation processes in the health professions, positive approaches can be developed for the care of people in need of help and nursing care, as well as for the professionalisation of skilled workers and vocational education staff. This volume documents the contributions to the AG-BFN forum „Digitalisierung in den Gesundheitsberufen“ (Digitalisation in the Health Professions), held at the University of Münster in October 2021. It focuses on current developments in digitality in nursing and health-care fields of practice, the professionalisation of education staff, and digitally supported teaching and learning scenarios in the health professions.
  • Redefining nursing skills in AI and robotisation, with a particular focus on conditions requiring long-term care
    (Nyíregyháza : University of Debrecen Faculty of Health Department of Gerontology, 2022) Szőllősi, Anna; Kismihók, Gábor; Keszler, Ádám; Karamánné Pakai, Annamária; Lukács, Miklós; Szatmári, Angelika; Ujváriné Siket, Adrienn
    Owing to the enormous improvements in health and lifestyle over the last century, average life expectancy has increased. Although longevity is an important achievement of the modern age, it poses a challenge for the care of an ageing population. As people in the richest parts of the world live longer, there is a growing shortage of carers for an ageing population. This paper reviews the literature and describes the global challenges of caregiving, future issues in elderly care, the emergence of robotization in the field of nursing care, and how this can contribute to improving the quality of care for older people. It also discusses the experience of using robots in international and domestic elderly care and briefly describes how the use of AI-based technology has contributed to improving the effectiveness of care in the context of the coronavirus epidemic. The paper concludes by presenting a vision and directions for training development for Advanced Practice Nurses, Registered Nurses, post-secondary nurses, and other health care professionals to improve attitudes, enhance knowledge, and develop services to improve elderly care.
  • Enhancing Knowledge Graph Extraction and Validation From Scholarly Publications Using Bibliographic Metadata
    (Lausanne : Frontiers Media, 2021) Turki, Houcemeddine; Hadj Taieb, Mohamed Ali; Ben Aouicha, Mohamed; Fraumann, Grischa; Hauschke, Christian; Heller, Lambert
    [No abstract available]
  • An Artificial Intelligence-Based Tool for Data Analysis and Prognosis in Cancer Patients: Results from the Clarify Study
    (Basel : MDPI, 2022) Torrente, María; Sousa, Pedro A.; Hernández, Roberto; Blanco, Mariola; Calvo, Virginia; Collazo, Ana; Guerreiro, Gracinda R.; Núñez, Beatriz; Pimentao, Joao; Sánchez, Juan Cristóbal; Campos, Manuel; Costabello, Luca; Novacek, Vit; Menasalvas, Ernestina; Vidal, María Esther; Provencio, Mariano
    Background: Artificial intelligence (AI) has contributed substantially in recent years to the resolution of different biomedical problems, including cancer. However, AI tools with significant and widespread impact in oncology remain scarce. The goal of this study is to present an AI-based tool for cancer patient data analysis that assists clinicians in identifying the clinical factors associated with poor prognosis, relapse, and survival, and to develop a prognostic model that stratifies patients by risk. Materials and Methods: We used clinical data from 5275 patients diagnosed with non-small cell lung cancer, breast cancer, and non-Hodgkin lymphoma at Hospital Universitario Puerta de Hierro-Majadahonda. Accessible clinical parameters measured with a wearable device and quality-of-life questionnaire data were also collected. Results: Using an AI tool, data from 5275 cancer patients were analyzed, integrating clinical data, questionnaire data, and data collected from wearable devices. Descriptive analyses were performed to explore the patients’ characteristics, survival probabilities were calculated, and a prognostic model identified low- and high-risk profile patients. Conclusion: Overall, the reconstruction of the population’s risk profile for the cancer-specific predictive model was achieved and proved useful in clinical practice using artificial intelligence. It has potential application in clinical settings to improve risk stratification, early detection, and surveillance management of cancer patients.
  • Traditional Machine Learning Models and Bidirectional Encoder Representations From Transformer (BERT)-Based Automatic Classification of Tweets About Eating Disorders: Algorithm Development and Validation Study
    (Toronto : [Verlag nicht ermittelbar], 2022) Benítez-Andrades, José Alberto; Alija-Pérez, José-Manuel; Vidal, Maria-Esther; Pastor-Vargas, Rafael; García-Ordás, María Teresa
    Background: Eating disorders affect an increasing number of people. Social networks provide information that can help. Objective: We aimed to find machine learning models capable of efficiently categorizing tweets in the eating disorders domain. Methods: We collected tweets related to eating disorders for 3 consecutive months. After preprocessing, a subset of 2000 tweets was labeled along four dimensions: (1) messages written by people suffering from eating disorders or not, (2) messages promoting suffering from eating disorders or not, (3) informative messages or not, and (4) scientific or nonscientific messages. Traditional machine learning and deep learning models were used to classify tweets. We evaluated accuracy, F1 score, and computational time for each model. Results: A total of 1,058,957 tweets related to eating disorders were collected. The bidirectional encoder representations from transformers (BERT)-based models achieved the best scores among the machine learning and deep learning techniques applied to the 4 categorization tasks (F1 scores 71.1%-86.4%). Conclusions: BERT-based models perform better in classifying eating disorder-related tweets, although their computational cost is significantly higher than that of traditional techniques.
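The abstract does not spell out which traditional baselines were used; as one illustration, a multinomial Naive Bayes bag-of-words classifier, a classic "traditional machine learning" baseline for this kind of tweet categorization, can be sketched in a few lines. The training texts and labels below are invented toy examples, not the study's data.

```python
# Multinomial Naive Bayes bag-of-words classifier, a classic traditional
# baseline for tweet categorization. Toy texts and labels are invented.
import math
from collections import Counter, defaultdict

class NaiveBayesText:
    def fit(self, texts, labels):
        self.class_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        self.vocab = set()
        for text, label in zip(texts, labels):
            for word in text.lower().split():
                self.word_counts[label][word] += 1
                self.vocab.add(word)
        return self

    def predict(self, text):
        n_docs = sum(self.class_counts.values())
        best_label, best_lp = None, -math.inf
        for label in self.class_counts:
            # log prior + log likelihood with Laplace (add-one) smoothing
            lp = math.log(self.class_counts[label] / n_docs)
            total = sum(self.word_counts[label].values()) + len(self.vocab)
            for word in text.lower().split():
                lp += math.log((self.word_counts[label][word] + 1) / total)
            if lp > best_lp:
                best_label, best_lp = label, lp
        return best_label

clf = NaiveBayesText().fit(
    ["peer reviewed study on eating disorder treatment",
     "clinical trial results for anorexia therapy",
     "just skip dinner nobody will notice",
     "thin is everything skip meals"],
    ["scientific", "scientific", "nonscientific", "nonscientific"])
```

Such a baseline trains in milliseconds, which is exactly the computational-cost trade-off against BERT-style models that the study quantifies.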
  • Food information engineering
    (Menlo Park, Calif. : AAAI, 2024) Jiomekong, Azanzi; Oelen, Allard; Auer, Sören; Lorenz, Anna-Lena; Vogt, Lars
    Food information engineering relies on statistical and AI techniques (e.g., symbolic, connectionist, and neurosymbolic AI) for collecting, storing, processing, and diffusing food information, and for putting it in a form exploitable by humans and machines. Food information is collected both manually and automatically. Once collected, food information is organized using tabular data representation schemas or symbolic, connectionist, or neurosymbolic AI techniques. Once collected, processed, and stored, food information is diffused to different stakeholders in appropriate formats. Even though neurosymbolic AI has shown promising results in many domains, we found that this approach is rarely used in the domain of food information engineering. This paper aims to serve as a reference for food information engineering researchers. Unlike existing reviews on the subject, we cover all aspects of food information engineering and link the paper to online resources built using the Open Research Knowledge Graph. These resources comprise templates, comparison tables of research contributions, and smart reviews. All these resources are organized in the “Food Information Engineering” observatory and will be continually updated with new research contributions.
  • Final Report for the Emmy Noether Project : ConcSys: Reliable and Efficient Complex, Concurrent Software Systems
    (Hannover : Technische Informationsbibliothek, 2025-06) Pradel, Michael
    The ConcSys project aimed to develop techniques for testing and analyzing complex software systems, with a focus on increasing the correctness and performance of such systems. The project ran from March 2015 until December 2024. In this period, we made significant progress, both in terms of scientific results and in terms of building up a research group. The scientific results include novel techniques for (i) finding and preventing concurrency bugs, (ii) understanding and analyzing software performance, (iii) automated test generation, (iv) program analysis for WebAssembly, and (v) foundations of dynamic analysis. These results are presented in 83 peer-reviewed publications at top-tier conferences and journals in software engineering and programming languages, e.g., ICSE, OOPSLA, PLDI, and FSE. Beyond these scientific results, the project has enabled the PI, Michael Pradel, to build up his own research group, to establish himself as an internationally recognized leader in the field, and to secure a permanent professorship at the University of Stuttgart. The project has directly and indirectly contributed to the careers of 12 doctoral students, of whom seven have been partially funded by the project and six have already graduated.
  • DFG Final Report: "LIVE: Empirical Studies on the Effects of Liveness on Programming"
    (Hannover : Technische Informationsbibliothek, 2025-06-13) Hirschfeld, Robert
    Liveness in programming tools is the impression of changing a program while it is running. Various tools support liveness, including commercial programming systems, such as MS Excel and Jupyter Notebooks. Tool designers assume that liveness improves the programming experience, but this assumption has insufficient and inconclusive empirical backing. This lack of evidence might lead to the promotion of liveness in unsuitable settings and the neglect of important settings, which would waste design and implementation efforts. In this project, we investigated the effects of live tools on debugging. In two controlled experiments we studied the influence of task complexity and delayed interactions on the effects of live tools. Compared to previous experiments on liveness, the participants in our experiments had considerable experience with live tools. In our first experiment we tested whether the influence of live tools on debugging time differs for simple and complex tasks. We found that live tools significantly shorten the time needed to debug defects. At the same time, we could not confirm our main hypothesis that task complexity moderates this effect. However, our results indicate that task complexity indeed influences the effect, but less than suggested by the pilot. For programming tool researchers and designers, our results show that programmers can benefit from live tools, but that they need to consider task complexity and participants' experience with liveness when preparing studies or building tools. With our second experiment, we aimed to better understand the first experiment's observations. Based on Information Foraging Theory, we assumed that live tools reduce the perceived cost of obtaining dynamic information so that programmers consult it more often when helpful. Therefore, we tested whether programmers use live tools less frequently if access to them is delayed. 
The experiment did not yield enough results for a thorough analysis, but the collected data shows no clear decline in live tool usage. Yet, an ongoing post hoc analysis using edit-run cycles suggests that participants' workflows changed. During the first experiment, we found that it is a great challenge to operationalize the complexity of maintenance tasks in programming tool studies. Thus, we conducted a survey to curate a collection of factors from related studies that can help shape the complexity of such tasks. With this collection, researchers can deliberately decide on the complexity level of their studies' tasks. This project also resulted in a novel concept for teaching debugging through contests and in improved setups for related studies on liveness conducted in our group.
  • Final Report on DFG Project "Automatic Transcription of Conversations"
    (Hannover : Technische Informationsbibliothek, 2025) Häb-Umbach, Reinhold; Schlüter, Ralf
    Multi-talker conversational speech recognition is concerned with transcribing meetings recorded with distant microphones. The difficulty of the task can be attributed to three factors. First, the recording conditions are challenging: the speech signal captured by microphones from a distance is noisy and reverberated and often contains nonstationary acoustic distortions, which makes it hard to decode. Second, there is a significant percentage of time with overlapped speech, where multiple speakers talk at the same time. Finally, the interaction dynamics of the scenario are challenging because speakers articulate themselves in an intermittent manner, with alternating segments of speech inactivity, single-, and multi-talker speech. This project was concerned with developing a transcription system that can operate on arbitrarily long input, correctly handles segments of overlapped as well as non-overlapped speech, and transcribes the speech of different speakers consistently into separate output streams. Such a multi-talker Automatic Speech Recognition (ASR) system typically consists of the following three components: a source separation and enhancement block, a diarization stage that attributes segments of input speech to speakers, and an ASR stage; different orders of processing have been proposed, which differ in when diarization is done. While existing approaches employed separately trained subsystems for diarization, separation, and recognition, our research hypothesis was that a joint approach, optimized under a single training objective, should lead to superior solutions compared to the separate optimization of individual components. Such a coherent formulation, however, would not necessarily mean that the three aforementioned tasks had to be carried out in a single, monolithic (probably neural) integrated system.
Indeed, the research carried out showed that it is beneficial to have separate subsystems, albeit with a tight coupling between them. Examples of such systems we developed are:
  • TS-SEP, which carries out diarization and separation/enhancement, with a tight coupling in between.
  • The mixture encoder, which leverages explicit speech separation but also forwards the not-yet-separated speech to the ASR module to mitigate error propagation from the separator to the recognizer.
  • Joint diarization and separation, realized by a statistical mixture model that integrates a mixture model for diarization and one for separation, which share a common hidden state variable.
  • Transcription-supported diarization, which uses sentence- and word-level boundaries from the ASR module to support speaker turn detection.
Furthermore, we developed new approaches to the individual subsystems and shared several tools and data sets with the research community.
  • DFG Final Report for Automatic Fact Checking for Biomedical Information in Social Media and Scientific Literature (FIBISS), project number 667374
    (Hannover : Technische Informationsbibliothek, 2025-04-10) Klinger, Roman; Wührl, Amelie
    Research into methods for the automatic verification of facts, i.e., computational models that can distinguish correct information from misinformation or disinformation, is largely focused on the news domain and on the analysis of posts in social media. Among other things, texts are checked for their truthfulness. This can be done by analyzing linguistic features that suggest an intention to deceive or by comparing them with other sources that make comparable statements in terms of content. Most studies focus on politically relevant areas. The biomedical domain is also an area of particular social relevance. In social media, various actors and medical laypersons share reports on treatment methods, successes and failures, such as the (disproven) method of treating viral infections with deworming agents or disinfectants. There are also reports on (disproven) links between treatments and adverse effects, such as the causation of autism by vaccination. However, the biomedical domain, unlike other areas relevant for automated fact checking, benefits from a large resource of reliable scientific articles. The aim of the FIBISS project was therefore to develop and evaluate methods that can extract biomedical claims in social media and compare them with reliable sources. One challenge here is that social media does not typically use technical language, so different vocabularies have to be combined. The approach in FIBISS was therefore to develop generalizing information extraction methods. In the course of the project, large language models also became prominent as a further methodological approach. The project was therefore adapted to optimize general representations of claims in such a way that they are suitable for comparison using automatic fact-checking procedures. As a result, we contribute text corpora that are used to develop and evaluate automated biomedical fact-checking systems. 
We propose methods that automatically reformulate claims so that they are suitable to be automatically verified. Furthermore, we present approaches that can automatically assess the credibility of claims, even independently of existing evidence.
  • Final Report of the DFG Project "Drawing Graphs: Geometric Aspects Beyond Planarity" (project number 654838)
    (Hannover : Technische Informationsbibliothek, 2025-04) Wolff, Alexander
    The aim of our project was to get a better understanding of the mathematical structures that correspond to the different ways of measuring the visual complexity of a drawing of a graph. Examples of such measures are the local crossing number, that is, the maximum number of crossings per edge; the slope number, that is, the number of different slopes in a crossing-free straight-line drawing; and the segment number or the line cover number, that is, the number of straight-line segments or straight lines needed to cover a crossing-free straight-line drawing. For a graph, the measures are defined as the minimum over all drawings (of the corresponding type). The center of our studies became the segment number, which is known to be NP-hard to compute. In particular, we showed that there is a parameterized algorithm for computing the segment number of a given graph with respect to several parameters: the natural parameter, the line cover number, and the vertex cover number. The latter proof was technically the most challenging. In a different work, we showed that it is ETR-complete to compute the segment number of a given graph; that is, the segment number of a graph can be expressed in terms of the existential theory of the reals, and its computation is at least as hard as every problem in the complexity class ETR. Moreover, we extended a result concerning the segment number of triconnected cubic planar graphs by showing that the segment number of every triconnected 4-regular planar graph with n vertices is at most n + 3, which is tight up to the additive constant. We have proved the first linear universal lower bounds for the segment number of outerpaths, maximal outerplanar graphs, 2-trees, and planar 3-trees. This shows that the existing algorithms for these graph classes are in fact constant-factor approximation algorithms. For maximal outerpaths, our universal lower bound is best possible.
  • Multiscale phenomena: Green's functions, the Dirichlet-to-Neumann formulation, subgrid scale models, bubbles and the origins of stabilized methods
    (Amsterdam [u.a.] : Elsevier Science, 1995) Hughes, Thomas J. R.
    An approach is developed for deriving variational methods capable of representing multiscale phenomena. The ideas are first illustrated on the exterior problem for the Helmholtz equation. This leads to the well-known Dirichlet-to-Neumann formulation. Next, a class of subgrid scale models is developed and the relationships to 'bubble function' methods and stabilized methods are established. It is shown that both the latter methods are approximate subgrid scale models. The identification for stabilized methods leads to an analytical formula for τ, the 'intrinsic time scale', whose origins have been a mystery heretofore. © 1995.
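For the 1D advection-diffusion model problem, this line of analysis yields a closed-form expression for the intrinsic time scale. The formula below is the classical nodally exact choice for linear elements as stated in the standard stabilized-methods literature; it is reproduced here for orientation, not transcribed from this paper, with a the advective velocity, κ the diffusivity, and h the mesh size:

```latex
% Intrinsic time scale \tau for 1D advection-diffusion
% (a: advective velocity, \kappa: diffusivity, h: mesh size)
\tau = \frac{h}{2\,|a|}\left(\coth\alpha - \frac{1}{\alpha}\right),
\qquad
\alpha = \frac{|a| \, h}{2\kappa} \quad \text{(element P\'eclet number)}
```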
  • Implementation of an adaptive BDF2 formula and comparison with the MATLAB Ode15s
    (Amsterdam [u.a.] : Elsevier, 2014) Celaya, E. Alberdi; Aguirrezabala, J. J. Anza; Chatzipantelidis, P.
    After applying the Finite Element Method (FEM) to diffusion-type and wave-type Partial Differential Equations (PDEs), a first-order and a second-order Ordinary Differential Equation (ODE) system are obtained, respectively. These ODE systems usually present high stiffness, so numerical methods with good stability properties are required to solve them. MATLAB offers a set of open-source adaptive-step functions for solving ODEs. One of these functions is ode15s, which is recommended for stiff problems and is based on the Backward Differentiation Formulae (BDF). We describe the error estimation and the step size control implemented in this function. The ode15s is a variable-order algorithm, and even though it has an adaptive step size implementation, the advancing formula and the local error estimation that it uses correspond to the constant step size formula. We have focused on the second-order accurate and unconditionally stable BDF (BDF2), and we have implemented a truly adaptive step size BDF2 algorithm using the same strategy as the BDF2 implemented in ode15s; the new algorithm turns out to be more efficient than the one implemented in MATLAB. © The Authors. Published by Elsevier B.V.
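As background for the abstract, the constant-step BDF2 advancing formula can be sketched on the stiff linear test problem y' = λy; for a linear right-hand side the implicit stage solves in closed form, whereas a general solver would use Newton iteration. This is an illustrative sketch of the underlying formula, not the paper's adaptive algorithm.

```python
# Constant-step BDF2 on the stiff linear test problem y' = lam*y, y(0) = y0.
# Advancing formula: y_{n+1} - (4/3) y_n + (1/3) y_{n-1} = (2/3) h f(t_{n+1}, y_{n+1}).
# Illustrative sketch only; not the adaptive-step algorithm of the paper.
import math

def bdf2_linear(lam, y0, t_end, n_steps):
    h = t_end / n_steps
    # Start-up: BDF2 is a two-step method, so take the first step
    # with backward Euler (first order, A-stable).
    y_prev = y0
    y_curr = y_prev / (1.0 - h * lam)
    for _ in range(n_steps - 1):
        # For f(t, y) = lam*y the implicit stage is solvable in closed form.
        y_next = ((4.0 * y_curr - y_prev) / 3.0) / (1.0 - (2.0 / 3.0) * h * lam)
        y_prev, y_curr = y_curr, y_next
    return y_curr
```

Refining the step halves should shrink the error roughly by four, consistent with the method's second-order accuracy.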
  • The finite volume-complete flux scheme for advection-diffusion-reaction equations
    (New York, NY [u.a.] : Springer Science + Business Media B.V., 2010) ten Thije Boonkkamp, J. H. M.; Anthonissen, M. J. H.
    We present a new finite volume scheme for the advection-diffusion-reaction equation. The scheme is second order accurate in the grid size, both for dominant diffusion and dominant advection, and has only a three-point coupling in each spatial direction. Our scheme is based on a new integral representation for the flux of the one-dimensional advection-diffusion-reaction equation, which is derived from the solution of a local boundary value problem for the entire equation, including the source term. The flux therefore consists of two parts, corresponding to the homogeneous and particular solution of the boundary value problem. Applying suitable quadrature rules to the integral representation gives the complete flux scheme. Extensions of the complete flux scheme to two-dimensional and time-dependent problems are derived, containing the cross flux term or the time derivative in the inhomogeneous flux, respectively. The resulting finite volume-complete flux scheme is validated for several test problems. © 2010 The Author(s).
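For orientation, the kind of three-point-coupled discretization the abstract refers to can be illustrated on the steady 1D problem a u' − ε u'' = 0, u(0)=0, u(1)=1. The sketch below is the plain central scheme that complete-flux-type schemes are typically compared against, not the complete flux scheme itself; the tridiagonal system is solved with the Thomas algorithm.

```python
# Three-point central discretization of steady 1D advection-diffusion
#   a u' - eps u'' = 0 on (0,1), u(0) = 0, u(1) = 1,
# solved via the Thomas algorithm. This is the standard central scheme,
# NOT the complete flux scheme of the paper.
import math

def solve_advection_diffusion(a, eps, n):
    h = 1.0 / n
    # Interior unknowns u_1..u_{n-1}; tridiagonal coefficients from
    # (a/2h)(u_{i+1}-u_{i-1}) - (eps/h^2)(u_{i+1}-2u_i+u_{i-1}) = 0.
    lo = [-eps / h**2 - a / (2 * h)] * (n - 1)   # sub-diagonal
    di = [2 * eps / h**2] * (n - 1)              # diagonal
    up = [-eps / h**2 + a / (2 * h)] * (n - 1)   # super-diagonal
    rhs = [0.0] * (n - 1)
    rhs[-1] -= up[-1] * 1.0                      # Dirichlet condition u(1) = 1
    # Thomas algorithm: forward elimination, then back substitution.
    for i in range(1, n - 1):
        w = lo[i] / di[i - 1]
        di[i] -= w * up[i - 1]
        rhs[i] -= w * rhs[i - 1]
    u = [0.0] * (n - 1)
    u[-1] = rhs[-1] / di[-1]
    for i in range(n - 3, -1, -1):
        u[i] = (rhs[i] - up[i] * u[i + 1]) / di[i]
    return [0.0] + u + [1.0]

def exact(a, eps, x):
    # Exact solution (exp(a x / eps) - 1) / (exp(a / eps) - 1), via expm1.
    return math.expm1(a * x / eps) / math.expm1(a / eps)
```

On a mesh with small mesh Péclet number this central scheme is second-order accurate; the paper's complete flux scheme retains second-order accuracy also in the advection-dominated regime, which the plain central scheme does not.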
  • The systems biology format converter
    (London : BioMed Central, 2016) Rodriguez, Nicolas; Pettit, Jean-Baptiste; Dalle Pezze, Piero; Li, Lu; Henry, Arnaud; van Iersel, Martijn P.; Jalowicki, Gael; Kutmon, Martina; Natarajan, Kedar N.; Tolnay, David; Stefan, Melanie I.; Evelo, Chris T.; Le Novère, Nicolas
    Background: Interoperability between formats is a recurring problem in systems biology research. Many tools have been developed to convert computational models from one format to another. However, they have been developed independently, resulting in redundancy of efforts and lack of synergy. Results: Here we present the System Biology Format Converter (SBFC), which provides a generic framework to potentially convert any format into another. The framework currently includes several converters translating between the following formats: SBML, BioPAX, SBGN-ML, Matlab, Octave, XPP, GPML, Dot, MDL and APM. This software is written in Java and can be used as a standalone executable or web service. Conclusions: The SBFC framework is an evolving software project. Existing converters can be used and improved, and new converters can be easily added, making SBFC useful to both modellers and developers. The source code and documentation of the framework are freely available from the project web site.
  • The Design of a Python Library for the Automatic Definition and Simulation of Transient Ionization Fronts
    (New York, NY : IEEE, 2023) Wong, Timothy; Timoshkin, Igor; MacGregor, Scott; Wilson, Mark; Given, Martin
    In recent years, the interest in nonthermal plasma dynamics has grown significantly, within both industry and research. This has been driven by the development of several novel cold plasma technologies across a wide range of different fields, for example, for plasma medicine, chemical processing, pollution control, and surface treatment. The optimization of these technologies relies heavily upon the understanding of gas discharge plasmas: their generation, electrical characteristics, and interaction with their surroundings. Moreover, the manifestation of nonthermal plasmas in the form of streamers is of high relevance and critical importance to high voltage insulation technology, and has further significance to geophysical research concerning atmospheric discharges. The present work describes the development of the StrAFE (Streamers on Adaptive Finite Elements) package, a dedicated Python library built atop the popular open-source FEniCS finite element software, designed with the objective to simplify and to automate the solution of ionization front models. The library features support for mesh adaptivity, distributed memory parallelism, and an intuitive programming interface, while providing an exceptionally high level of user configurability. This article presents the software implementation, describes its features, and presents several code verification studies performed within simple and complex domains. It is concluded that the numerical results gained from this open-source framework are comparable to other well-established software in terms of accuracy. Therefore, it further demonstrates the great potential for open-source software to make significant contributions to technologies involving nonthermal plasmas, ionization fronts, and gas discharges.
  • Tutorial applications for Verification, Validation and Uncertainty Quantification using VECMA toolkit
    (Amsterdam [u.a.] : Elsevier, 2021) Suleimenova, Diana; Arabnejad, Hamid; Edeling, Wouter N.; Coster, David; Luk, Onnie O.; Lakhlili, Jalal; Jancauskas, Vytautas; Kulczewski, Michal; Veen, Lourens; Ye, Dongwei; Zun, Pavel; Krzhizhanovskaya, Valeria; Hoekstra, Alfons; Crommelin, Daan; Coveney, Peter V.; Groen, Derek
    The VECMA toolkit enables automated Verification, Validation and Uncertainty Quantification (VVUQ) for complex applications that can be deployed on emerging exascale platforms and provides support for software applications from any domain of interest. The toolkit has four main components: EasyVVUQ for VVUQ workflows, FabSim3 for automation and tool integration, MUSCLE3 for coupling multiscale models, and the QCG tools to execute application workflows on high performance computing (HPC). A more recent addition to the VECMAtk is EasySurrogate for various types of surrogate methods. In this paper, we present five tutorials from different application domains that apply these VECMAtk components to perform uncertainty quantification analysis, use surrogate models, couple multiscale models, and execute sensitivity analysis on HPC. The paper aims to give practitioners hands-on experience that they can test and contrast with their own applications.
  • CauseKG: A Framework Enhancing Causal Inference With Implicit Knowledge Deduced From Knowledge Graphs
    (New York, NY : IEEE, 2024) Huang, Hao; Vidal, Maria-Esther
    Causal inference is a critical technique for inferring causal relationships from data and distinguishing causation from correlation. Causal inference frameworks rely on structured data, typically represented in flat tables or relational models. These frameworks estimate causal effects based only on explicit facts, overlooking implicit information in the data, which can lead to inaccurate causal estimates. Knowledge graphs (KGs) inherently capture implicit information through logical rules applied to explicit facts, providing a unique opportunity to leverage implicit knowledge. However, existing frameworks are not applicable to KGs due to their semi-structured nature. CauseKG is a causal inference framework designed to address the intricacies of KGs and seamlessly integrate implicit information using KG-specific entailment techniques, providing a more accurate causal inference process. We empirically evaluate the effectiveness of CauseKG against benchmarks constructed from synthetic and real-world datasets. The results suggest that CauseKG can produce a lower mean absolute error in causal inference compared to state-of-the-art methods. The empirical results demonstrate CauseKG's ability to address causal questions in a variety of domains. This research highlights the importance of extending causal inference techniques to KGs, emphasising the improved accuracy that can be achieved by integrating implicit and explicit information.
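The abstract does not disclose CauseKG's internals, but the causal-effect estimation it builds on rests on the standard adjustment ("backdoor") formula over a confounder. The sketch below illustrates that formula on invented weighted records (confounder z, treatment t, outcome y); all variable names and numbers are hypothetical.

```python
# Adjustment ("backdoor") formula on toy weighted records (z, t, y, weight):
#   ATE = sum_z P(z) * [P(y=1 | t=1, z) - P(y=1 | t=0, z)].
# Data below is invented so that the true treatment effect is 0.3 while the
# naive observed difference is confounded.
from collections import defaultdict

def adjusted_ate(records):
    w_z = defaultdict(float)     # total weight per confounder value
    w_zt = defaultdict(float)    # weight per (z, t)
    w_zty = defaultdict(float)   # weight per (z, t) with y = 1
    total = 0.0
    for z, t, y, w in records:
        total += w
        w_z[z] += w
        w_zt[(z, t)] += w
        if y == 1:
            w_zty[(z, t)] += w
    ate = 0.0
    for z, wz in w_z.items():
        p1 = w_zty[(z, 1)] / w_zt[(z, 1)]
        p0 = w_zty[(z, 0)] / w_zt[(z, 0)]
        ate += (wz / total) * (p1 - p0)
    return ate

def naive_diff(records):
    # Unadjusted P(y=1 | t=1) - P(y=1 | t=0): biased under confounding.
    w_t, w_ty = defaultdict(float), defaultdict(float)
    for z, t, y, w in records:
        w_t[t] += w
        if y == 1:
            w_ty[t] += w
    return w_ty[1] / w_t[1] - w_ty[0] / w_t[0]

# Build records from an invented joint distribution:
# P(z=1)=0.5, P(t=1|z)=0.8 if z else 0.2, P(y=1|t,z)=0.1+0.3*t+0.4*z.
records = []
for z, pz in ((0, 0.5), (1, 0.5)):
    for t in (0, 1):
        pt = (0.8 if z == 1 else 0.2) if t == 1 else (0.2 if z == 1 else 0.8)
        py = 0.1 + 0.3 * t + 0.4 * z
        records.append((z, t, 1, pz * pt * py))
        records.append((z, t, 0, pz * pt * (1 - py)))
```

In a KG setting, the point of CauseKG is that some of the (z, t, y) facts are only implicit and must first be materialized via entailment before such an estimator can use them.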
  • Robust Fusion of Time Series and Image Data for Improved Multimodal Clinical Prediction
    (New York, NY : IEEE, 2024) Rasekh, Ali; Heidari, Reza; Hosein Haji Mohammad Rezaie, Amir; Sharifi Sedeh, Parsa; Ahmadi, Zahra; Mitra, Prasenjit; Nejdl, Wolfgang
    With the increasing availability of diverse data types, particularly images and time series data from medical experiments, there is a growing demand for techniques designed to combine various modalities of data effectively. Our motivation comes from the important areas of predicting mortality and phenotyping where using different modalities of data could significantly improve our ability to predict. To tackle this challenge, we introduce a new method that uses two separate encoders, one for each type of data, allowing the model to understand complex patterns in both visual and time-based information. Apart from the technical challenges, our goal is to make the predictive model more robust in noisy conditions and perform better than current methods. We also deal with imbalanced datasets and use an uncertainty loss function, yielding improved results while simultaneously providing a principled means of modeling uncertainty. Additionally, we include attention mechanisms to fuse different modalities, allowing the model to focus on what's important for each task. We tested our approach using the comprehensive multimodal MIMIC dataset, combining MIMIC-IV and MIMIC-CXR datasets. Our experiments show that our method is effective in improving multimodal deep learning for clinical applications. The code for this work is publicly available at: https://github.com/AliRasekh/TSImageFusion.
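The attention-based fusion of two modality encoders described above can be sketched at its simplest: weight each modality embedding by a softmax over a relevance score and sum. The dimensions and scoring scheme below are illustrative assumptions, not the paper's actual architecture.

```python
# Minimal attention-style fusion of two modality embeddings (e.g. the outputs
# of a time-series encoder and an image encoder). Dimensions and scores are
# illustrative; a real model would learn the scores from the task.
import math

def softmax(xs):
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def attention_fuse(embeddings, scores):
    """Weight each modality embedding by a softmax over its relevance score
    and sum, so the model can focus on the modality that matters per task."""
    weights = softmax(scores)
    dim = len(embeddings[0])
    fused = [0.0] * dim
    for w, emb in zip(weights, embeddings):
        for i, v in enumerate(emb):
            fused[i] += w * v
    return fused, weights
```

With a higher score on the time-series modality, the fused vector leans toward the time-series embedding; this per-task reweighting is what lets such models stay robust when one modality is noisy.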
  • Organizing Scientific Knowledge from Engineering Sciences Using the Open Research Knowledge Graph: The Tailored Forming Process Chain Use Case
    (Paris : CODATA, 2024) Karras, Oliver; Budde, Laura; Merkel, Paulina; Hermsdorf, Jörg; Stonis, Malte; Overmeyer, Ludger; Behrens, Bernd-Arno; Auer, Sören
    Background: Engineering sciences are essential for addressing contemporary technical, environmental, and economic challenges. Despite its data-intensive and interdisciplinary nature, the organization of Findable, Accessible, Interoperable, and Reusable (FAIR) scientific knowledge and data in this research field remains understudied. Engineers need infrastructures with services that support them in organizing FAIR scientific knowledge and data for communication and (re-)use. Aim: We explore the use of the Open Research Knowledge Graph (ORKG) as such an infrastructure by demonstrating how engineers can utilize the ORKG in innovative ways for communication and (re-)use. Method: For a use case from the Collaborative Research Center 1153 “Tailored Forming”, we collect, extract, and analyze scientific knowledge on 10 Tailored Forming Process Chains (TFPCs) from five publications in the ORKG. In particular, we semantically describe the TFPCs, inter alia regarding their steps, manufacturing methods, measurements, and results. The usefulness of the data extraction topics, their organization, and the relevance of the knowledge described are examined in a consultation with 21 experts. Results: Based on the described knowledge, we build and publish an ORKG comparison as a detailed overview for communication. Furthermore, we (re-)use the knowledge and answer eight competency questions asked by two domain experts. The validation shows a clear agreement of the 21 experts regarding the examined usefulness and relevance. Conclusions: Our use case shows that the ORKG as a ready-to-use infrastructure with services supports researchers, including engineers, in sustainably organizing FAIR scientific knowledge. The direct use of the ORKG by engineers is feasible, so the ORKG is a promising infrastructure for innovative ways of communicating and (re-)using FAIR scientific knowledge in engineering sciences, thus advancing this research field.