Publications
“VANILLA: Validated knowledge graph completion—A Normalization-based framework for Integrity, Link prediction, and Logical Accuracy”
September 2025
Knowledge-Based Systems, 325, Article 113939
CAIMed Groups:
AI & Active Agents
Semantic Models
Knowledge graphs (KGs) are expressive data structures for integrating and describing heterogeneous data by unifying factual information and domain knowledge. However, under the Open World Assumption (OWA), the absence of facts does not imply falsity—only incompleteness. Inductive learning methods, particularly numerical techniques such as Knowledge Graph Embeddings (KGEs) and Graph Neural Networks (GNNs), are widely used for link prediction and classification tasks in KGs. These models excel at capturing latent patterns and exploiting structural properties at scale. Nevertheless, their performance can be significantly degraded by anomalies in KG representations—semantic inconsistencies and modeling artifacts that arise from unconstrained data integration. Such anomalies obscure the intended meaning of relations, introduce noise, and mislead numerical learning models. To address this issue, we introduce a normalization theory for KGs that enforces semantic consistency through normal forms. These forms restructure KGs to eliminate representational anomalies, ensuring that the data adheres to well-defined semantic constraints. We present VANILLA, a neuro-symbolic framework that combines symbolic rule learning, numerical inductive models, and constraint-based validation. By aligning inductive predictions with normalized, ontology-aware KG structures, VANILLA enables accurate and semantically grounded KG completion. Experimental results show that our approach significantly improves predictive performance while maintaining semantic integrity, demonstrating the value of normalization in hybrid KG learning systems. VANILLA is publicly available on GitHub https://github.com/SDM-TIB/VANILLA
The full text can be found here .
Funded by CAIMed
“Probabilistic Domain Adaptation for Biomedical Image Segmentation”
August 2025
ICCVW 2025
Anwai Archit, Constantin Pape
Segmentation is a crucial analysis task in biomedical imaging. Given the diverse experimental settings in this field, the lack of generalization limits the use of deep learning in practice. Domain adaptation is a promising remedy: it involves training a model for a given task on a source dataset with labels and adapts it to a target dataset without additional labels. We introduce a probabilistic domain adaptation method, building on self-training approaches and the Probabilistic UNet. We use the latter to sample multiple segmentation hypotheses to implement better pseudo-label filtering. We further study joint and separate source-target training strategies and evaluate our method on three challenging domain adaptation tasks for biomedical segmentation.
The full text can be found here .
Funded by CAIMed
„Agent-based modeling for realistic reproduction of human mobility and contact behavior to evaluate test and isolation strategies in epidemic infectious disease spread^“
July 2025
Computers in Biology and Medicine, 193, 110269
David Kerkmann, Sascha Korf, Khoa Nguyen, Daniel Abele, Alain Schengen, Carlotta Gerstein, Jens Henrik Göbbert, Achim Basermann, Martin J. Kühn, Michael Meyer-Hermann
CAIMed Groups:
AI & Active Agents
Human Centered AI
Agent-based models have proven to be useful tools in supporting decision-making processes in different application domains. The advent of modern computers and supercomputers has enabled these bottom-up approaches to realistically model human mobility and contact behavior.
The COVID-19 pandemic showcased the urgent need for detailed and informative models that can answer research questions on transmission dynamics. We present a sophisticated agent-based model to simulate the spread of respiratory diseases. The model is highly modularized and can be used on various scales, from a small collection of buildings up to cities or countries. Although not being the focus of this paper, the model has undergone performance engineering on a single core and provides an efficient intra- and inter-simulation parallelization for time-critical decision-making processes.
In order to allow answering research questions on individual level resolution, nonpharmaceutical intervention strategies such as face masks or venue closures can be implemented for particular locations or agents. In particular, we allow for sophisticated testing and isolation strategies to study the effects of minimal-invasive infectious disease mitigation.
With realistic human mobility patterns for the region of Brunswick, Germany, we study the effects of different interventions between March 1st and May 30, 2021 in the SARS-CoV-2 pandemic. Our analyses suggest that symptom-independent testing has limited impact on the mitigation of disease dynamics if the dark figure in symptomatic cases is high. Furthermore, we found that quarantine length is more important than quarantine efficiency but that, with sufficient symptomatic control, also short quarantines can have a substantial effect.
The full text can be found here .
Funded by CAIMed
“A rule-based clinical decision support system for detection of acute kidney injury after pediatric cardiac surgery”
July 2025
Computers in Biology and Medicine, 193, 110382
Janice Wachenbrunner, Marcel Mast, Julia Böhnke, Nicole Rübsamen, Louisa Bode, André Karch, Henning Rathert, Alexander Horke, Philipp Beerbaum, Michael Marschollek, Thomas Jack, Martin Böhne
CAIMed Groups:
AI & Decisions
Clinical Decision Support
Acute kidney injury (AKI) is common in children with congenital heart disease following open-heart surgery with cardiopulmonary bypass (CPB). Early AKI detection in critically ill children requires clinician expertise to compile various data from different sources within a stressful and time-sensitive environment. However, as electronic health records provide data in a machine-readable format, this process could be supported by computerized systems. Therefore, we developed a time-aware, rule-based clinical decision support system (CDSS) to detect, stage, and track temporal AKI progression in children.
The full text can be found here .
Funded by CAIMed
„Segment Anything for Histopathology”
March 2025
MIDL 2025
Titus Griebel, Anwai Archit, Constantin Pape
Nucleus segmentation is an important analysis task in digital pathology. However, methods for automatic segmentation often struggle with new data from a different distribution, requiring users to manually annotate nuclei and retrain data-specific models. Vision foundation models (VFMs), such as the Segment Anything Model (SAM), offer a more robust alternative for automatic and interactive segmentation. Despite their success in natural images, a foundation model for nucleus segmentation in histopathology is still missing. Initial efforts to adapt SAM have shown some success, but did not yet introduce a comprehensive model for diverse segmentation tasks. To close this gap, we introduce PathoSAM, a VFM for nucleus segmentation, based on training SAM on a diverse dataset. Our extensive experiments show that it is the new state-of-the-art model for automatic and interactive nucleus instance segmentation in histopathology. We also demonstrate how it can be adapted for other segmentation tasks, including semantic nucleus segmentation. For this task, we show that it yields results better than popular methods, while not yet beating the state-of-the-art, CellViT. Our models are open-source and compatible with popular tools for data annotation. We also provide scripts for whole-slide image segmentation.
The full text can be found here .
Funded by CAIMed
„Parameter Efficient Fine-Tuning of Segment Anything Model for Biomedical Imaging”
March 2025
MIDL 2025
Carolin Teuber, Anwai Archit, Constantin Pape
CAIMed Groups:
Segmentation is an important analysis task for biomedical images, enabling the study of individual organelles, cells or organs. Deep learning has massively improved segmentation methods, but challenges remain in generalization to new conditions, requiring costly data annotation. Vision foundation models, such as Segment Anything Model (SAM), address this issue through improved generalization. However, these models still require finetuning on annotated data, although with less annotations, to achieve optimal results for new conditions. As a downside, they require more computational resources. This makes parameter-efficient finetuning (PEFT) relevant. We contribute the first comprehensive study of PEFT for SAM applied to biomedical images. We find that the placement of PEFT layers is more important for efficiency than the type of layer for vision transformers and we provide a recipe for resource-efficient finetuning.
The full text can be found here .
Funded by CAIMed
“Using Photon-Counting CT Images for Lung Nodule Classification”
Leonie Thieme, Zahra Ahmadi, Steffen Oeltze-Jafra, Eike Petersen, Hoen-oh Shin, Andrea Schenk
CAIMed Groups:
AI & Decisions
Human-Centered AI
An automatic classification of the malignancy of lung nodules in computed tomography (CT) scans can support early detection of lung cancer, which is crucial for the treatment success. The novel photon-counting CT (PCCT) technology enables high image quality with a low radiation dose and provides additional spectral information. This research focuses on whether PCCT scans offer a benefit in the automatic classification of lung nodules. Establishing a dataset of PCCT images poses several challenges, such as the extraction of annotations or the data imbalance.
The full text can be found here .
Funded by CAIMed
“Transient silencing of hypermutation preserves B cell affinity during clonal bursting”
March 2025
Nature 641, 486–494
Juhee Pae, Niklas Schwan, Bertrand Ottino-Loffler, William S. DeWitt, Amar Garg, Juliana Bortolatto, Ashni A. Vora, Jin-Jie Shen, Alvaro Hobbs, Tiago B. R. Castro, Luka Mesin, Frederick A. Matsen IV, Michael Meyer-Hermann & Gabriel D. Victora
CAIMed Groups:
AI & Active Agents
Mathematical Models
In the course of antibody affinity maturation, germinal centre (GC) B cells mutate their immunoglobulin heavy- and light-chain genes in a process known as somatic hypermutation (SHM). Panels of mutant B cells with different binding affinities for antigens are then selected in a Darwinian manner, which leads to a progressive increase in affinity among the population. As with any Darwinian process, rare gain-of-fitness mutations must be identified and common loss-of-fitness mutations avoided. Progressive acquisition of mutations therefore poses a risk during large proliferative bursts, when GC B cells undergo several cell cycles in the absence of affinity-based selection. Using a combination of in vivo mouse experiments and mathematical modelling, here we show that GCs achieve this balance by strongly suppressing SHM during clonal-burst-type expansion, so that a large fraction of the progeny generated by these bursts does not deviate from their ancestral genotype. Intravital imaging and image-based cell sorting of a mouse strain carrying a reporter of cyclin-dependent kinase 2 (CDK2) activity showed that B cells that are actively undergoing proliferative bursts lack the transient CDK2low ‘G0-like’ phase of the cell cycle in which SHM takes place. We propose a model in which inertially cycling B cells mostly delay SHM until the G0-like phase that follows their final round of division in the GC dark zone, thus maintaining affinity as they clonally expand in the absence of selection.
The full text can be found here .
Funded by CAIMed
Preprints
“Auto-nnU-Net: Towards Automated Medical Image Segmentation”
22 May 2025 (submission date)
arxiv.org
CAIMed Groups:
AI & Decision
Human-Centered AI
Medical Image Segmentation (MIS) includes diverse tasks, from bone to organ segmentation, each with its own challenges in finding the best segmentation model. The state-of-the-art AutoML-related MIS-framework nnU-Net automates many aspects of model configuration but remains constrained by fixed hyperparameters and heuristic design choices. As a full-AutoML framework for MIS, we propose Auto-nnU-Net, a novel nnU-Net variant enabling hyperparameter optimization (HPO), neural architecture search (NAS), and hierarchical NAS (HNAS). Additionally, we propose Regularized PriorBand to balance model accuracy with the computational resources required for training, addressing the resource constraints often faced in real-world medical settings that limit the feasibility of extensive training procedures. We evaluate our approach across diverse MIS datasets from the well-established Medical Segmentation Decathlon, analyzing the impact of AutoML techniques on segmentation performance, computational efficiency, and model design choices. The results demonstrate that our AutoML approach substantially improves the segmentation performance of nnU-Net on 6 out of 10 datasets and is on par on the other datasets while maintaining practical resource requirements. Our code is available at this URL.
The full text can be found here .
Funded by CAIMed