DAGM GCPR | 2025
DAGM German Conference on Pattern Recognition, Freiburg

Accepted Papers
The following papers have been accepted to GCPR 2025. Congratulations to all the respective authors! See you soon at GCPR 2025 in Freiburg.
EVCS: A Benchmark for Fine-Grained Electric Vehicle Charging Station Detection
Chen, Lin; Südbeck, Sönke; Riggers, Christoph; Geib, Tobias; Cordes, Kai; Broszio, Hellward
MCUCoder: Adaptive Bitrate Learned Video Compression for IoT Devices
Hojjat, Ali; Haberer, Janek; Landsiedel, Olaf
NaT-ReX: Naturalness Assessment with Transformer-Based Reliable Explainability
Emam, Ahmed; Farag, Mohamed; Russwurm, Marc; Roscher, Ribana
subCellSAM: Zero-Shot (Sub-)Cellular Segmentation for Hit Validation in Drug Discovery
Hanimann, Jacob; Siegismund, Daniel; Wieser, Mario; Steigele, Stephan
Efficient Masked Attention Transformer for Few-Shot Classification and Segmentation
Carrion, Dustin; Roth, Stefan; Schaub-Meyer, Simone
Common Data Properties Limit Object-Attribute Binding in CLIP
Guring, Bijay; Hoffmann, David; Brox, Thomas
MT-Occ: Single-View 3D Occupancy Prediction via Multi-Task Distillation
Li, Zhi; Aljundi, Rahaf; Reino, Daniel; Schiele, Bernt
SegSLR: Promptable Video Segmentation for Isolated Sign Language Recognition
Schreiber, Sven; Sarhan, Noha; Frintrop, Simone; Wilms, Christian
synth-dacl: Does Synthetic Defect Data Enhance Segmentation Accuracy and Robustness for Real-World Bridge Inspections?
Flotzinger, Johannes; Deuser, Fabian; Jaziri, Achref; Neumann, Heiko; Oswald, Norbert; Ramesh, Visvanathan; Braml, Thomas
Hierarchical Insights: Exploiting Structural Similarities for Reliable 3D Semantic Segmentation
Dreissig, Mariella; Ruehle, Simon; Piewak, Florian; Boedecker, Joschka
VisualChef: Generating Visual Aids in Cooking via Mask Inpainting
Kuzyk, Oleh; Li, Zuoyue; Pollefeys, Marc; Wang, Xi
FedPCE: Federated Personalized Client Embeddings for Post-training Knowledge Distillation
Hansel, Soma; Kobler, Erich; Effland, Alexander
CoProU-VO: Combining Projected Uncertainty for End-to-End Unsupervised Monocular Visual Odometry
Xie, Jingchao; Dhaouadi, Oussema; Chen, Weirong; Meier, Johannes; Kaiser, Jacques; Cremers, Daniel
Video Object Segmentation-aware Audio Generation
Viertola, Ilpo; Iashin, Vladimir; Rahtu, Esa
Object Risk Estimation for Autonomous Driving Safety
Khan, Abdul Hannan; Shafiq, Syed; van Elst, Ludger; Dengel, Andreas
Rethinking Semi-supervised Segmentation Beyond Accuracy: Robustness and Reliability
Landgraf, Steven; Hillemann, Markus; Ulrich, Markus
A Cascaded Dilated Convolution Approach for Mpox Lesion Classification
Deshmukh, Ayush
Assessing Foundation Models for Mold Colony Detection with Limited Training Data
Pichler, Henrik; Keuper, Janis; Copping, Matthew
Semantic Segmentation of Structural Damage: A Comparative Study of YOLO11 and Encoder-Decoder Networks
Krefft, Lorenz; Hoegner, Ludwig
Structured Universal Adversarial Attacks on Object Detection for Video Sequences
Jacob, Sven; Shao, Weijia; Kasneci, Gjergji
Investigating Structural Pruning and Recovery Techniques for Compressing Multimodal Large Language Models: An Empirical Study
Huang, Yiran; Thede, Lukas; Mancini, Massimiliano; Xu, Wenjia; Akata, Zeynep
Detection of Synthetic Face Images: Accuracy, Robustness, Generalization
Petrželková, Nela; Čech, Jan
HistDiST: Histopathological Diffusion-based Stain Transfer
Grosskopf, Erik; Bundele, Valay; Hosseinzadeh, Mehran; Lensch, Hendrik
Deep Learning-Assisted Dynamic Mode Decomposition for Non-resonant Background Removal in CARS Spectroscopy
Chalain Valapil, Adithya Ashok; Messerschmidt, Carl; Shadaydeh, Maha; Schmitt, Michael; Popp, Jürgen; Denzler, Joachim
Combining Absolute and Semi-Generalized Relative Poses for Visual Localization
Panek, Vojtech; Sattler, Torsten; Kukelova, Zuzana
Unlocking In-Context Learning for Natural Datasets Beyond Language Modelling
Bratulić, Jelena; Mittal, Sudhanshu; Hoffmann, David; Böhm, Samuel; Schirrmeister, Robin; Ball, Tonio; Rupprecht, Christian; Brox, Thomas
Graph Roof Reconstruction with Synthetic Data from Misaligned Labels
Amrullah, Chaikal; Bittner, Ksenia
sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views
Najafli, Eyvaz; Kästingschäfer, Marius; Bernhard, Sebastian; Brox, Thomas; Geiger, Andreas
Can Multitask Learning Enhance Model Explainability?
Najjar, Hiba; Alshbib, Bushra; Dengel, Andreas
Out-of-Distribution Detection in LiDAR Semantic Segmentation Using Epistemic Uncertainty from Hierarchical GMMs
Shojaei Miandashti, Hanieh; Brenner, Claus
γ-Quant: Towards Learnable Quantization for Low-bit Pattern Recognition
Fatima, Mishal; Agnihotri, Shashank; Bock, Marius; Gandikota, Kanchana; Van Laerhoven, Kristof; Moeller, Michael; Keuper, Margret
RadarSeq: A Temporal Vision Framework for User Churn Prediction via Radar Sequence Chart
Najafi, Sina; Sepanj, M. Hadi; Jafari, Fahimeh
Using Knowledge Graphs to harvest datasets for efficient CLIP model training
Ging, Simon; Walter, Sebastian; Bratulić, Jelena; Dienert, Johannes; Bast, Hannah; Brox, Thomas
StorySync: Training-Free Subject Consistency via Region Harmonization
Gaur, Gopalji; Zolfaghari, Mohammadreza; Brox, Thomas
Don’t Miss Out on Novelty: Importance of Novel Features for Deep Anomaly Detection
Sivaprasad, Sarath; Fritz, Mario
LADB: Latent Aligned Diffusion Bridges for Semi-Supervised Domain Translation
Wang, Xuqin; Wu, Tao; Zhang, Yanfeng; Liu, Lu; Wang, Dong; Sun, Mingwei; Wang, Yongliang; Zeller, Niclas; Cremers, Daniel
On the Dangers of Bootstrapping Generation for Continual Learning and Beyond
Zverev, Daniil; Koepke, Almut Sophia; Henriques, Joao
Road Obstacle Video Segmentation
Rai, Shyam Nandan; Karthik, Shyamgopal; Georgescu, Iuliana; Caputo, Barbara; Masone, Carlo; Akata, Zeynep
Combined Image Data Augmentations diminish the benefits of Adaptive Label Smoothing
Siedel, Georg; Gupta, Ekagra; Shao, Weijia; Vock, Silvia; Morozov, Andrey
Accepted Papers: Nectar Track
Can We Talk Models Into Seeing the World Differently?
Gavrikov, Paul; Lukasik, Jovita; Jung, Steffen; Geirhos, Robert; Mirza, M. Jehanzeb; Keuper, Margret; Keuper, Janis
FastCAV: Efficient Computation of Concept Activation Vectors for Explaining Deep Neural Networks
Schmalwasser, Laines; Penzel, Niklas; Denzler, Joachim; Niebling, Julia
Probabilistic Embeddings for Frozen Vision-Language Models: Uncertainty Quantification with Gaussian Process Latent Variable Models
Venkataramanan, Aishwarya; Bodesheim, Paul; Denzler, Joachim
SEED4D: A Synthetic Ego–Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark
Kästingschäfer, Marius; Gieruc, Theo; Bernhard, Sebastian; Campbell, Dylan; Insafutdinov, Eldar; Najafli, Eyvaz; Brox, Thomas
Electromyography-Informed Facial Expression Reconstruction for Physiological-Based Synthesis and Analysis
Büchner, Tim; Anders, Christoph; Guntinas-Lichius, Orlando; Denzler, Joachim
Implicit Language Models are RNNs: Balancing Parallelization and Expressivity
Schoene, Mark; Rahmani, Babak; Kremer, Heiner; Falck, Fabian; Ballani, Hitesh; Gladrow, Jannes
HydraViT: Stacking Heads for a Scalable ViT
Haberer, Janek; Hojjat, Ali; Landsiedel, Olaf
Banded Square Root Matrix Factorization for Differentially Private Model Training
Kalinin, Nikita; Lampert, Christoph
CausalRivers - Scaling up benchmarking of causal discovery for real-world time-series
Stein, Gideon; Shadaydeh, Maha; Blunk, Jan; Penzel, Niklas; Denzler, Joachim
DCBM: Data-Efficient Visual Concept Bottleneck Models
Prasse, Katharina; Knab, Patrick; Marton, Sascha; Bartelt, Christian; Keuper, Margret
Prompt-Tuning SAM: From Generalist to Specialist with only 2048 Parameters and 16 Training Images
Piater, Tristan; Barz, Björn; Freytag, Alexander
FAIR-TAT: Improving Model Fairness Using Targeted Adversarial Training
Medi, Tejaswini; Jung, Steffen; Keuper, Margret
Higher-Order Ratio Cycles for Fast and Globally Optimal Shape Matching
Roetzer, Paul; Ehm, Viktoria; Cremers, Daniel; Lähner, Zorah; Bernard, Florian
PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?
Spitznagel, Martin; Vaillant, Jan; Keuper, Janis
High-Resolution 3D Shape Matching with Global Optimality and Geometric Consistency
El Amrani, Nafie; Rötzer, Paul; Bernard, Florian
Removing Cost Volumes from Optical Flow Estimators
Kiefhaber, Simon; Roth, Stefan; Schaub-Meyer, Simone
Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?
Zverev, Egor; Abdelnabi, Sahar; Tabesh, Soroush; Fritz, Mario; Lampert, Christoph
SOS: Segment Object System for Open-World Instance Segmentation With Object Priors
Wilms, Christian; Rolff, Tim; Hillemann, Maris; Johanson, Robert; Frintrop, Simone
Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models
Schrodi, Simon; Hoffmann, David; Argus, Max; Fischer, Volker; Brox, Thomas
When and How Does CLIP Enable Domain and Compositional Generalization?
Kempf, Elias; Schrodi, Simon; Argus, Max; Brox, Thomas
Towards Optimizing Large-Scale Multi-Graph Matching in Bioimaging
Kahl, Max; Stricker, Sebastian; Hutschenreither, Lisa; Bernard, Florian; Rother, Carsten; Savchynskyy, Bogdan
Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling
Davtyan, Aram; Dadi, Leello Tadesse; Cevher, Volkan; Favaro, Paolo
CAGE: Unsupervised Visual Composition and Animation for Controllable Video Generation
Davtyan, Aram; Sameni, Sepehr; Ommer, Björn; Favaro, Paolo
MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Action Anticipation
Zatsarynna, Olga; Bahrami, Emad; Abu Farha, Yazan; Francesca, Gianpiero; Gall, Juergen
Scene-Centric Unsupervised Panoptic Segmentation
Hahn, Oliver; Reich, Christoph; Araslanov, Nikita; Cremers, Daniel; Rupprecht, Christian; Roth, Stefan
Using Shapley interactions to understand how models use structure
Misra, Diganta
CutS3D: Cutting Semantics in 3D for 2D Unsupervised Instance Segmentation
Sick, Leon; Engel, Dominik; Hartwig, Sebastian; Hermosilla, Pedro; Ropinski, Timo
VITAL: More Understandable Feature Visualization through Distribution Alignment and Relevant Information Flow
Görgün, Ada; Schiele, Bernt; Fischer, Jonas
AIM: Amending Inherent Interpretability via Self-Supervised Masking
Alshami, Eyad; Agnihotri, Shashank; Schiele, Bernt; Keuper, Margret
The GOOSE Dataset for Perception in Unstructured Environments
Mortimer, Peter; Hagmanns, Raphael; Granero, Miguel; Petereit, Janko; Luettel, Thorsten
DASH: Detection and Assessment of Systematic Hallucinations of VLMs
Augustin, Maximilian; Neuhaus, Yannic; Hein, Matthias
Spatial Reasoning with Denoising Models
Wewer, Christopher; Pogodzinski, Bartlomiej; Schiele, Bernt; Lenssen, Jan
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Jevtić, Aleksandar; Reich, Christoph; Wimbauer, Felix; Hahn, Oliver; Rupprecht, Christian; Roth, Stefan; Cremers, Daniel
Activation Subspaces for Out-of-Distribution Detection
Zöngür, Barış; Hesse, Robin; Roth, Stefan
LeGrad: An Explainability Method for Vision Transformers via Feature Formation Sensitivity
Bousselham, Walid; Boggust, Angie; Chaybouti, Sofian; Strobelt, Hendrik; Kuehne, Hilde
B-cosification: Transforming Deep Neural Networks to be Inherently Interpretable
Arya, Shreyash; Rao, Sukrut; Böhle, Moritz; Schiele, Bernt
VGGSounder: Audio-Visual Evaluations for Foundation Models
Zverev, Daniil; Wiedemer, Thaddäus; Prabhu, Ameya; Bethge, Matthias; Brendel, Wieland; Koepke, A. Sophia
Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets
Boettcher, Wolfgang; Hoyer, Lukas; Uenal, Ozan; Lenssen, Jan Eric; Schiele, Bernt
Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels
Dünkel, Olaf; Wimmer, Thomas; Theobalt, Christian; Rupprecht, Christian; Kortylewski, Adam
TikZero: Zero-Shot Text-Guided Graphics Program Synthesis
Belouadi, Jonas; Ilg, Eddy; Keuper, Margret; Tanaka, Hideki; Utiyama, Masao; Dabre, Raj; Eger, Steffen; Ponzetto, Simone Paolo
FlowBench: Benchmarking Optical Flow Estimation Methods for Reliability and Generalization
Agnihotri, Shashank; Caspary, Julian; Schwarz, Luca; Gao, Xinyan; Schmalfuss, Jenny; Bruhn, Andres; Keuper, Margret