Home | Publications | HMM+25

GECO: Geometrically Consistent Embedding With Lightspeed Inference

MCML Authors

Regine Hartwig

* Former Member

→ Group Daniel Cremers
Computer Vision & Artificial Intelligence

Dominik Muhle

→ Group Daniel Cremers
Computer Vision & Artificial Intelligence

Riccardo Marin

Prof. Dr.

Associate

Machine Learning for Robotics

Daniel Cremers

Prof. Dr.

Director

Computer Vision & Artificial Intelligence

Abstract

Recent advances in feature learning have shown that self-supervised vision foundation models can capture semantic correspondences but often lack awareness of underlying 3D geometry. GECO addresses this gap by producing geometrically coherent features that semantically distinguish parts based on geometry (e.g., left/right eyes, front/back legs). We propose a training framework based on optimal transport, enabling supervision beyond keypoints, even under occlusions and disocclusions. With a lightweight architecture, GECO runs at 30 fps, 98.2% faster than prior methods, while achieving state-of-the-art performance on PFPascal, APK, and CUB, improving PCK by 6.0%, 6.2%, and 4.1%, respectively. Finally, we show that PCK alone is insufficient to capture geometric quality and introduce new metrics and insights for more geometry-aware feature learning.

inproceedings HMM+25

ICCV 2025

IEEE/CVF International Conference on Computer Vision. Honolulu, Hawai'i, Oct 19-23, 2025. To be published. Preprint available.

Authors

R. Hartwig • D. Muhle • R. Marin • D. Cremers

Links

URL GitHub

Research Areas

B1 | Computer Vision

B3 | Multimodal Perception

BibTeXKey: HMM+25

#p-cremers #p-marin-riccardo