TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Telling Left from Right: Identifying Geometry-Aware Semant...

Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence

Junyi Zhang, Charles Herrmann, Junhwa Hur, Eric Chen, Varun Jampani, Deqing Sun, Ming-Hsuan Yang

2023-11-28CVPR 2024 1Semantic correspondencePose EstimationAnimal Pose Estimation
PaperPDFCode(official)

Abstract

While pre-trained large-scale vision models have shown significant promise for semantic correspondence, their features often struggle to grasp the geometry and orientation of instances. This paper identifies the importance of being geometry-aware for semantic correspondence and reveals a limitation of the features of current foundation models under simple post-processing. We show that incorporating this information can markedly enhance semantic correspondence performance with simple but effective solutions in both zero-shot and supervised settings. We also construct a new challenging benchmark for semantic correspondence built from an existing animal pose estimation dataset, for both pre-training validating models. Our method achieves a PCK@0.10 score of 65.4 (zero-shot) and 85.6 (supervised) on the challenging SPair-71k dataset, outperforming the state of the art by 5.5p and 11.0p absolute gains, respectively. Our code and datasets are publicly available at: https://telling-left-from-right.github.io/.

Results

TaskDatasetMetricValueModel
Image MatchingSPair-71kPCK85.6GeoAware-SC (Supervised, AP-10K P.T.)
Image MatchingSPair-71kPCK82.9GeoAware-SC (Supervised)
Image MatchingSPair-71kPCK68.5GeoAware-SC (Zero-Shot)
Image MatchingPF-PASCALPCK95.7GeoAware-SC (Supervised, AP-10K P.T.)
Image MatchingPF-PASCALPCK95.1GeoAware-SC (Supervised)
Image MatchingPF-PASCALPCK82.6GeoAware-SC (Zero-Shot)
Semantic correspondenceSPair-71kPCK85.6GeoAware-SC (Supervised, AP-10K P.T.)
Semantic correspondenceSPair-71kPCK82.9GeoAware-SC (Supervised)
Semantic correspondenceSPair-71kPCK68.5GeoAware-SC (Zero-Shot)
Semantic correspondencePF-PASCALPCK95.7GeoAware-SC (Supervised, AP-10K P.T.)
Semantic correspondencePF-PASCALPCK95.1GeoAware-SC (Supervised)
Semantic correspondencePF-PASCALPCK82.6GeoAware-SC (Zero-Shot)

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark2025-07-17DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model2025-07-17From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation2025-07-17AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16