TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TransforMatcher: Match-to-Match Attention for Semantic Cor...

TransforMatcher: Match-to-Match Attention for Semantic Correspondence

SeungWook Kim, Juhong Min, Minsu Cho

2022-05-23CVPR 2022 1Semantic correspondence
PaperPDFCode(official)

Abstract

Establishing correspondences between images remains a challenging task, especially under large appearance changes due to different viewpoints or intra-class variations. In this work, we introduce a strong semantic image matching learner, dubbed TransforMatcher, which builds on the success of transformer networks in vision domains. Unlike existing convolution- or attention-based schemes for correspondence, TransforMatcher performs global match-to-match attention for precise match localization and dynamic refinement. To handle a large number of matches in a dense correlation map, we develop a light-weight attention architecture to consider the global match-to-match interactions. We also propose to utilize a multi-channel correlation map for refinement, treating the multi-level scores as features instead of a single score to fully exploit the richer layer-wise semantics. In experiments, TransforMatcher sets a new state of the art on SPair-71k while performing on par with existing SOTA methods on the PF-PASCAL dataset.

Results

TaskDatasetMetricValueModel
Image MatchingSPair-71kPCK53.7TransforMatcher
Semantic correspondenceSPair-71kPCK53.7TransforMatcher

Related Papers

RL from Physical Feedback: Aligning Large Motion Models with Humanoid Control2025-06-15Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence2025-06-09Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels2025-06-05MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation2025-06-03Cora: Correspondence-aware image editing using few step diffusion2025-05-29Semantic Correspondence: Unified Benchmarking and a Strong Baseline2025-05-23TC-MGC: Text-Conditioned Multi-Grained Contrastive Learning for Text-Video Retrieval2025-04-07SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations2025-03-28