TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multi-scale Matching Networks for Semantic Correspondence

Multi-scale Matching Networks for Semantic Correspondence

Dongyang Zhao, Ziyang Song, Zhenghao Ji, Gangming Zhao, Weifeng Ge, Yizhou Yu

2021-07-31ICCV 2021 10Semantic correspondence
PaperPDFCode(official)

Abstract

Deep features have been proven powerful in building accurate dense semantic correspondences in various previous works. However, the multi-scale and pyramidal hierarchy of convolutional neural networks has not been well studied to learn discriminative pixel-level features for semantic correspondence. In this paper, we propose a multi-scale matching network that is sensitive to tiny semantic differences between neighboring pixels. We follow the coarse-to-fine matching strategy and build a top-down feature and matching enhancement scheme that is coupled with the multi-scale hierarchy of deep convolutional neural networks. During feature enhancement, intra-scale enhancement fuses same-resolution feature maps from multiple layers together via local self-attention and cross-scale enhancement hallucinates higher-resolution feature maps along the top-down hierarchy. Besides, we learn complementary matching details at different scales thus the overall matching score is refined by features of different semantic levels gradually. Our multi-scale matching network can be trained end-to-end easily with few additional learnable parameters. Experimental results demonstrate that the proposed method achieves state-of-the-art performance on three popular benchmarks with high computational efficiency.

Results

TaskDatasetMetricValueModel
Image MatchingSPair-71kPCK50.4MMNet
Semantic correspondenceSPair-71kPCK50.4MMNet

Related Papers

RL from Physical Feedback: Aligning Large Motion Models with Humanoid Control2025-06-15Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence2025-06-09Do It Yourself: Learning Semantic Correspondence from Pseudo-Labels2025-06-05MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation2025-06-03Cora: Correspondence-aware image editing using few step diffusion2025-05-29Semantic Correspondence: Unified Benchmarking and a Strong Baseline2025-05-23TC-MGC: Text-Conditioned Multi-Grained Contrastive Learning for Text-Video Retrieval2025-04-07SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations2025-03-28