TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Noisy-Correspondence Learning for Text-to-Image Person Re-...

Noisy-Correspondence Learning for Text-to-Image Person Re-identification

Yang Qin, Yingke Chen, Dezhong Peng, Xi Peng, Joey Tianyi Zhou, Peng Hu

2023-08-19CVPR 2024 1Text-based Person Retrieval with Noisy CorrespondencePerson Re-IdentificationText based Person Retrieval
PaperPDFCode(official)

Abstract

Text-to-image person re-identification (TIReID) is a compelling topic in the cross-modal community, which aims to retrieve the target person based on a textual query. Although numerous TIReID methods have been proposed and achieved promising performance, they implicitly assume the training image-text pairs are correctly aligned, which is not always the case in real-world scenarios. In practice, the image-text pairs inevitably exist under-correlated or even false-correlated, a.k.a noisy correspondence (NC), due to the low quality of the images and annotation errors. To address this problem, we propose a novel Robust Dual Embedding method (RDE) that can learn robust visual-semantic associations even with NC. Specifically, RDE consists of two main components: 1) A Confident Consensus Division (CCD) module that leverages the dual-grained decisions of dual embedding modules to obtain a consensus set of clean training data, which enables the model to learn correct and reliable visual-semantic associations. 2) A Triplet Alignment Loss (TAL) relaxes the conventional Triplet Ranking loss with the hardest negative samples to a log-exponential upper bound over all negative ones, thus preventing the model collapse under NC and can also focus on hard-negative samples for promising performance. We conduct extensive experiments on three public benchmarks, namely CUHK-PEDES, ICFG-PEDES, and RSTPReID, to evaluate the performance and robustness of our RDE. Our method achieves state-of-the-art results both with and without synthetic noisy correspondences on all three datasets. Code is available at https://github.com/QinYang79/RDE.

Results

TaskDatasetMetricValueModel
Text based Person RetrievalICFG-PEDESR@167.68RDE
Text based Person RetrievalICFG-PEDESR@1087.36RDE
Text based Person RetrievalICFG-PEDESR@582.47RDE
Text based Person RetrievalICFG-PEDESmAP40.06RDE
Text based Person RetrievalICFG-PEDESmINP7.87RDE
Text based Person RetrievalRSTPReidR@165.35RDE
Text based Person RetrievalRSTPReidR@1089.9RDE
Text based Person RetrievalRSTPReidR@583.95RDE
Text based Person RetrievalRSTPReidmAP50.88RDE
Text based Person RetrievalRSTPReidmINP28.08RDE
Text-based Person Retrieval with Noisy CorrespondenceICFG-PEDESRank 166.54RDE
Text-based Person Retrieval with Noisy CorrespondenceICFG-PEDESRank-1086.7RDE
Text-based Person Retrieval with Noisy CorrespondenceICFG-PEDESRank-581.7RDE
Text-based Person Retrieval with Noisy CorrespondenceICFG-PEDESmAP39.08RDE
Text-based Person Retrieval with Noisy CorrespondenceICFG-PEDESmINP7.55RDE
Text-based Person Retrieval with Noisy CorrespondenceRSTPReidRank 164.45RDE
Text-based Person Retrieval with Noisy CorrespondenceRSTPReidRank 1090RDE
Text-based Person Retrieval with Noisy CorrespondenceRSTPReidRank 583.5RDE
Text-based Person Retrieval with Noisy CorrespondenceRSTPReidmAP49.78RDE
Text-based Person Retrieval with Noisy CorrespondenceRSTPReidmINP27.43RDE
Text-based Person Retrieval with Noisy CorrespondenceCUHK-PEDESRank 1093.63RDE
Text-based Person Retrieval with Noisy CorrespondenceCUHK-PEDESRank-174.46RDE
Text-based Person Retrieval with Noisy CorrespondenceCUHK-PEDESRank-589.42RDE
Text-based Person Retrieval with Noisy CorrespondenceCUHK-PEDESmAP66.13RDE
Text-based Person Retrieval with Noisy CorrespondenceCUHK-PEDESmINP49.66RDE

Related Papers

Weakly Supervised Visible-Infrared Person Re-Identification via Heterogeneous Expert Collaborative Consistency Learning2025-07-17WhoFi: Deep Person Re-Identification via Wi-Fi Channel Signal Encoding2025-07-17Try Harder: Hard Sample Generation and Learning for Clothes-Changing Person Re-ID2025-07-15Mind the Gap: Bridging Occlusion in Gait Recognition via Residual Gap Correction2025-07-15KeyRe-ID: Keypoint-Guided Person Re-Identification using Part-Aware Representation in Videos2025-07-10CORE-ReID V2: Advancing the Domain Adaptation for Object Re-Identification with Optimized Training and Ensemble Fusion2025-07-04Following the Clues: Experiments on Person Re-ID using Cross-Modal Intelligence2025-07-02DeSPITE: Exploring Contrastive Deep Skeleton-Pointcloud-IMU-Text Embeddings for Advanced Point Cloud Human Activity Understanding2025-06-16