TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Prototypical Contrastive Learning-based CLIP Fine-tuning f...

Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification

Jiachen Li, Xiaojin Gong

2023-10-26Unsupervised Vehicle Re-IdentificationContrastive LearningPerson Re-IdentificationUnsupervised Person Re-Identification
PaperPDFCode(official)

Abstract

This work aims to adapt large-scale pre-trained vision-language models, such as contrastive language-image pretraining (CLIP), to enhance the performance of object reidentification (Re-ID) across various supervision settings. Although prompt learning has enabled a recent work named CLIP-ReID to achieve promising performance, the underlying mechanisms and the necessity of prompt learning remain unclear due to the absence of semantic labels in ReID tasks. In this work, we first analyze the role prompt learning in CLIP-ReID and identify its limitations. Based on our investigations, we propose a simple yet effective approach to adapt CLIP for supervised object Re-ID. Our approach directly fine-tunes the image encoder of CLIP using a prototypical contrastive learning (PCL) loss, eliminating the need for prompt learning. Experimental results on both person and vehicle Re-ID datasets demonstrate the competitiveness of our method compared to CLIP-ReID. Furthermore, we extend our PCL-based CLIP fine-tuning approach to unsupervised scenarios, where we achieve state-of-the art performance.

Results

TaskDatasetMetricValueModel
Person Re-IdentificationMSMT17Rank-189.8PCL-CLIP (L_pcl+L_id)
Person Re-IdentificationMSMT17Rank-1096PCL-CLIP (L_pcl+L_id)
Person Re-IdentificationMSMT17Rank-594.7PCL-CLIP (L_pcl+L_id)
Person Re-IdentificationMSMT17mAP76.1PCL-CLIP (L_pcl+L_id)
Person Re-IdentificationMSMT17Rank-189.2PCL-CLIP (L_pcl)
Person Re-IdentificationMSMT17Rank-1095.8PCL-CLIP (L_pcl)
Person Re-IdentificationMSMT17Rank-594.7PCL-CLIP (L_pcl)
Person Re-IdentificationMSMT17mAP73.8PCL-CLIP (L_pcl)
Person Re-IdentificationMarket-1501Rank-196.1PCL-CLIP (L_pcl)
Person Re-IdentificationMarket-1501Rank-598.8PCL-CLIP (L_pcl)
Person Re-IdentificationMarket-1501mAP91PCL-CLIP (L_pcl)
Person Re-IdentificationMarket-1501Rank-195.9PCL-CLIP (L_pcl+L_id)
Person Re-IdentificationMarket-1501Rank-598.5PCL-CLIP (L_pcl+L_id)
Person Re-IdentificationMarket-1501mAP91.4PCL-CLIP (L_pcl+L_id)
Person Re-IdentificationMSMT17Rank-184.9PCL-CLIP (O2CAP)
Person Re-IdentificationMSMT17Rank-1094PCL-CLIP (O2CAP)
Person Re-IdentificationMSMT17Rank-592PCL-CLIP (O2CAP)
Person Re-IdentificationMSMT17mAP65.5PCL-CLIP (O2CAP)
Person Re-IdentificationMSMT17Rank-179PCL-CLIP (CAP)
Person Re-IdentificationMSMT17Rank-1091.1PCL-CLIP (CAP)
Person Re-IdentificationMSMT17Rank-588.4PCL-CLIP (CAP)
Person Re-IdentificationMSMT17mAP53.6PCL-CLIP (CAP)
Person Re-IdentificationMSMT17Rank-177.9PCL-CLIP (CC)
Person Re-IdentificationMSMT17Rank-1087.2PCL-CLIP (CC)
Person Re-IdentificationMSMT17Rank-585.2PCL-CLIP (CC)
Person Re-IdentificationMSMT17mAP56.4PCL-CLIP (CC)
Person Re-IdentificationMarket-1501MAP88.4PCL-CLIP (O2CAP)
Person Re-IdentificationMarket-1501Rank-194.8PCL-CLIP (O2CAP)
Person Re-IdentificationMarket-1501Rank-1098.7PCL-CLIP (O2CAP)
Person Re-IdentificationMarket-1501Rank-598PCL-CLIP (O2CAP)
Person Re-IdentificationMarket-1501MAP87.4PCL-CLIP (CAP)
Person Re-IdentificationMarket-1501Rank-193.9PCL-CLIP (CAP)
Person Re-IdentificationMarket-1501Rank-1098.5PCL-CLIP (CAP)
Person Re-IdentificationMarket-1501Rank-597.7PCL-CLIP (CAP)
Person Re-IdentificationMarket-1501MAP86.9PCL-CLIP (CC)
Person Re-IdentificationMarket-1501Rank-194.2PCL-CLIP (CC)
Person Re-IdentificationMarket-1501Rank-1098.7PCL-CLIP (CC)
Person Re-IdentificationMarket-1501Rank-597.8PCL-CLIP (CC)

Related Papers

SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Weakly Supervised Visible-Infrared Person Re-Identification via Heterogeneous Expert Collaborative Consistency Learning2025-07-17WhoFi: Deep Person Re-Identification via Wi-Fi Channel Signal Encoding2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16LLM-Driven Dual-Level Multi-Interest Modeling for Recommendation2025-07-15