Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification

Jiachen Li, Xiaojin Gong

2023-10-26Unsupervised Vehicle Re-Identification Contrastive Learning Person Re-Identification Unsupervised Person Re-Identification

Paper PDF Code(official)

Abstract

This work aims to adapt large-scale pre-trained vision-language models, such as contrastive language-image pretraining (CLIP), to enhance the performance of object reidentification (Re-ID) across various supervision settings. Although prompt learning has enabled a recent work named CLIP-ReID to achieve promising performance, the underlying mechanisms and the necessity of prompt learning remain unclear due to the absence of semantic labels in ReID tasks. In this work, we first analyze the role prompt learning in CLIP-ReID and identify its limitations. Based on our investigations, we propose a simple yet effective approach to adapt CLIP for supervised object Re-ID. Our approach directly fine-tunes the image encoder of CLIP using a prototypical contrastive learning (PCL) loss, eliminating the need for prompt learning. Experimental results on both person and vehicle Re-ID datasets demonstrate the competitiveness of our method compared to CLIP-ReID. Furthermore, we extend our PCL-based CLIP fine-tuning approach to unsupervised scenarios, where we achieve state-of-the art performance.

Results

Task	Dataset	Metric	Value	Model
Person Re-Identification	MSMT17	Rank-1	89.8	PCL-CLIP (L_pcl+L_id)
Person Re-Identification	MSMT17	Rank-10	96	PCL-CLIP (L_pcl+L_id)
Person Re-Identification	MSMT17	Rank-5	94.7	PCL-CLIP (L_pcl+L_id)
Person Re-Identification	MSMT17	mAP	76.1	PCL-CLIP (L_pcl+L_id)
Person Re-Identification	MSMT17	Rank-1	89.2	PCL-CLIP (L_pcl)
Person Re-Identification	MSMT17	Rank-10	95.8	PCL-CLIP (L_pcl)
Person Re-Identification	MSMT17	Rank-5	94.7	PCL-CLIP (L_pcl)
Person Re-Identification	MSMT17	mAP	73.8	PCL-CLIP (L_pcl)
Person Re-Identification	Market-1501	Rank-1	96.1	PCL-CLIP (L_pcl)
Person Re-Identification	Market-1501	Rank-5	98.8	PCL-CLIP (L_pcl)
Person Re-Identification	Market-1501	mAP	91	PCL-CLIP (L_pcl)
Person Re-Identification	Market-1501	Rank-1	95.9	PCL-CLIP (L_pcl+L_id)
Person Re-Identification	Market-1501	Rank-5	98.5	PCL-CLIP (L_pcl+L_id)
Person Re-Identification	Market-1501	mAP	91.4	PCL-CLIP (L_pcl+L_id)
Person Re-Identification	MSMT17	Rank-1	84.9	PCL-CLIP (O2CAP)
Person Re-Identification	MSMT17	Rank-10	94	PCL-CLIP (O2CAP)
Person Re-Identification	MSMT17	Rank-5	92	PCL-CLIP (O2CAP)
Person Re-Identification	MSMT17	mAP	65.5	PCL-CLIP (O2CAP)
Person Re-Identification	MSMT17	Rank-1	79	PCL-CLIP (CAP)
Person Re-Identification	MSMT17	Rank-10	91.1	PCL-CLIP (CAP)
Person Re-Identification	MSMT17	Rank-5	88.4	PCL-CLIP (CAP)
Person Re-Identification	MSMT17	mAP	53.6	PCL-CLIP (CAP)
Person Re-Identification	MSMT17	Rank-1	77.9	PCL-CLIP (CC)
Person Re-Identification	MSMT17	Rank-10	87.2	PCL-CLIP (CC)
Person Re-Identification	MSMT17	Rank-5	85.2	PCL-CLIP (CC)
Person Re-Identification	MSMT17	mAP	56.4	PCL-CLIP (CC)
Person Re-Identification	Market-1501	MAP	88.4	PCL-CLIP (O2CAP)
Person Re-Identification	Market-1501	Rank-1	94.8	PCL-CLIP (O2CAP)
Person Re-Identification	Market-1501	Rank-10	98.7	PCL-CLIP (O2CAP)
Person Re-Identification	Market-1501	Rank-5	98	PCL-CLIP (O2CAP)
Person Re-Identification	Market-1501	MAP	87.4	PCL-CLIP (CAP)
Person Re-Identification	Market-1501	Rank-1	93.9	PCL-CLIP (CAP)
Person Re-Identification	Market-1501	Rank-10	98.5	PCL-CLIP (CAP)
Person Re-Identification	Market-1501	Rank-5	97.7	PCL-CLIP (CAP)
Person Re-Identification	Market-1501	MAP	86.9	PCL-CLIP (CC)
Person Re-Identification	Market-1501	Rank-1	94.2	PCL-CLIP (CC)
Person Re-Identification	Market-1501	Rank-10	98.7	PCL-CLIP (CC)
Person Re-Identification	Market-1501	Rank-5	97.8	PCL-CLIP (CC)

Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification

Abstract

Results

Related Papers

Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification

Abstract

Results

Related Papers