TransMatcher: Deep Image Matching Through Transformers for Generalizable Person Re-identification

Shengcai Liao, Ling Shao

2021-05-30 | NeurIPS 2021 12 | Image Classification, Representation Learning, Metric Learning, Person Re-Identification, Generalizable Person Re-identification


Abstract

Transformers have recently gained increasing attention in computer vision. However, existing studies mostly use Transformers for feature representation learning, e.g. for image classification and dense predictions, and the generalizability of Transformers is unknown. In this work, we further investigate the possibility of applying Transformers for image matching and metric learning given pairs of images. We find that the Vision Transformer (ViT) and the vanilla Transformer with decoders are not adequate for image matching due to their lack of image-to-image attention. Thus, we further design two naive solutions, i.e. query-gallery concatenation in ViT, and query-gallery cross-attention in the vanilla Transformer. The latter improves the performance, but it is still limited. This implies that the attention mechanism in Transformers is primarily designed for global feature aggregation, which is not naturally suitable for image matching. Accordingly, we propose a new simplified decoder, which drops the full attention implementation with the softmax weighting, keeping only the query-key similarity computation. Additionally, global max pooling and a multilayer perceptron (MLP) head are applied to decode the matching result. This way, the simplified decoder is computationally more efficient, while at the same time more effective for image matching. The proposed method, called TransMatcher, achieves state-of-the-art performance in generalizable person re-identification, with up to 6.1% and 5.7% performance gains in Rank-1 and mAP, respectively, on several popular datasets. Code is available at https://github.com/ShengcaiLiao/QAConv.
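
The simplified decoder described in the abstract (query-key similarity only, then global max pooling, then an MLP head) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the two-layer MLP shape, and the hand-picked weights are all hypothetical, and the sketch leaves out anything the abstract does not mention (learned projections, normalization, stacking of several decoder layers). Each image is represented as a list of local feature vectors, one per spatial location.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def similarity_matrix(query, gallery):
    """Query-key similarity only: no softmax weighting, no value aggregation."""
    return [[dot(q, g) for g in gallery] for q in query]

def global_max_pool(sim):
    """For each query location, keep the score of its best gallery location."""
    return [max(row) for row in sim]

def mlp_head(x, w1, b1, w2, b2):
    """Hypothetical two-layer perceptron (ReLU hidden layer) -> one scalar."""
    hidden = [max(0.0, dot(w, x) + b) for w, b in zip(w1, b1)]
    return dot(w2, hidden) + b2

def match_score(query, gallery, params):
    """Decode a single matching score for one query-gallery image pair."""
    pooled = global_max_pool(similarity_matrix(query, gallery))
    return mlp_head(pooled, *params)

# Toy example: two spatial locations, two feature channels per location.
query = [[1.0, 0.0], [0.0, 1.0]]
same_person = [[1.0, 0.0], [0.0, 1.0]]   # both locations find a match
other_person = [[1.0, 0.0], [0.0, 0.0]]  # only one location finds a match

# Hand-picked illustrative weights: (w1, b1, w2, b2).
params = ([[1.0, 1.0]], [0.0], [2.0], 0.5)

print(match_score(query, same_person, params))
print(match_score(query, other_person, params))
```

Because the softmax and the weighted sum over values are dropped, the only pairwise computation is the similarity matrix itself, which is consistent with the abstract's claim that the simplified decoder is cheaper than full attention.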

Results

Task	Test dataset	Training set->Metric	Value (%)	Model
Person Re-Identification	MSMT17	ClonedPerson->Rank-1	51.6	TransMatcher
Person Re-Identification	MSMT17	ClonedPerson->mAP	20.8	TransMatcher
Person Re-Identification	MSMT17	Market-1501->Rank-1	47.3	TransMatcher
Person Re-Identification	MSMT17	Market-1501->mAP	18.4	TransMatcher
Person Re-Identification	MSMT17	RandPerson->Rank-1	48.3	TransMatcher
Person Re-Identification	MSMT17	RandPerson->mAP	17.7	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	ClonedPerson->Rank-1	25.4	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	ClonedPerson->mAP	24.4	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	MSMT17->Rank-1	23.7	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	MSMT17->mAP	22.5	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	MSMT17-All->Rank-1	31.9	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	MSMT17-All->mAP	30.7	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	Market-1501->Rank-1	22.2	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	Market-1501->mAP	21.4	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	RandPerson->Rank-1	17.1	TransMatcher
Person Re-Identification	CUHK03-NP (detected)	RandPerson->mAP	16.0	TransMatcher
Person Re-Identification	Market-1501	ClonedPerson->Rank-1	84.8	TransMatcher
Person Re-Identification	Market-1501	ClonedPerson->mAP	62.3	TransMatcher
Person Re-Identification	Market-1501	MSMT17->Rank-1	80.1	TransMatcher
Person Re-Identification	Market-1501	MSMT17->mAP	52.0	TransMatcher
Person Re-Identification	Market-1501	MSMT17-All->Rank-1	82.6	TransMatcher
Person Re-Identification	Market-1501	MSMT17-All->mAP	58.4	TransMatcher
Person Re-Identification	Market-1501	RandPerson->Rank-1	77.3	TransMatcher
Person Re-Identification	Market-1501	RandPerson->mAP	49.1	TransMatcher
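
For readers unfamiliar with the two metrics reported above, the following sketch shows how Rank-1 and mAP are commonly computed from a query-by-gallery score matrix. It is a simplified illustration, not the evaluation code behind the table: the function names and toy data are hypothetical, and the usual re-identification step of discarding gallery images of the same identity taken by the same camera as the query is omitted.

```python
def average_precision(ranked_matches):
    """AP for one query; ranked_matches is ordered by descending score."""
    hits, precision_sum = 0, 0.0
    for rank, is_match in enumerate(ranked_matches, start=1):
        if is_match:
            hits += 1
            precision_sum += hits / rank  # precision at this recall point
    return precision_sum / hits if hits else 0.0

def evaluate(scores, query_ids, gallery_ids):
    """Return (Rank-1, mAP) in percent for a query-by-gallery score matrix."""
    rank1_hits, aps = 0, []
    for q, row in enumerate(scores):
        # Sort gallery indices from most to least similar to this query.
        order = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        matches = [gallery_ids[j] == query_ids[q] for j in order]
        rank1_hits += int(matches[0])  # is the top-ranked image correct?
        aps.append(average_precision(matches))
    n = len(scores)
    return 100.0 * rank1_hits / n, 100.0 * sum(aps) / n

# Toy data: two queries, three gallery images (identities "a", "b", "a").
scores = [[0.9, 0.1, 0.8],   # query "a": both "a" images ranked first
          [0.7, 0.2, 0.6]]   # query "b": the "b" image is ranked last
rank1, mean_ap = evaluate(scores, ["a", "b"], ["a", "b", "a"])
print(rank1, round(mean_ap, 1))
```

Rank-1 only checks the single top-ranked gallery image, whereas mAP rewards ranking every correct gallery image early, which is why the mAP values in the table are consistently lower than the Rank-1 values.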
