Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


MixVPR: Feature Mixing for Visual Place Recognition

Amar Ali-bey, Brahim Chaib-Draa, Philippe Giguère

2023-03-03 · Metric Learning · Visual Place Recognition · Autonomous Driving · Image Retrieval

Paper · PDF · Code (official)

Abstract

Visual Place Recognition (VPR) is a crucial part of mobile robotics and autonomous driving as well as other computer vision tasks. It refers to the process of identifying a place depicted in a query image using only computer vision. At large scale, repetitive structures, weather and illumination changes pose a real challenge, as appearances can drastically change over time. Along with tackling these challenges, an efficient VPR technique must also be practical in real-world scenarios where latency matters. To address this, we introduce MixVPR, a new holistic feature aggregation technique that takes feature maps from pre-trained backbones as a set of global features. Then, it incorporates a global relationship between elements in each feature map in a cascade of feature mixing, eliminating the need for local or pyramidal aggregation as done in NetVLAD or TransVPR. We demonstrate the effectiveness of our technique through extensive experiments on multiple large-scale benchmarks. Our method outperforms all existing techniques by a large margin while having less than half the number of parameters compared to CosPlace and NetVLAD. We achieve a new all-time high recall@1 score of 94.6% on Pitts250k-test, 88.0% on MapillarySLS, and more importantly, 58.4% on Nordland. Finally, our method outperforms two-stage retrieval techniques such as Patch-NetVLAD, TransVPR and SuperGLUE all while being orders of magnitude faster. Our code and trained models are available at https://github.com/amaralibey/MixVPR.
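The aggregation described above (flattening backbone feature maps into a set of global feature vectors, then mixing them through a cascade of MLP blocks with skip connections before projecting down to a compact descriptor) can be sketched in NumPy. This is an illustrative sketch under assumed shapes and random weights, not the authors' implementation; the layer sizes, `ratio`, and projection dimensions here are made up, and the real trained model lives in the official repository.

```python
import numpy as np

rng = np.random.default_rng(0)

def feature_mixer(X, W1, W2):
    """One mixer block (sketch): a row-wise MLP over the flattened
    spatial axis, with a residual (skip) connection."""
    hidden = np.maximum(X @ W1, 0.0)   # ReLU MLP mixing spatial positions
    return X + hidden @ W2             # skip connection

# Example backbone output shape (hypothetical): C channels, H x W spatial map,
# flattened into C "global" feature vectors of length N = H * W.
C, H, W = 128, 10, 13
N = H * W
X = rng.standard_normal((C, N))

# Cascade of L feature-mixer blocks (weights random here, trained in practice).
L, ratio = 4, 2
blocks = [(rng.standard_normal((N, N // ratio)) * 0.05,
           rng.standard_normal((N // ratio, N)) * 0.05) for _ in range(L)]
for W1, W2 in blocks:
    X = feature_mixer(X, W1, W2)

# Two projections shrink the mixed features to a compact global descriptor:
# one over the spatial axis, one over the channel axis (sizes illustrative).
row_proj = rng.standard_normal((N, 4)) * 0.05     # spatial -> 4 dims
depth_proj = rng.standard_normal((64, C)) * 0.05  # channels -> 64 dims
desc = (depth_proj @ (X @ row_proj)).flatten()
desc /= np.linalg.norm(desc)  # L2-normalise for nearest-neighbour retrieval

print(desc.shape)  # compact (256,) descriptor in this toy configuration
```

The point of the sketch is the structural claim in the abstract: each mixer block operates on whole flattened feature maps at once, so the global relationship between spatial positions is captured without any local or pyramidal pooling stage.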

Results

Task | Dataset | Metric | Value | Model
Visual Place Recognition | Nardo-Air R | Recall@1 | 76.06 | MixVPR
Visual Place Recognition | Oxford RobotCar Dataset | Recall@1 | 90.05 | MixVPR
Visual Place Recognition | Nardo-Air | Recall@1 | 32.39 | MixVPR
Visual Place Recognition | Nordland | Recall@1 | 76 | MixVPR
Visual Place Recognition | Nordland | Recall@5 | 89.2 | MixVPR
Visual Place Recognition | Mid-Atlantic Ridge | Recall@1 | 25.74 | MixVPR
Visual Place Recognition | St Lucia | Recall@1 | 99.66 | MixVPR
Visual Place Recognition | Pittsburgh-250k-test | Recall@1 | 94.6 | MixVPR
Visual Place Recognition | Pittsburgh-250k-test | Recall@10 | 99 | MixVPR
Visual Place Recognition | Pittsburgh-250k-test | Recall@5 | 98.3 | MixVPR
Visual Place Recognition | Hawkins | Recall@1 | 25.42 | MixVPR
Visual Place Recognition | Laurel Caverns | Recall@1 | 29.46 | MixVPR
Visual Place Recognition | Gardens Point | Recall@1 | 91.5 | MixVPR
Visual Place Recognition | SPED | Recall@1 | 85.2 | MixVPR
Visual Place Recognition | SPED | Recall@10 | 94.6 | MixVPR
Visual Place Recognition | SPED | Recall@5 | 92.1 | MixVPR
Visual Place Recognition | Pittsburgh-30k-test | Recall@1 | 91.52 | MixVPR
Visual Place Recognition | Pittsburgh-30k-test | Recall@5 | 95.9 | MixVPR
Visual Place Recognition | VP-Air | Recall@1 | 10.31 | MixVPR
Visual Place Recognition | Mapillary val | Recall@1 | 88.2 | MixVPR
Visual Place Recognition | Mapillary val | Recall@10 | 94.3 | MixVPR
Visual Place Recognition | Mapillary val | Recall@5 | 93.1 | MixVPR
Visual Place Recognition | Mapillary test | Recall@1 | 64 | MixVPR
Visual Place Recognition | Mapillary test | Recall@10 | 80.6 | MixVPR
Visual Place Recognition | Mapillary test | Recall@5 | 75.9 | MixVPR
Visual Place Recognition | 17 Places | Recall@1 | 63.79 | MixVPR
Visual Place Recognition | Baidu Mall | Recall@1 | 64.44 | MixVPR
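All of the results above use the Recall@K metric: the percentage of query images for which at least one correct database match appears among the K nearest retrieved descriptors. A small self-contained sketch (toy distances and ground truth, not data from any benchmark above) shows how the metric is computed:

```python
import numpy as np

def recall_at_k(dists, ground_truth, k):
    """Percentage of queries whose top-k retrieved database items
    contain at least one correct match (smaller distance = better)."""
    topk = np.argsort(dists, axis=1)[:, :k]
    hits = [len(set(topk[q]) & ground_truth[q]) > 0 for q in range(len(topk))]
    return 100.0 * np.mean(hits)

# Toy example: 3 queries vs. 5 database images (distance matrix).
dists = np.array([
    [0.1, 0.9, 0.8, 0.7, 0.6],  # query 0: nearest is db image 0
    [0.9, 0.8, 0.1, 0.7, 0.6],  # query 1: nearest is db image 2
    [0.9, 0.8, 0.7, 0.2, 0.1],  # query 2: nearest is db image 4
])
ground_truth = {0: {0}, 1: {3}, 2: {4}}  # correct db images per query

print(recall_at_k(dists, ground_truth, 1))  # query 1 misses at k=1
print(recall_at_k(dists, ground_truth, 5))  # all queries recovered by k=5
```

This is why Recall@10 is always at least as high as Recall@5 and Recall@1 in the table: enlarging K can only add matches, never remove them.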

Related Papers

Visual Place Recognition for Large-Scale UAV Applications (2025-07-20)
GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving (2025-07-19)
AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework (2025-07-18)
Unsupervised Ground Metric Learning (2025-07-17)
World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving (2025-07-17)
Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models (2025-07-17)
Channel-wise Motion Features for Efficient Motion Segmentation (2025-07-17)
LaViPlan: Language-Guided Visual Path Planning with RLVR (2025-07-17)