AnyLoc: Towards Universal Visual Place Recognition

Nikhil Keetha, Avneesh Mishra, Jay Karhade, Krishna Murthy Jatavallabhula, Sebastian Scherer, Madhava Krishna, Sourav Garg

2023-08-01Visual Place Recognition Image Retrieval

Paper PDF Code(official)

Abstract

Visual Place Recognition (VPR) is vital for robot localization. To date, the most performant VPR approaches are environment- and task-specific: while they exhibit strong performance in structured environments (predominantly urban driving), their performance degrades severely in unstructured environments, rendering most approaches brittle to robust real-world deployment. In this work, we develop a universal solution to VPR -- a technique that works across a broad range of structured and unstructured environments (urban, outdoors, indoors, aerial, underwater, and subterranean environments) without any re-training or fine-tuning. We demonstrate that general-purpose feature representations derived from off-the-shelf self-supervised models with no VPR-specific training are the right substrate upon which to build such a universal VPR solution. Combining these derived features with unsupervised feature aggregation enables our suite of methods, AnyLoc, to achieve up to 4X significantly higher performance than existing approaches. We further obtain a 6% improvement in performance by characterizing the semantic properties of these features, uncovering unique domains which encapsulate datasets from similar environments. Our detailed experiments and analysis lay a foundation for building VPR solutions that may be deployed anywhere, anytime, and across anyview. We encourage the readers to explore our project page and interactive demos: https://anyloc.github.io/.

Results

Task	Dataset	Metric	Value	Model
Visual Place Recognition	Nardo-Air R	Recall@1	94.37	AnyLoc-VLAD-DINO
Visual Place Recognition	Nardo-Air R	Recall@1	85.92	AnyLoc-VLAD-DINOv2
Visual Place Recognition	Nardo-Air R	Recall@1	61.97	CLIP
Visual Place Recognition	Oxford RobotCar Dataset	Recall@1	98.95	AnyLoc-VLAD-DINOv2
Visual Place Recognition	Oxford RobotCar Dataset	Recall@1	34.55	CLIP
Visual Place Recognition	Nardo-Air	Recall@1	76.06	AnyLoc-VLAD-DINOv2
Visual Place Recognition	Nardo-Air	Recall@1	42.25	CLIP
Visual Place Recognition	Mid-Atlantic Ridge	Recall@1	34.65	AnyLoc-VLAD-DINOv2
Visual Place Recognition	Mid-Atlantic Ridge	Recall@1	25.74	CLIP
Visual Place Recognition	St Lucia	Recall@1	96.17	AnyLoc-VLAD-DINOv2
Visual Place Recognition	St Lucia	Recall@1	62.7	CLIP
Visual Place Recognition	Hawkins	Recall@1	65.25	AnyLoc-VLAD-DINOv2
Visual Place Recognition	Hawkins	Recall@1	33.05	CLIP
Visual Place Recognition	Laurel Caverns	Recall@1	61.61	AnyLoc-VLAD-DINOv2
Visual Place Recognition	Laurel Caverns	Recall@1	36.61	CLIP
Visual Place Recognition	Gardens Point	Recall@1	95.5	AnyLoc-VLAD-DINOv2
Visual Place Recognition	Gardens Point	Recall@1	42.5	CLIP
Visual Place Recognition	Pittsburgh-30k-test	Recall@1	87.66	AnyLoc-VLAD-DINOv2
Visual Place Recognition	Pittsburgh-30k-test	Recall@1	54.97	CLIP
Visual Place Recognition	VP-Air	Recall@1	66.74	AnyLoc-VLAD-DINOv2
Visual Place Recognition	VP-Air	Recall@1	36.59	CLIP
Visual Place Recognition	17 Places	Recall@1	65.02	AnyLoc-VLAD-DINOv2
Visual Place Recognition	17 Places	Recall@1	59.36	CLIP
Visual Place Recognition	Baidu Mall	Recall@1	75.22	AnyLoc-VLAD-DINOv2
Visual Place Recognition	Baidu Mall	Recall@1	56.02	CLIP

AnyLoc: Towards Universal Visual Place Recognition

Abstract

Results

Related Papers

AnyLoc: Towards Universal Visual Place Recognition

Abstract

Results

Related Papers