TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Generalized Contrastive Optimization of Siamese Networks f...

Generalized Contrastive Optimization of Siamese Networks for Place Recognition

María Leyva-Vallina, Nicola Strisciuglio, Nicolai Petkov

2021-03-11Representation LearningVisual Place RecognitionImage Retrieval
PaperPDFCode(official)

Abstract

Visual place recognition is a challenging task in computer vision and a key component of camera-based localization and navigation systems. Recently, Convolutional Neural Networks (CNNs) achieved high results and good generalization capabilities. They are usually trained using pairs or triplets of images labeled as either similar or dissimilar, in a binary fashion. In practice, the similarity between two images is not binary, but continuous. Furthermore, training these CNNs is computationally complex and involves costly pair and triplet mining strategies. We propose a Generalized Contrastive loss (GCL) function that relies on image similarity as a continuous measure, and use it to train a siamese CNN. Furthermore, we present three techniques for automatic annotation of image pairs with labels indicating their degree of similarity, and deploy them to re-annotate the MSLS, TB-Places, and 7Scenes datasets. We demonstrate that siamese CNNs trained using the GCL function and the improved annotations consistently outperform their binary counterparts. Our models trained on MSLS outperform the state-of-the-art methods, including NetVLAD, NetVLAD-SARE, AP-GeM and Patch-NetVLAD, and generalize well on the Pittsburgh30k, Tokyo 24/7, RobotCar Seasons v2 and Extended CMU Seasons datasets. Furthermore, training a siamese network using the GCL function does not require complex pair mining. We release the source code at https://github.com/marialeyvallina/generalized_contrastive_loss.

Results

TaskDatasetMetricValueModel
Visual Place RecognitionPittsburgh-30k-testRecall@181.94GCL [trained only on MSLS]
Visual Place RecognitionPittsburgh-30k-testRecall@594.56GCL [trained only on MSLS]
Visual Place RecognitionTokyo247Recall@169.84GCL [trained only on MSLS]
Visual Place RecognitionTokyo247Recall@1080.63GCL [trained only on MSLS]
Visual Place RecognitionTokyo247Recall@584.76GCL [trained only on MSLS]
Visual Place RecognitionMapillary valRecall@180.9ResNeXt GCL + PCA
Visual Place RecognitionMapillary valRecall@1092.6ResNeXt GCL + PCA
Visual Place RecognitionMapillary valRecall@590.7ResNeXt GCL + PCA
Visual Place RecognitionMapillary valRecall@175.5ResNeXt GCL
Visual Place RecognitionMapillary valRecall@1088.5ResNeXt GCL
Visual Place RecognitionMapillary valRecall@586.1ResNeXt GCL
Visual Place RecognitionMapillary testRecall@162.3ResNeXt-GCL-PCA
Visual Place RecognitionMapillary testRecall@1081.1ResNeXt-GCL-PCA
Visual Place RecognitionMapillary testRecall@576.2ResNeXt-GCL-PCA
Visual Place RecognitionMapillary testRecall@156RexNeXt-GCL
Visual Place RecognitionMapillary testRecall@1075.1RexNeXt-GCL
Visual Place RecognitionMapillary testRecall@570.8RexNeXt-GCL

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20Visual Place Recognition for Large-Scale UAV Applications2025-07-20Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17FAR-Net: Multi-Stage Fusion Network with Enhanced Semantic Alignment and Adaptive Reconciliation for Composed Image Retrieval2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16