TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Localizing Objects with Self-Supervised Transformers and n...

Localizing Objects with Self-Supervised Transformers and no Labels

Oriane Siméoni, Gilles Puy, Huy V. Vo, Simon Roburin, Spyros Gidaris, Andrei Bursuc, Patrick Pérez, Renaud Marlet, Jean Ponce

2021-09-29Object DiscoverySingle-object discoveryWeakly-Supervised Object Localization
PaperPDFCodeCode(official)

Abstract

Localizing objects in image collections without supervision can help to avoid expensive annotation campaigns. We propose a simple approach to this problem, that leverages the activation features of a vision transformer pre-trained in a self-supervised manner. Our method, LOST, does not require any external object proposal nor any exploration of the image collection; it operates on a single image. Yet, we outperform state-of-the-art object discovery methods by up to 8 CorLoc points on PASCAL VOC 2012. We also show that training a class-agnostic detector on the discovered objects boosts results by another 7 points. Moreover, we show promising results on the unsupervised object discovery task. The code to reproduce our results can be found at https://github.com/valeoai/LOST.

Results

TaskDatasetMetricValueModel
Object Localization CUB-200-2011Top-1 Localization Accuracy71.3LOST
Single-object discoveryCOCO_20kCorLoc57.5LOST + CAD
Single-object discoveryCOCO_20kCorLoc50.7LOST

Related Papers

When Does Pruning Benefit Vision Representations?2025-07-02FORLA:Federated Object-centric Representation Learning with Slot Attention2025-06-03Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization2025-05-08Binding threshold units with artificial oscillatory neurons2025-05-06Hierarchical Compact Clustering Attention (COCA) for Unsupervised Object-Centric Learning2025-05-04Are We Done with Object-Centric Learning?2025-04-09PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI Localization2025-03-31CTRL-O: Language-Controllable Object-Centric Visual Representation Learning2025-03-27