On learning to localize objects with minimal supervision

Hyun Oh Song, Ross Girshick, Stefanie Jegelka, Julien Mairal, Zaid Harchaoui, Trevor Darrell

2014-03-05Weakly Supervised Object Detection

Abstract

Learning to localize objects with minimal supervision is an important problem in computer vision, since large fully annotated datasets are extremely costly to obtain. In this paper, we propose a new method that achieves this goal with only image-level labels of whether the objects are present or not. Our approach combines a discriminative submodular cover problem for automatically discovering a set of positive object windows with a smoothed latent SVM formulation. The latter allows us to leverage efficient quasi-Newton optimization techniques. Our experiments demonstrate that the proposed approach provides a 50% relative improvement in mean average precision over the current state-of-the-art on PASCAL VOC 2007 detection.

Results

Task	Dataset	Metric	Value	Model
Object Detection	PASCAL VOC 2007	MAP	22.7	Cover + SLSVM
3D	PASCAL VOC 2007	MAP	22.7	Cover + SLSVM
2D Classification	PASCAL VOC 2007	MAP	22.7	Cover + SLSVM
2D Object Detection	PASCAL VOC 2007	MAP	22.7	Cover + SLSVM
16k	PASCAL VOC 2007	MAP	22.7	Cover + SLSVM

Related Papers

Class Agnostic Instance-level Descriptor for Visual Instance Search2025-06-20 Self-Classification Enhancement and Correction for Weakly Supervised Object Detection2025-05-22 Enhancing Weakly-Supervised Object Detection on Static Images through (Hallucinated) Motion2024-09-15 Weakly Supervised Object Detection for Automatic Tooth-marked Tongue Recognition2024-08-29 Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video2024-07-22 HUWSOD: Holistic Self-training for Unified Weakly Supervised Object Detection2024-06-27 Smart Feature is What You Need2024-06-22 Negative Prototypes Guided Contrastive Learning for WSOD2024-06-04