Weakly Supervised Localization using Deep Feature Maps

Archith J. Bency, Heesung Kwon, Hyungtae Lee, S. Karthikeyan, B. S. Manjunath

2016-03-01
Weakly Supervised Object Detection, Object Recognition, Object Localization, General Classification

Abstract

Object localization is an important computer vision problem with a variety of applications. The lack of large-scale object-level annotations and the relative abundance of image-level labels make a compelling case for weak supervision in the object localization task. Deep Convolutional Neural Networks are a class of state-of-the-art methods for the related problem of object recognition. In this paper, we describe a novel object localization algorithm which uses classification networks trained only on image labels. This weakly supervised method leverages local spatial and semantic patterns captured in the convolutional layers of classification networks. We propose an efficient beam-search-based approach to detect and localize multiple objects in images. The proposed method significantly outperforms the state of the art on standard object localization datasets, with an 8-point increase in mAP scores.
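
The abstract does not spell out the search space or the scoring function, so the following is only a minimal sketch of what a beam search over candidate boxes could look like, not the authors' implementation. It assumes a 2D activation map (for example, a class-specific response pooled from a convolutional layer), a hypothetical score equal to the sum of activations inside the box minus a per-cell bias, and moves that shrink the box by one cell on one side. The names `box_score`, `shrink_moves`, `beam_search_box`, `beam_width`, and `bias` are illustrative assumptions.

```python
from typing import List, Tuple

# A box is (top, left, bottom, right) with half-open row/column ranges.
Box = Tuple[int, int, int, int]


def box_score(heat: List[List[float]], box: Box, bias: float) -> float:
    """Hypothetical score: activations inside the box, each reduced by `bias`.

    Cells above `bias` reward the box; cells below it penalize the box.
    """
    top, left, bottom, right = box
    return sum(
        heat[r][c] - bias
        for r in range(top, bottom)
        for c in range(left, right)
    )


def shrink_moves(box: Box) -> List[Box]:
    """All boxes obtained by trimming one row or column from one side."""
    top, left, bottom, right = box
    moves: List[Box] = []
    if bottom - top > 1:
        moves += [(top + 1, left, bottom, right), (top, left, bottom - 1, right)]
    if right - left > 1:
        moves += [(top, left + 1, bottom, right), (top, left, bottom, right - 1)]
    return moves


def beam_search_box(
    heat: List[List[float]], beam_width: int = 3, bias: float = 0.5
) -> Tuple[Box, float]:
    """Beam search from the full-map box toward a high-scoring sub-box."""
    start: Box = (0, 0, len(heat), len(heat[0]))
    beam = [start]
    seen = {start}
    best, best_score = start, box_score(heat, start, bias)

    while beam:
        # Expand every box in the beam, skipping boxes already visited.
        children = []
        for box in beam:
            for child in shrink_moves(box):
                if child not in seen:
                    seen.add(child)
                    children.append((box_score(heat, child, bias), child))
        if not children:
            break
        # Keep only the `beam_width` best children for the next round.
        children.sort(key=lambda item: item[0], reverse=True)
        beam = [child for _, child in children[:beam_width]]
        if children[0][0] > best_score:
            best_score, best = children[0]

    return best, best_score
```

Calling `beam_search_box(heat)` on a map with a single compact high-activation region returns the tightest box around that region together with its score. Localizing several objects, as the abstract mentions, would need an additional step (for instance, suppressing the found region and searching again), which this sketch leaves out.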

Results

Task	Dataset	Metric	Value	Model
Object Detection	COCO (Common Objects in Context)	MAP	47.9	Deep Feature Maps
3D	COCO (Common Objects in Context)	MAP	47.9	Deep Feature Maps
2D Classification	COCO (Common Objects in Context)	MAP	47.9	Deep Feature Maps
2D Object Detection	COCO (Common Objects in Context)	MAP	47.9	Deep Feature Maps
16k	COCO (Common Objects in Context)	MAP	47.9	Deep Feature Maps
