


Weakly Supervised Localization using Deep Feature Maps

Archith J. Bency, Heesung Kwon, Hyungtae Lee, S. Karthikeyan, B. S. Manjunath

2016-03-01
Tasks: Weakly Supervised Object Detection, Object Recognition, Object Localization, General Classification

Abstract

Object localization is an important computer vision problem with a variety of applications. The lack of large-scale object-level annotations and the relative abundance of image-level labels make a compelling case for weak supervision in the object localization task. Deep Convolutional Neural Networks are a class of state-of-the-art methods for the related problem of object recognition. In this paper, we describe a novel object localization algorithm which uses classification networks trained only on image labels. This weakly supervised method leverages local spatial and semantic patterns captured in the convolutional layers of classification networks. We propose an efficient beam-search-based approach to detect and localize multiple objects in images. The proposed method significantly outperforms the state of the art on standard object localization datasets, with an 8 point increase in mAP scores.
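The abstract names beam search as the localization strategy but gives no details, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a 2D grid of per-cell class scores (`heat`) standing in for evidence derived from a classification network, and it finds a single box rather than multiple objects. The scoring rule (mean score inside the box minus mean score outside), the shrink-one-side move set, and all function names are hypothetical choices made for illustration.

```python
def box_score(heat, box):
    """Hypothetical score: mean activation inside the box minus mean outside."""
    r0, c0, r1, c1 = box  # half-open ranges: rows r0..r1-1, cols c0..c1-1
    h, w = len(heat), len(heat[0])
    total = sum(sum(row) for row in heat)
    inside = sum(sum(row[c0:c1]) for row in heat[r0:r1])
    n_in = (r1 - r0) * (c1 - c0)
    n_out = h * w - n_in
    outside_mean = (total - inside) / n_out if n_out else 0.0
    return inside / n_in - outside_mean


def children(box):
    """Candidate refinements: shrink exactly one side of the box by one cell."""
    r0, c0, r1, c1 = box
    cands = [
        (r0 + 1, c0, r1, c1),
        (r0, c0 + 1, r1, c1),
        (r0, c0, r1 - 1, c1),
        (r0, c0, r1, c1 - 1),
    ]
    # Keep only boxes that still cover at least one cell.
    return [b for b in cands if b[2] - b[0] >= 1 and b[3] - b[1] >= 1]


def beam_search_box(heat, beam_width=3, max_steps=50):
    """Beam search from the full-image box toward a tight, high-scoring box."""
    start = (0, 0, len(heat), len(heat[0]))
    beam = [start]
    best, best_score = start, box_score(heat, start)
    seen = {start}
    for _ in range(max_steps):
        cands = []
        for b in beam:
            for ch in children(b):
                if ch not in seen:
                    seen.add(ch)
                    cands.append((box_score(heat, ch), ch))
        if not cands:
            break
        # Keep only the beam_width highest-scoring candidates for the next step.
        cands.sort(key=lambda t: t[0], reverse=True)
        beam = [b for _, b in cands[:beam_width]]
        if cands[0][0] > best_score:
            best_score, best = cands[0]
    return best, best_score


# Toy 8x8 score map with one 3x3 "object" at rows 2-4, cols 3-5.
heat = [[0.0] * 8 for _ in range(8)]
for r in range(2, 5):
    for c in range(3, 6):
        heat[r][c] = 1.0

best_box, best_val = beam_search_box(heat)
print(best_box, best_val)
```

Keeping several candidates per step instead of one lets the search recover from a locally attractive but wrong shrink, at a cost that grows linearly with the beam width. Extending this sketch to several objects would need an extra step, such as suppressing the found region and searching again, which is also an assumption and not something the abstract specifies.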

Results

Task                 Dataset                            Metric  Value  Model
Object Detection     COCO (Common Objects in Context)   MAP     47.9   Deep Feature Maps
3D                   COCO (Common Objects in Context)   MAP     47.9   Deep Feature Maps
2D Classification    COCO (Common Objects in Context)   MAP     47.9   Deep Feature Maps
2D Object Detection  COCO (Common Objects in Context)   MAP     47.9   Deep Feature Maps
16k                  COCO (Common Objects in Context)   MAP     47.9   Deep Feature Maps

Related Papers

GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing (2025-07-08)
Out-of-distribution detection in 3D applications: a review (2025-07-01)
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval (2025-06-28)
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding (2025-06-28)
RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base (2025-06-23)
Class Agnostic Instance-level Descriptor for Visual Instance Search (2025-06-20)
CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion (2025-06-17)
SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds (2025-06-16)