TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ContextLocNet: Context-Aware Deep Network Models for Weakl...

ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization

Vadim Kantorov, Maxime Oquab, Minsu Cho, Ivan Laptev

2016-09-14Weakly Supervised Object DetectionObject RecognitionObject LocalizationWeakly-Supervised Object Localization
PaperPDFCode

Abstract

We aim to localize objects in images using image-level supervision only. Previous approaches to this problem mainly focus on discriminative object regions and often fail to locate precise object boundaries. We address this problem by introducing two types of context-aware guidance models, additive and contrastive models, that leverage their surrounding context regions to improve localization. The additive model encourages the predicted object region to be supported by its surrounding context region. The contrastive model encourages the predicted object region to be outstanding from its surrounding context region. Our approach benefits from the recent success of convolutional neural networks for object recognition and extends Fast R-CNN to weakly supervised object localization. Extensive experimental evaluation on the PASCAL VOC 2007 and 2012 benchmarks shows hat our context-aware approach significantly improves weakly supervised localization and detection.

Results

TaskDatasetMetricValueModel
Object DetectionPASCAL VOC 2007MAP36.3WSDDN + context
Object DetectionPASCAL VOC 2012 testMAP35.3WSDDN + context
Object DetectionCharadesMAP1.12ContextLocNet
3DPASCAL VOC 2007MAP36.3WSDDN + context
3DPASCAL VOC 2012 testMAP35.3WSDDN + context
3DCharadesMAP1.12ContextLocNet
2D ClassificationPASCAL VOC 2007MAP36.3WSDDN + context
2D ClassificationPASCAL VOC 2012 testMAP35.3WSDDN + context
2D ClassificationCharadesMAP1.12ContextLocNet
2D Object DetectionPASCAL VOC 2007MAP36.3WSDDN + context
2D Object DetectionPASCAL VOC 2012 testMAP35.3WSDDN + context
2D Object DetectionCharadesMAP1.12ContextLocNet
16kPASCAL VOC 2007MAP36.3WSDDN + context
16kPASCAL VOC 2012 testMAP35.3WSDDN + context
16kCharadesMAP1.12ContextLocNet

Related Papers

GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing2025-07-08Out-of-distribution detection in 3D applications: a review2025-07-01Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval2025-06-28VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding2025-06-28RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base2025-06-23Class Agnostic Instance-level Descriptor for Visual Instance Search2025-06-20CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion2025-06-17SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds2025-06-16