TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards Precise End-to-end Weakly Supervised Object Detect...

Towards Precise End-to-end Weakly Supervised Object Detection Network

Ke Yang, Dongsheng Li, Yong Dou

2019-11-27ICCV 2019 10Weakly Supervised Object DetectionregressionMultiple Instance Learningobject-detectionObject Detection
PaperPDFCode

Abstract

It is challenging for weakly supervised object detection network to precisely predict the positions of the objects, since there are no instance-level category annotations. Most existing methods tend to solve this problem by using a two-phase learning procedure, i.e., multiple instance learning detector followed by a fully supervised learning detector with bounding-box regression. Based on our observation, this procedure may lead to local minima for some object categories. In this paper, we propose to jointly train the two phases in an end-to-end manner to tackle this problem. Specifically, we design a single network with both multiple instance learning and bounding-box regression branches that share the same backbone. Meanwhile, a guided attention module using classification loss is added to the backbone for effectively extracting the implicit location information in the features. Experimental results on public datasets show that our method achieves state-of-the-art performance.

Results

TaskDatasetMetricValueModel
Object DetectionPASCAL VOC 2007MAP54.5Our-Ens
Object DetectionPASCAL VOC 2012 testMAP49.5Our-Ens
3DPASCAL VOC 2007MAP54.5Our-Ens
3DPASCAL VOC 2012 testMAP49.5Our-Ens
2D ClassificationPASCAL VOC 2007MAP54.5Our-Ens
2D ClassificationPASCAL VOC 2012 testMAP49.5Our-Ens
2D Object DetectionPASCAL VOC 2007MAP54.5Our-Ens
2D Object DetectionPASCAL VOC 2012 testMAP49.5Our-Ens
16kPASCAL VOC 2007MAP54.5Our-Ens
16kPASCAL VOC 2012 testMAP49.5Our-Ens

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16