Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Soft Proposal Networks for Weakly Supervised Object Localization

Yi Zhu, Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao

Date: 2017-09-06
Venue: ICCV 2017 10
Tasks: Weakly Supervised Object Detection, Object Localization, Weakly-Supervised Object Localization

Abstract

Weakly supervised object localization remains challenging, where only image labels instead of bounding boxes are available during training. Object proposal is an effective component in localization, but often computationally expensive and incapable of joint optimization with some of the remaining modules. In this paper, to the best of our knowledge, we for the first time integrate weakly supervised object proposal into convolutional neural networks (CNNs) in an end-to-end learning manner. We design a network component, Soft Proposal (SP), to be plugged into any standard convolutional architecture to introduce the nearly cost-free object proposal, orders of magnitude faster than state-of-the-art methods. In the SP-augmented CNNs, referred to as Soft Proposal Networks (SPNs), iteratively evolved object proposals are generated based on the deep feature maps then projected back, and further jointly optimized with network parameters, with image-level supervision only. Through the unified learning process, SPNs learn better object-centric filters, discover more discriminative visual evidence, and suppress background interference, significantly boosting both weakly supervised object localization and classification performance. We report the best results on popular benchmarks, including PASCAL VOC, MS COCO, and ImageNet.
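
The abstract does not spell out how the Soft Proposal is computed, so the following is only a rough sketch of one way "iteratively evolved" proposals could be generated from a deep feature map and "projected back". It assumes a random-walk formulation over feature-map locations, where the transition weight between two locations grows with their feature dissimilarity and decays with their spatial distance. The function names `soft_proposal` and `project_back` and the parameters `num_iters` and `sigma` are hypothetical and chosen for illustration; this is not the authors' implementation, and the joint optimization with network parameters is omitted.

```python
import numpy as np


def soft_proposal(features, num_iters=20, sigma=None):
    """Return an (H, W) proposal map that sums to 1.

    features: array of shape (C, H, W), e.g. a CNN feature map.
    Assumption: proposals are the result of a random walk on a fully
    connected graph over the H*W feature-map locations.
    """
    c, h, w = features.shape
    n = h * w
    flat = features.reshape(c, n).T  # (n, C): one feature vector per location

    # Pairwise feature dissimilarity between all locations.
    feat_dist = np.linalg.norm(flat[:, None, :] - flat[None, :, :], axis=-1)

    # Gaussian spatial kernel so that nearby locations interact more strongly.
    ys, xs = np.divmod(np.arange(n), w)
    coords = np.stack([ys, xs], axis=1).astype(float)
    sq_dist = ((coords[:, None, :] - coords[None, :, :]) ** 2).sum(axis=-1)
    if sigma is None:
        sigma = 0.15 * max(h, w)  # hypothetical default, not from the source
    kernel = np.exp(-sq_dist / (2.0 * sigma ** 2))

    # Column-stochastic transition matrix; columns with no outgoing weight
    # (e.g. a constant feature map) fall back to a uniform distribution.
    d = feat_dist * kernel
    col_sums = d.sum(axis=0, keepdims=True)
    d = np.where(col_sums > 0, d / np.maximum(col_sums, 1e-12), 1.0 / n)

    # Start from a uniform map and iterate the walk.
    m = np.full(n, 1.0 / n)
    for _ in range(num_iters):
        m = d @ m
    m /= m.sum()  # guard against numerical drift
    return m.reshape(h, w)


def project_back(features, proposal):
    """Reweight every channel of the feature map by the proposal map."""
    return features * proposal[None, :, :]
```

In a network, a step like `project_back` would sit between convolutional layers so that gradients from the image-level loss flow through the reweighted features; here it is a plain NumPy multiplication.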

Results

Task                 Dataset                            Metric  Value  Model
Object Detection     COCO (Common Objects in Context)   MAP     55.3   SPNs
3D                   COCO (Common Objects in Context)   MAP     55.3   SPNs
2D Classification    COCO (Common Objects in Context)   MAP     55.3   SPNs
2D Object Detection  COCO (Common Objects in Context)   MAP     55.3   SPNs
16k                  COCO (Common Objects in Context)   MAP     55.3   SPNs

Related Papers

Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval (2025-06-28)
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding (2025-06-28)
RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base (2025-06-23)
Class Agnostic Instance-level Descriptor for Visual Instance Search (2025-06-20)
CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion (2025-06-17)
UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data (2025-06-16)
WoMAP: World Models For Embodied Open-Vocabulary Object Localization (2025-06-02)
Self-Classification Enhancement and Correction for Weakly Supervised Object Detection (2025-05-22)