TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Rethinking the Route Towards Weakly Supervised Object Loca...

Rethinking the Route Towards Weakly Supervised Object Localization

Chen-Lin Zhang, Yun-Hao Cao, Jianxin Wu

2020-02-26CVPR 2020 6Object LocalizationWeakly-Supervised Object LocalizationGeneral Classification
PaperPDFCode(official)

Abstract

Weakly supervised object localization (WSOL) aims to localize objects with only image-level labels. Previous methods often try to utilize feature maps and classification weights to localize objects using image level annotations indirectly. In this paper, we demonstrate that weakly supervised object localization should be divided into two parts: class-agnostic object localization and object classification. For class-agnostic object localization, we should use class-agnostic methods to generate noisy pseudo annotations and then perform bounding box regression on them without class labels. We propose the pseudo supervised object localization (PSOL) method as a new way to solve WSOL. Our PSOL models have good transferability across different datasets without fine-tuning. With generated pseudo bounding boxes, we achieve 58.00% localization accuracy on ImageNet and 74.97% localization accuracy on CUB-200, which have a large edge over previous models.

Results

TaskDatasetMetricValueModel
Object Localization CUB-200-2011Top-1 Localization Accuracy74.97PSOL-DenseNet161-Sep

Related Papers

Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval2025-06-28VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding2025-06-28RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base2025-06-23CDP: Towards Robust Autoregressive Visuomotor Policy Learning via Causal Diffusion2025-06-17UAV Object Detection and Positioning in a Mining Industrial Metaverse with Custom Geo-Referenced Data2025-06-16WoMAP: World Models For Embodied Open-Vocabulary Object Localization2025-06-02Multispectral Detection Transformer with Infrared-Centric Sensor Fusion2025-05-21Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels2025-05-20