TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PCL: Proposal Cluster Learning for Weakly Supervised Objec...

PCL: Proposal Cluster Learning for Weakly Supervised Object Detection

Peng Tang, Xinggang Wang, Song Bai, Wei Shen, Xiang Bai, Wenyu Liu, Alan Yuille

2018-07-09Weakly Supervised Object DetectionMultiple Instance LearningObject Recognitionobject-detectionObject Detection
PaperPDFCodeCodeCodeCode(official)

Abstract

Weakly Supervised Object Detection (WSOD), using only image-level annotations to train object detectors, is of growing importance in object recognition. In this paper, we propose a novel deep network for WSOD. Unlike previous networks that transfer the object detection problem to an image classification problem using Multiple Instance Learning (MIL), our strategy generates proposal clusters to learn refined instance classifiers by an iterative process. The proposals in the same cluster are spatially adjacent and associated with the same object. This prevents the network from concentrating too much on parts of objects instead of whole objects. We first show that instances can be assigned object or background labels directly based on proposal clusters for instance classifier refinement, and then show that treating each cluster as a small new bag yields fewer ambiguities than the directly assigning label method. The iterative instance classifier refinement is implemented online using multiple streams in convolutional neural networks, where the first is an MIL network and the others are for instance classifier refinement supervised by the preceding one. Experiments are conducted on the PASCAL VOC, ImageNet detection, and MS-COCO benchmarks for WSOD. Results show that our method outperforms the previous state of the art significantly.

Results

TaskDatasetMetricValueModel
Object DetectionHICO-DETMAP3.62PCL
Object DetectionPASCAL VOC 2007MAP48.8PCL-OB-G-Ens + FRCNN
Object DetectionPASCAL VOC 2012 testMAP44.2PCL-OB-G-Ens + FRCNN
Object DetectionCharadesMAP2.83PCL
Object DetectionImageNetMAP19.6PCL-OB-G-Ens + FRCNN
3DHICO-DETMAP3.62PCL
3DPASCAL VOC 2007MAP48.8PCL-OB-G-Ens + FRCNN
3DPASCAL VOC 2012 testMAP44.2PCL-OB-G-Ens + FRCNN
3DCharadesMAP2.83PCL
3DImageNetMAP19.6PCL-OB-G-Ens + FRCNN
2D ClassificationHICO-DETMAP3.62PCL
2D ClassificationPASCAL VOC 2007MAP48.8PCL-OB-G-Ens + FRCNN
2D ClassificationPASCAL VOC 2012 testMAP44.2PCL-OB-G-Ens + FRCNN
2D ClassificationCharadesMAP2.83PCL
2D ClassificationImageNetMAP19.6PCL-OB-G-Ens + FRCNN
2D Object DetectionHICO-DETMAP3.62PCL
2D Object DetectionPASCAL VOC 2007MAP48.8PCL-OB-G-Ens + FRCNN
2D Object DetectionPASCAL VOC 2012 testMAP44.2PCL-OB-G-Ens + FRCNN
2D Object DetectionCharadesMAP2.83PCL
2D Object DetectionImageNetMAP19.6PCL-OB-G-Ens + FRCNN
16kHICO-DETMAP3.62PCL
16kPASCAL VOC 2007MAP48.8PCL-OB-G-Ens + FRCNN
16kPASCAL VOC 2012 testMAP44.2PCL-OB-G-Ens + FRCNN
16kCharadesMAP2.83PCL
16kImageNetMAP19.6PCL-OB-G-Ens + FRCNN

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning2025-07-09GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing2025-07-08