TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Utilizing the Instability in Weakly Supervised Object Dete...

Utilizing the Instability in Weakly Supervised Object Detection

Yan Gao, Boxiao Liu, Nan Guo, Xiaochun Ye, Fang Wan, Haihang You, Dongrui Fan

2019-06-14Weakly Supervised Object DetectionMultiple Instance Learningobject-detectionObject Detection
PaperPDF

Abstract

Weakly supervised object detection (WSOD) focuses on training object detector with only image-level annotations, and is challenging due to the gap between the supervision and the objective. Most of existing approaches model WSOD as a multiple instance learning (MIL) problem. However, we observe that the result of MIL based detector is unstable, i.e., the most confident bounding boxes change significantly when using different initializations. We quantitatively demonstrate the instability by introducing a metric to measure it, and empirically analyze the reason of instability. Although the instability seems harmful for detection task, we argue that it can be utilized to improve the performance by fusing the results of differently initialized detectors. To implement this idea, we propose an end-to-end framework with multiple detection branches, and introduce a simple fusion strategy. We further propose an orthogonal initialization method to increase the difference between detection branches. By utilizing the instability, we achieve 52.6% and 48.0% mAP on the challenging PASCAL VOC 2007 and 2012 datasets, which are both the new state-of-the-arts.

Results

TaskDatasetMetricValueModel
Object DetectionPASCAL VOC 2007MAP52.6Ours+FRCNN
Object DetectionPASCAL VOC 2012 testMAP48Ours+FRCNN
3DPASCAL VOC 2007MAP52.6Ours+FRCNN
3DPASCAL VOC 2012 testMAP48Ours+FRCNN
2D ClassificationPASCAL VOC 2007MAP52.6Ours+FRCNN
2D ClassificationPASCAL VOC 2012 testMAP48Ours+FRCNN
2D Object DetectionPASCAL VOC 2007MAP52.6Ours+FRCNN
2D Object DetectionPASCAL VOC 2012 testMAP48Ours+FRCNN
16kPASCAL VOC 2007MAP52.6Ours+FRCNN
16kPASCAL VOC 2012 testMAP48Ours+FRCNN

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15GNN-ViTCap: GNN-Enhanced Multiple Instance Learning with Vision Transformers for Whole Slide Image Classification and Captioning2025-07-09ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08