TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Comprehensive Attention Self-Distillation for Weakly-Super...

Comprehensive Attention Self-Distillation for Weakly-Supervised Object Detection

Zeyi Huang, Yang Zou, Vijayakumar Bhagavatula, Dong Huang

2020-10-22NeurIPS 2020 12Weakly Supervised Object Detectionobject-detectionObject Detection
PaperPDFCode(official)

Abstract

Weakly Supervised Object Detection (WSOD) has emerged as an effective tool to train object detectors using only the image-level category labels. However, without object-level labels, WSOD detectors are prone to detect bounding boxes on salient objects, clustered objects and discriminative object parts. Moreover, the image-level category labels do not enforce consistent object detection across different transformations of the same images. To address the above issues, we propose a Comprehensive Attention Self-Distillation (CASD) training approach for WSOD. To balance feature learning among all object instances, CASD computes the comprehensive attention aggregated from multiple transformations and feature layers of the same images. To enforce consistent spatial supervision on objects, CASD conducts self-distillation on the WSOD networks, such that the comprehensive attention is approximated simultaneously by multiple transformations and feature layers of the same images. CASD produces new state-of-the-art WSOD results on standard benchmarks such as PASCAL VOC 2007/2012 and MS-COCO.

Results

TaskDatasetMetricValueModel
Object DetectionMSCOCOmAP13.9CASD(ResNet50)
Object DetectionMSCOCOmAP@5027.8CASD(ResNet50)
Object DetectionPASCAL VOC 2007MAP56.8CASD(VGG16)
Object DetectionPASCAL VOC 2012 testMAP53.6CASD(VGG16)
3DMSCOCOmAP13.9CASD(ResNet50)
3DMSCOCOmAP@5027.8CASD(ResNet50)
3DPASCAL VOC 2007MAP56.8CASD(VGG16)
3DPASCAL VOC 2012 testMAP53.6CASD(VGG16)
2D ClassificationMSCOCOmAP13.9CASD(ResNet50)
2D ClassificationMSCOCOmAP@5027.8CASD(ResNet50)
2D ClassificationPASCAL VOC 2007MAP56.8CASD(VGG16)
2D ClassificationPASCAL VOC 2012 testMAP53.6CASD(VGG16)
2D Object DetectionMSCOCOmAP13.9CASD(ResNet50)
2D Object DetectionMSCOCOmAP@5027.8CASD(ResNet50)
2D Object DetectionPASCAL VOC 2007MAP56.8CASD(VGG16)
2D Object DetectionPASCAL VOC 2012 testMAP53.6CASD(VGG16)
16kMSCOCOmAP13.9CASD(ResNet50)
16kMSCOCOmAP@5027.8CASD(ResNet50)
16kPASCAL VOC 2007MAP56.8CASD(VGG16)
16kPASCAL VOC 2012 testMAP53.6CASD(VGG16)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07