TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Gradient Harmonized Single-stage Detector

Gradient Harmonized Single-stage Detector

Buyu Li, Yu Liu, Xiaogang Wang

2018-11-13PhilosophyGeneral ClassificationObject Detection
PaperPDFCodeCodeCodeCode(official)CodeCodeCodeCodeCode

Abstract

Despite the great success of two-stage detectors, single-stage detector is still a more elegant and efficient way, yet suffers from the two well-known disharmonies during training, i.e. the huge difference in quantity between positive and negative examples as well as between easy and hard examples. In this work, we first point out that the essential effect of the two disharmonies can be summarized in term of the gradient. Further, we propose a novel gradient harmonizing mechanism (GHM) to be a hedging for the disharmonies. The philosophy behind GHM can be easily embedded into both classification loss function like cross-entropy (CE) and regression loss function like smooth-$L_1$ ($SL_1$) loss. To this end, two novel loss functions called GHM-C and GHM-R are designed to balancing the gradient flow for anchor classification and bounding box refinement, respectively. Ablation study on MS COCO demonstrates that without laborious hyper-parameter tuning, both GHM-C and GHM-R can bring substantial improvement for single-stage detector. Without any whistles and bells, our model achieves 41.6 mAP on COCO test-dev set which surpasses the state-of-the-art method, Focal Loss (FL) + $SL_1$, by 0.8.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5062.8GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
Object DetectionCOCO test-devAP7544.2GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
Object DetectionCOCO test-devAPL55.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
Object DetectionCOCO test-devAPM45.1GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
Object DetectionCOCO test-devAPS22.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
Object DetectionCOCO test-devbox mAP41.6GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
Object DetectionCOCO minivalAP5055.5GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
Object DetectionCOCO minivalAP7538.1GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
Object DetectionCOCO minivalAPL46.7GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
Object DetectionCOCO minivalAPM39.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
Object DetectionCOCO minivalAPS19.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
Object DetectionCOCO minivalbox AP35.8GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
3DCOCO test-devAP5062.8GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
3DCOCO test-devAP7544.2GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
3DCOCO test-devAPL55.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
3DCOCO test-devAPM45.1GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
3DCOCO test-devAPS22.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
3DCOCO test-devbox mAP41.6GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
3DCOCO minivalAP5055.5GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
3DCOCO minivalAP7538.1GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
3DCOCO minivalAPL46.7GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
3DCOCO minivalAPM39.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
3DCOCO minivalAPS19.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
3DCOCO minivalbox AP35.8GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D ClassificationCOCO test-devAP5062.8GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D ClassificationCOCO test-devAP7544.2GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D ClassificationCOCO test-devAPL55.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D ClassificationCOCO test-devAPM45.1GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D ClassificationCOCO test-devAPS22.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D ClassificationCOCO test-devbox mAP41.6GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D ClassificationCOCO minivalAP5055.5GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D ClassificationCOCO minivalAP7538.1GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D ClassificationCOCO minivalAPL46.7GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D ClassificationCOCO minivalAPM39.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D ClassificationCOCO minivalAPS19.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D ClassificationCOCO minivalbox AP35.8GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D Object DetectionCOCO test-devAP5062.8GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D Object DetectionCOCO test-devAP7544.2GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D Object DetectionCOCO test-devAPL55.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D Object DetectionCOCO test-devAPM45.1GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D Object DetectionCOCO test-devAPS22.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D Object DetectionCOCO test-devbox mAP41.6GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
2D Object DetectionCOCO minivalAP5055.5GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D Object DetectionCOCO minivalAP7538.1GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D Object DetectionCOCO minivalAPL46.7GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D Object DetectionCOCO minivalAPM39.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D Object DetectionCOCO minivalAPS19.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
2D Object DetectionCOCO minivalbox AP35.8GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
16kCOCO test-devAP5062.8GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
16kCOCO test-devAP7544.2GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
16kCOCO test-devAPL55.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
16kCOCO test-devAPM45.1GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
16kCOCO test-devAPS22.3GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
16kCOCO test-devbox mAP41.6GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
16kCOCO minivalAP5055.5GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
16kCOCO minivalAP7538.1GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
16kCOCO minivalAPL46.7GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
16kCOCO minivalAPM39.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
16kCOCO minivalAPS19.6GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
16kCOCO minivalbox AP35.8GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Streaming 4D Visual Geometry Transformer2025-07-15Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15AI-Reporter: A Path to a New Genre of Scientific Communication2025-07-08