Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training

Hongkai Zhang, Hong Chang, Bingpeng Ma, Naiyan Wang, Xilin Chen

2020-04-13 · ECCV 2020 8 · regression · Vocal Bursts Intensity Prediction · object-detection · Object Detection

Abstract

Although two-stage object detectors have continuously advanced the state-of-the-art performance in recent years, the training process itself is far from crystal clear. In this work, we first point out the inconsistency between the fixed network settings and the dynamic training procedure, which greatly affects performance. For example, the fixed label assignment strategy and regression loss function cannot fit the changing distribution of proposals and are thus harmful to training high-quality detectors. Consequently, we propose Dynamic R-CNN, which automatically adjusts the label assignment criterion (IoU threshold) and the shape of the regression loss function (parameters of SmoothL1 Loss) based on the statistics of proposals during training. This dynamic design makes better use of the training samples and pushes the detector to fit more high-quality samples. Specifically, our method improves upon the ResNet-50-FPN baseline by 1.9% AP and 5.5% AP$_{90}$ on the MS COCO dataset with no extra overhead. Code and models are available at https://github.com/hkzhang95/DynamicRCNN.
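The idea of adjusting the IoU threshold and the SmoothL1 shape parameter from proposal statistics can be illustrated with a minimal Python sketch. The abstract does not say which statistics are used or how often they are refreshed, so everything below is an assumption made for illustration: the class name `DynamicSettings`, the parameters `top_k_iou`, `top_k_err`, and `interval`, the choice of mean and median as aggregates, and the lower clamp on `beta`. It is not the authors' implementation, which lives in the repository linked above.

```python
from statistics import mean, median


def smooth_l1(x, beta):
    """SmoothL1 loss for one regression error x; beta > 0 sets where the
    quadratic region ends and the linear region begins."""
    ax = abs(x)
    if ax < beta:
        return 0.5 * ax * ax / beta
    return ax - 0.5 * beta


def kth_largest(values, k):
    """k-th largest element (1-based); falls back to the smallest if k is too big."""
    return sorted(values, reverse=True)[min(k, len(values)) - 1]


def kth_smallest(values, k):
    """k-th smallest element (1-based); falls back to the largest if k is too big."""
    return sorted(values)[min(k, len(values)) - 1]


class DynamicSettings:
    """Hypothetical helper that tracks proposal statistics and periodically
    refreshes the IoU threshold and the SmoothL1 beta."""

    def __init__(self, iou_thr=0.5, beta=1.0, top_k_iou=75, top_k_err=10, interval=100):
        self.iou_thr = iou_thr      # current positive/negative IoU threshold
        self.beta = beta            # current SmoothL1 shape parameter
        self.top_k_iou = top_k_iou  # assumed: which ranked IoU to record per iteration
        self.top_k_err = top_k_err  # assumed: which ranked error to record per iteration
        self.interval = interval    # assumed: iterations between updates
        self._iou_stats = []
        self._err_stats = []

    def record(self, proposal_ious, regression_errors):
        """Record one iteration; update the settings every `interval` calls."""
        self._iou_stats.append(kth_largest(proposal_ious, self.top_k_iou))
        abs_errors = [abs(e) for e in regression_errors]
        self._err_stats.append(kth_smallest(abs_errors, self.top_k_err))
        if len(self._iou_stats) == self.interval:
            # As proposals improve, the recorded IoUs rise and the errors shrink,
            # so the threshold moves up and beta moves down.
            self.iou_thr = mean(self._iou_stats)
            self.beta = max(median(self._err_stats), 1e-6)  # keep beta positive
            self._iou_stats.clear()
            self._err_stats.clear()


if __name__ == "__main__":
    settings = DynamicSettings(top_k_iou=2, top_k_err=2, interval=2)
    settings.record([0.9, 0.6, 0.3], [0.4, -0.1, 0.8])
    settings.record([0.8, 0.7, 0.2], [0.2, 0.3, 0.05])
    print(settings.iou_thr, settings.beta, smooth_l1(0.5, settings.beta))
```

In a real detector, `proposal_ious` would be the IoUs between proposals and their matched ground-truth boxes in one training iteration, and `regression_errors` the corresponding box regression residuals. The toy lists in the demo only show the update mechanics.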

Results

Task              Dataset        Metric   Value  Model
Object Detection  COCO test-dev  AP50     68.3   Dynamic R-CNN (ResNet-101-DCN, multi-scale)
Object Detection  COCO test-dev  AP75     55.6   Dynamic R-CNN (ResNet-101-DCN, multi-scale)
Object Detection  COCO test-dev  APL      61.2   Dynamic R-CNN (ResNet-101-DCN, multi-scale)
Object Detection  COCO test-dev  APM      53     Dynamic R-CNN (ResNet-101-DCN, multi-scale)
Object Detection  COCO test-dev  APS      32.8   Dynamic R-CNN (ResNet-101-DCN, multi-scale)
Object Detection  COCO test-dev  box mAP  50.1   Dynamic R-CNN (ResNet-101-DCN, multi-scale)

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression (2025-07-20)
A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection (2025-07-17)
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis (2025-07-17)
Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts (2025-07-16)
Imbalanced Regression Pipeline Recommendation (2025-07-16)
Second-Order Bounds for [0,1]-Valued Regression via Betting Loss (2025-07-16)