TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Bounding Box Regression with Uncertainty for Accurate Obje...

Bounding Box Regression with Uncertainty for Accurate Object Detection

Yihui He, Chenchen Zhu, Jianren Wang, Marios Savvides, Xiangyu Zhang

2018-09-23CVPR 2019 6regressionObject Localizationobject-detectionObject Detection
PaperPDFCodeCode(official)CodeCode(official)

Abstract

Large-scale object detection datasets (e.g., MS-COCO) try to define the ground truth bounding boxes as clear as possible. However, we observe that ambiguities are still introduced when labeling the bounding boxes. In this paper, we propose a novel bounding box regression loss for learning bounding box transformation and localization variance together. Our loss greatly improves the localization accuracies of various architectures with nearly no additional computation. The learned localization variance allows us to merge neighboring bounding boxes during non-maximum suppression (NMS), which further improves the localization performance. On MS-COCO, we boost the Average Precision (AP) of VGG-16 Faster R-CNN from 23.6% to 29.1%. More importantly, for ResNet-50-FPN Mask R-CNN, our method improves the AP and AP90 by 1.8% and 6.2% respectively, which significantly outperforms previous state-of-the-art bounding box refinement methods. Our code and models are available at: github.com/yihui-he/KL-Loss

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devbox mAP40.4ResNet-50-FPN Mask R-CNN + KL Loss + var voting + soft-NMS
3DCOCO test-devbox mAP40.4ResNet-50-FPN Mask R-CNN + KL Loss + var voting + soft-NMS
2D ClassificationCOCO test-devbox mAP40.4ResNet-50-FPN Mask R-CNN + KL Loss + var voting + soft-NMS
2D Object DetectionCOCO test-devbox mAP40.4ResNet-50-FPN Mask R-CNN + KL Loss + var voting + soft-NMS
16kCOCO test-devbox mAP40.4ResNet-50-FPN Mask R-CNN + KL Loss + var voting + soft-NMS

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16