TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Generalized Focal Loss V2: Learning Reliable Localization ...

Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection

Xiang Li, Wenhai Wang, Xiaolin Hu, Jun Li, Jinhui Tang, Jian Yang

2020-11-25CVPR 2021 1Dense Object Detectionobject-detectionObject Detection
PaperPDFCode(official)CodeCodeCodeCode

Abstract

Localization Quality Estimation (LQE) is crucial and popular in the recent advancement of dense object detectors since it can provide accurate ranking scores that benefit the Non-Maximum Suppression processing and improve detection performance. As a common practice, most existing methods predict LQE scores through vanilla convolutional features shared with object classification or bounding box regression. In this paper, we explore a completely novel and different perspective to perform LQE -- based on the learned distributions of the four parameters of the bounding box. The bounding box distributions are inspired and introduced as "General Distribution" in GFLV1, which describes the uncertainty of the predicted bounding boxes well. Such a property makes the distribution statistics of a bounding box highly correlated to its real localization quality. Specifically, a bounding box distribution with a sharp peak usually corresponds to high localization quality, and vice versa. By leveraging the close correlation between distribution statistics and the real localization quality, we develop a considerably lightweight Distribution-Guided Quality Predictor (DGQP) for reliable LQE based on GFLV1, thus producing GFLV2. To our best knowledge, it is the first attempt in object detection to use a highly relevant, statistical representation to facilitate LQE. Extensive experiments demonstrate the effectiveness of our method. Notably, GFLV2 (ResNet-101) achieves 46.2 AP at 14.6 FPS, surpassing the previous state-of-the-art ATSS baseline (43.6 AP at 14.6 FPS) by absolute 2.6 AP on COCO {\tt test-dev}, without sacrificing the efficiency both in training and inference. Code will be available at https://github.com/implus/GFocalV2.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO test-devAP5070.9GFLV2 (Res2Net-101, DCN, multiscale)
Object DetectionCOCO test-devAP7559.2GFLV2 (Res2Net-101, DCN, multiscale)
Object DetectionCOCO test-devAPL65.6GFLV2 (Res2Net-101, DCN, multiscale)
Object DetectionCOCO test-devAPM56.1GFLV2 (Res2Net-101, DCN, multiscale)
Object DetectionCOCO test-devAPS35.7GFLV2 (Res2Net-101, DCN, multiscale)
Object DetectionCOCO test-devbox mAP53.3GFLV2 (Res2Net-101, DCN, multiscale)
Object DetectionCOCO test-devAP5069GFLV2 (Res2Net-101, DCN)
Object DetectionCOCO test-devAP7555.3GFLV2 (Res2Net-101, DCN)
Object DetectionCOCO test-devAPL63.5GFLV2 (Res2Net-101, DCN)
Object DetectionCOCO test-devAPM54.3GFLV2 (Res2Net-101, DCN)
Object DetectionCOCO test-devAPS31.3GFLV2 (Res2Net-101, DCN)
Object DetectionCOCO test-devbox mAP50.6GFLV2 (Res2Net-101, DCN)
Object DetectionCOCO test-devAP5067.6GFLV2 (ResNeXt-101, 32x4d, DCN)
Object DetectionCOCO test-devAP7553.5GFLV2 (ResNeXt-101, 32x4d, DCN)
Object DetectionCOCO test-devAPL61.4GFLV2 (ResNeXt-101, 32x4d, DCN)
Object DetectionCOCO test-devAPM52.4GFLV2 (ResNeXt-101, 32x4d, DCN)
Object DetectionCOCO test-devAPS29.7GFLV2 (ResNeXt-101, 32x4d, DCN)
Object DetectionCOCO test-devbox mAP49GFLV2 (ResNeXt-101, 32x4d, DCN)
Object DetectionCOCO test-devAP5066.5GFLV2 (ResNet-101-DCN)
Object DetectionCOCO test-devAP7552.8GFLV2 (ResNet-101-DCN)
Object DetectionCOCO test-devAPL60.7GFLV2 (ResNet-101-DCN)
Object DetectionCOCO test-devAPM51.9GFLV2 (ResNet-101-DCN)
Object DetectionCOCO test-devAPS28.8GFLV2 (ResNet-101-DCN)
Object DetectionCOCO test-devbox mAP48.3GFLV2 (ResNet-101-DCN)
Object DetectionCOCO test-devAP5064.3GFLV2 (ResNet-101)
Object DetectionCOCO test-devAP7550.5GFLV2 (ResNet-101)
Object DetectionCOCO test-devAPL57GFLV2 (ResNet-101)
Object DetectionCOCO test-devAPM49.9GFLV2 (ResNet-101)
Object DetectionCOCO test-devAPS27.8GFLV2 (ResNet-101)
Object DetectionCOCO test-devbox mAP46.2GFLV2 (ResNet-101)
Object DetectionCOCO test-devAP5062.3GFLV2 (ResNet-50)
Object DetectionCOCO test-devAP7548.5GFLV2 (ResNet-50)
Object DetectionCOCO test-devAPL54.1GFLV2 (ResNet-50)
Object DetectionCOCO test-devAPM47.7GFLV2 (ResNet-50)
Object DetectionCOCO test-devAPS26.8GFLV2 (ResNet-50)
Object DetectionCOCO test-devbox mAP44.3GFLV2 (ResNet-50)
Object DetectionCOCO-OAverage mAP25.1GFLv2 (R2-101-DCN)
Object DetectionCOCO-OEffective Robustness2.6GFLv2 (R2-101-DCN)
3DCOCO test-devAP5070.9GFLV2 (Res2Net-101, DCN, multiscale)
3DCOCO test-devAP7559.2GFLV2 (Res2Net-101, DCN, multiscale)
3DCOCO test-devAPL65.6GFLV2 (Res2Net-101, DCN, multiscale)
3DCOCO test-devAPM56.1GFLV2 (Res2Net-101, DCN, multiscale)
3DCOCO test-devAPS35.7GFLV2 (Res2Net-101, DCN, multiscale)
3DCOCO test-devbox mAP53.3GFLV2 (Res2Net-101, DCN, multiscale)
3DCOCO test-devAP5069GFLV2 (Res2Net-101, DCN)
3DCOCO test-devAP7555.3GFLV2 (Res2Net-101, DCN)
3DCOCO test-devAPL63.5GFLV2 (Res2Net-101, DCN)
3DCOCO test-devAPM54.3GFLV2 (Res2Net-101, DCN)
3DCOCO test-devAPS31.3GFLV2 (Res2Net-101, DCN)
3DCOCO test-devbox mAP50.6GFLV2 (Res2Net-101, DCN)
3DCOCO test-devAP5067.6GFLV2 (ResNeXt-101, 32x4d, DCN)
3DCOCO test-devAP7553.5GFLV2 (ResNeXt-101, 32x4d, DCN)
3DCOCO test-devAPL61.4GFLV2 (ResNeXt-101, 32x4d, DCN)
3DCOCO test-devAPM52.4GFLV2 (ResNeXt-101, 32x4d, DCN)
3DCOCO test-devAPS29.7GFLV2 (ResNeXt-101, 32x4d, DCN)
3DCOCO test-devbox mAP49GFLV2 (ResNeXt-101, 32x4d, DCN)
3DCOCO test-devAP5066.5GFLV2 (ResNet-101-DCN)
3DCOCO test-devAP7552.8GFLV2 (ResNet-101-DCN)
3DCOCO test-devAPL60.7GFLV2 (ResNet-101-DCN)
3DCOCO test-devAPM51.9GFLV2 (ResNet-101-DCN)
3DCOCO test-devAPS28.8GFLV2 (ResNet-101-DCN)
3DCOCO test-devbox mAP48.3GFLV2 (ResNet-101-DCN)
3DCOCO test-devAP5064.3GFLV2 (ResNet-101)
3DCOCO test-devAP7550.5GFLV2 (ResNet-101)
3DCOCO test-devAPL57GFLV2 (ResNet-101)
3DCOCO test-devAPM49.9GFLV2 (ResNet-101)
3DCOCO test-devAPS27.8GFLV2 (ResNet-101)
3DCOCO test-devbox mAP46.2GFLV2 (ResNet-101)
3DCOCO test-devAP5062.3GFLV2 (ResNet-50)
3DCOCO test-devAP7548.5GFLV2 (ResNet-50)
3DCOCO test-devAPL54.1GFLV2 (ResNet-50)
3DCOCO test-devAPM47.7GFLV2 (ResNet-50)
3DCOCO test-devAPS26.8GFLV2 (ResNet-50)
3DCOCO test-devbox mAP44.3GFLV2 (ResNet-50)
3DCOCO-OAverage mAP25.1GFLv2 (R2-101-DCN)
3DCOCO-OEffective Robustness2.6GFLv2 (R2-101-DCN)
2D ClassificationCOCO test-devAP5070.9GFLV2 (Res2Net-101, DCN, multiscale)
2D ClassificationCOCO test-devAP7559.2GFLV2 (Res2Net-101, DCN, multiscale)
2D ClassificationCOCO test-devAPL65.6GFLV2 (Res2Net-101, DCN, multiscale)
2D ClassificationCOCO test-devAPM56.1GFLV2 (Res2Net-101, DCN, multiscale)
2D ClassificationCOCO test-devAPS35.7GFLV2 (Res2Net-101, DCN, multiscale)
2D ClassificationCOCO test-devbox mAP53.3GFLV2 (Res2Net-101, DCN, multiscale)
2D ClassificationCOCO test-devAP5069GFLV2 (Res2Net-101, DCN)
2D ClassificationCOCO test-devAP7555.3GFLV2 (Res2Net-101, DCN)
2D ClassificationCOCO test-devAPL63.5GFLV2 (Res2Net-101, DCN)
2D ClassificationCOCO test-devAPM54.3GFLV2 (Res2Net-101, DCN)
2D ClassificationCOCO test-devAPS31.3GFLV2 (Res2Net-101, DCN)
2D ClassificationCOCO test-devbox mAP50.6GFLV2 (Res2Net-101, DCN)
2D ClassificationCOCO test-devAP5067.6GFLV2 (ResNeXt-101, 32x4d, DCN)
2D ClassificationCOCO test-devAP7553.5GFLV2 (ResNeXt-101, 32x4d, DCN)
2D ClassificationCOCO test-devAPL61.4GFLV2 (ResNeXt-101, 32x4d, DCN)
2D ClassificationCOCO test-devAPM52.4GFLV2 (ResNeXt-101, 32x4d, DCN)
2D ClassificationCOCO test-devAPS29.7GFLV2 (ResNeXt-101, 32x4d, DCN)
2D ClassificationCOCO test-devbox mAP49GFLV2 (ResNeXt-101, 32x4d, DCN)
2D ClassificationCOCO test-devAP5066.5GFLV2 (ResNet-101-DCN)
2D ClassificationCOCO test-devAP7552.8GFLV2 (ResNet-101-DCN)
2D ClassificationCOCO test-devAPL60.7GFLV2 (ResNet-101-DCN)
2D ClassificationCOCO test-devAPM51.9GFLV2 (ResNet-101-DCN)
2D ClassificationCOCO test-devAPS28.8GFLV2 (ResNet-101-DCN)
2D ClassificationCOCO test-devbox mAP48.3GFLV2 (ResNet-101-DCN)
2D ClassificationCOCO test-devAP5064.3GFLV2 (ResNet-101)
2D ClassificationCOCO test-devAP7550.5GFLV2 (ResNet-101)
2D ClassificationCOCO test-devAPL57GFLV2 (ResNet-101)
2D ClassificationCOCO test-devAPM49.9GFLV2 (ResNet-101)
2D ClassificationCOCO test-devAPS27.8GFLV2 (ResNet-101)
2D ClassificationCOCO test-devbox mAP46.2GFLV2 (ResNet-101)
2D ClassificationCOCO test-devAP5062.3GFLV2 (ResNet-50)
2D ClassificationCOCO test-devAP7548.5GFLV2 (ResNet-50)
2D ClassificationCOCO test-devAPL54.1GFLV2 (ResNet-50)
2D ClassificationCOCO test-devAPM47.7GFLV2 (ResNet-50)
2D ClassificationCOCO test-devAPS26.8GFLV2 (ResNet-50)
2D ClassificationCOCO test-devbox mAP44.3GFLV2 (ResNet-50)
2D ClassificationCOCO-OAverage mAP25.1GFLv2 (R2-101-DCN)
2D ClassificationCOCO-OEffective Robustness2.6GFLv2 (R2-101-DCN)
2D Object DetectionCOCO test-devAP5070.9GFLV2 (Res2Net-101, DCN, multiscale)
2D Object DetectionCOCO test-devAP7559.2GFLV2 (Res2Net-101, DCN, multiscale)
2D Object DetectionCOCO test-devAPL65.6GFLV2 (Res2Net-101, DCN, multiscale)
2D Object DetectionCOCO test-devAPM56.1GFLV2 (Res2Net-101, DCN, multiscale)
2D Object DetectionCOCO test-devAPS35.7GFLV2 (Res2Net-101, DCN, multiscale)
2D Object DetectionCOCO test-devbox mAP53.3GFLV2 (Res2Net-101, DCN, multiscale)
2D Object DetectionCOCO test-devAP5069GFLV2 (Res2Net-101, DCN)
2D Object DetectionCOCO test-devAP7555.3GFLV2 (Res2Net-101, DCN)
2D Object DetectionCOCO test-devAPL63.5GFLV2 (Res2Net-101, DCN)
2D Object DetectionCOCO test-devAPM54.3GFLV2 (Res2Net-101, DCN)
2D Object DetectionCOCO test-devAPS31.3GFLV2 (Res2Net-101, DCN)
2D Object DetectionCOCO test-devbox mAP50.6GFLV2 (Res2Net-101, DCN)
2D Object DetectionCOCO test-devAP5067.6GFLV2 (ResNeXt-101, 32x4d, DCN)
2D Object DetectionCOCO test-devAP7553.5GFLV2 (ResNeXt-101, 32x4d, DCN)
2D Object DetectionCOCO test-devAPL61.4GFLV2 (ResNeXt-101, 32x4d, DCN)
2D Object DetectionCOCO test-devAPM52.4GFLV2 (ResNeXt-101, 32x4d, DCN)
2D Object DetectionCOCO test-devAPS29.7GFLV2 (ResNeXt-101, 32x4d, DCN)
2D Object DetectionCOCO test-devbox mAP49GFLV2 (ResNeXt-101, 32x4d, DCN)
2D Object DetectionCOCO test-devAP5066.5GFLV2 (ResNet-101-DCN)
2D Object DetectionCOCO test-devAP7552.8GFLV2 (ResNet-101-DCN)
2D Object DetectionCOCO test-devAPL60.7GFLV2 (ResNet-101-DCN)
2D Object DetectionCOCO test-devAPM51.9GFLV2 (ResNet-101-DCN)
2D Object DetectionCOCO test-devAPS28.8GFLV2 (ResNet-101-DCN)
2D Object DetectionCOCO test-devbox mAP48.3GFLV2 (ResNet-101-DCN)
2D Object DetectionCOCO test-devAP5064.3GFLV2 (ResNet-101)
2D Object DetectionCOCO test-devAP7550.5GFLV2 (ResNet-101)
2D Object DetectionCOCO test-devAPL57GFLV2 (ResNet-101)
2D Object DetectionCOCO test-devAPM49.9GFLV2 (ResNet-101)
2D Object DetectionCOCO test-devAPS27.8GFLV2 (ResNet-101)
2D Object DetectionCOCO test-devbox mAP46.2GFLV2 (ResNet-101)
2D Object DetectionCOCO test-devAP5062.3GFLV2 (ResNet-50)
2D Object DetectionCOCO test-devAP7548.5GFLV2 (ResNet-50)
2D Object DetectionCOCO test-devAPL54.1GFLV2 (ResNet-50)
2D Object DetectionCOCO test-devAPM47.7GFLV2 (ResNet-50)
2D Object DetectionCOCO test-devAPS26.8GFLV2 (ResNet-50)
2D Object DetectionCOCO test-devbox mAP44.3GFLV2 (ResNet-50)
2D Object DetectionCOCO-OAverage mAP25.1GFLv2 (R2-101-DCN)
2D Object DetectionCOCO-OEffective Robustness2.6GFLv2 (R2-101-DCN)
16kCOCO test-devAP5070.9GFLV2 (Res2Net-101, DCN, multiscale)
16kCOCO test-devAP7559.2GFLV2 (Res2Net-101, DCN, multiscale)
16kCOCO test-devAPL65.6GFLV2 (Res2Net-101, DCN, multiscale)
16kCOCO test-devAPM56.1GFLV2 (Res2Net-101, DCN, multiscale)
16kCOCO test-devAPS35.7GFLV2 (Res2Net-101, DCN, multiscale)
16kCOCO test-devbox mAP53.3GFLV2 (Res2Net-101, DCN, multiscale)
16kCOCO test-devAP5069GFLV2 (Res2Net-101, DCN)
16kCOCO test-devAP7555.3GFLV2 (Res2Net-101, DCN)
16kCOCO test-devAPL63.5GFLV2 (Res2Net-101, DCN)
16kCOCO test-devAPM54.3GFLV2 (Res2Net-101, DCN)
16kCOCO test-devAPS31.3GFLV2 (Res2Net-101, DCN)
16kCOCO test-devbox mAP50.6GFLV2 (Res2Net-101, DCN)
16kCOCO test-devAP5067.6GFLV2 (ResNeXt-101, 32x4d, DCN)
16kCOCO test-devAP7553.5GFLV2 (ResNeXt-101, 32x4d, DCN)
16kCOCO test-devAPL61.4GFLV2 (ResNeXt-101, 32x4d, DCN)
16kCOCO test-devAPM52.4GFLV2 (ResNeXt-101, 32x4d, DCN)
16kCOCO test-devAPS29.7GFLV2 (ResNeXt-101, 32x4d, DCN)
16kCOCO test-devbox mAP49GFLV2 (ResNeXt-101, 32x4d, DCN)
16kCOCO test-devAP5066.5GFLV2 (ResNet-101-DCN)
16kCOCO test-devAP7552.8GFLV2 (ResNet-101-DCN)
16kCOCO test-devAPL60.7GFLV2 (ResNet-101-DCN)
16kCOCO test-devAPM51.9GFLV2 (ResNet-101-DCN)
16kCOCO test-devAPS28.8GFLV2 (ResNet-101-DCN)
16kCOCO test-devbox mAP48.3GFLV2 (ResNet-101-DCN)
16kCOCO test-devAP5064.3GFLV2 (ResNet-101)
16kCOCO test-devAP7550.5GFLV2 (ResNet-101)
16kCOCO test-devAPL57GFLV2 (ResNet-101)
16kCOCO test-devAPM49.9GFLV2 (ResNet-101)
16kCOCO test-devAPS27.8GFLV2 (ResNet-101)
16kCOCO test-devbox mAP46.2GFLV2 (ResNet-101)
16kCOCO test-devAP5062.3GFLV2 (ResNet-50)
16kCOCO test-devAP7548.5GFLV2 (ResNet-50)
16kCOCO test-devAPL54.1GFLV2 (ResNet-50)
16kCOCO test-devAPM47.7GFLV2 (ResNet-50)
16kCOCO test-devAPS26.8GFLV2 (ResNet-50)
16kCOCO test-devbox mAP44.3GFLV2 (ResNet-50)
16kCOCO-OAverage mAP25.1GFLv2 (R2-101-DCN)
16kCOCO-OEffective Robustness2.6GFLv2 (R2-101-DCN)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07