TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/VarifocalNet: An IoU-aware Dense Object Detector

VarifocalNet: An IoU-aware Dense Object Detector

Haoyang Zhang, Ying Wang, Feras Dayoub, Niko Sünderhauf

2020-08-31CVPR 2021 1General ClassificationObject Detection
PaperPDFCodeCodeCodeCode(official)

Abstract

Accurately ranking the vast number of candidate detections is crucial for dense object detectors to achieve high performance. Prior work uses the classification score or a combination of classification and predicted localization scores to rank candidates. However, neither option results in a reliable ranking, thus degrading detection performance. In this paper, we propose to learn an Iou-aware Classification Score (IACS) as a joint representation of object presence confidence and localization accuracy. We show that dense object detectors can achieve a more accurate ranking of candidate detections based on the IACS. We design a new loss function, named Varifocal Loss, to train a dense object detector to predict the IACS, and propose a new star-shaped bounding box feature representation for IACS prediction and bounding box refinement. Combining these two new components and a bounding box refinement branch, we build an IoU-aware dense object detector based on the FCOS+ATSS architecture, that we call VarifocalNet or VFNet for short. Extensive experiments on MS COCO show that our VFNet consistently surpasses the strong baseline by $\sim$2.0 AP with different backbones. Our best model VFNet-X-1200 with Res2Net-101-DCN achieves a single-model single-scale AP of 55.1 on COCO test-dev, which is state-of-the-art among various object detectors.Code is available at https://github.com/hyz-xmaster/VarifocalNet .

Results

TaskDatasetMetricValueModel
Object DetectionCOCO-OAverage mAP28VFNet (RX-101-64x4d)
Object DetectionCOCO-OEffective Robustness5.27VFNet (RX-101-64x4d)
3DCOCO-OAverage mAP28VFNet (RX-101-64x4d)
3DCOCO-OEffective Robustness5.27VFNet (RX-101-64x4d)
2D ClassificationCOCO-OAverage mAP28VFNet (RX-101-64x4d)
2D ClassificationCOCO-OEffective Robustness5.27VFNet (RX-101-64x4d)
2D Object DetectionCOCO-OAverage mAP28VFNet (RX-101-64x4d)
2D Object DetectionCOCO-OEffective Robustness5.27VFNet (RX-101-64x4d)
16kCOCO-OAverage mAP28VFNet (RX-101-64x4d)
16kCOCO-OEffective Robustness5.27VFNet (RX-101-64x4d)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07