TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Generating Positive Bounding Boxes for Balanced Training o...

Generating Positive Bounding Boxes for Balanced Training of Object Detectors

Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan

2019-09-21Object Detection
PaperPDFCode(official)

Abstract

Two-stage deep object detectors generate a set of regions-of-interest (RoI) in the first stage, then, in the second stage, identify objects among the proposed RoIs that sufficiently overlap with a ground truth (GT) box. The second stage is known to suffer from a bias towards RoIs that have low intersection-over-union (IoU) with the associated GT boxes. To address this issue, we first propose a sampling method to generate bounding boxes (BB) that overlap with a given reference box more than a given IoU threshold. Then, we use this BB generation method to develop a positive RoI (pRoI) generator that produces RoIs following any desired spatial or IoU distribution, for the second-stage. We show that our pRoI generator is able to simulate other sampling methods for positive examples such as hard example mining and prime sampling. Using our generator as an analysis tool, we show that (i) IoU imbalance has an adverse effect on performance, (ii) hard positive example mining improves the performance only for certain input IoU distributions, and (iii) the imbalance among the foreground classes has an adverse effect on performance and that it can be alleviated at the batch level. Finally, we train Faster R-CNN using our pRoI generator and, compared to conventional training, obtain better or on-par performance for low IoUs and significant improvements when trained for higher IoUs for Pascal VOC and MS COCO datasets. The code is available at: https://github.com/kemaloksuz/BoundingBoxGenerator.

Results

TaskDatasetMetricValueModel
Object DetectionCOCO minivalAP5055.3Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)
Object DetectionCOCO minivalbox AP35.6Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)
3DCOCO minivalAP5055.3Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)
3DCOCO minivalbox AP35.6Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)
2D ClassificationCOCO minivalAP5055.3Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)
2D ClassificationCOCO minivalbox AP35.6Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)
2D Object DetectionCOCO minivalAP5055.3Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)
2D Object DetectionCOCO minivalbox AP35.6Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)
16kCOCO minivalAP5055.3Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)
16kCOCO minivalbox AP35.6Online Fg Bal. Sampling+Hard Negative Mining (ResNet-50)

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07