
An Analysis of Scale Invariance in Object Detection - SNIP

Bharat Singh, Larry S. Davis

2017-11-22 · Object Detection

Abstract

An analysis of different techniques for recognizing and detecting objects under extreme scale variation is presented. Scale-specific and scale-invariant detector designs are compared by training them with different configurations of input data. By evaluating the performance of different network architectures for classifying small objects on ImageNet, we show that CNNs are not robust to changes in scale. Based on this analysis, we propose to train and test detectors on the same scales of an image pyramid. Since small and large objects are difficult to recognize at smaller and larger scales respectively, we present a novel training scheme called Scale Normalization for Image Pyramids (SNIP), which selectively back-propagates the gradients of object instances of different sizes as a function of the image scale. On the COCO dataset, our single-model performance is 45.7% mAP and an ensemble of 3 networks obtains an mAP of 48.3%. We use off-the-shelf ImageNet-1000 pre-trained models and only train with bounding box supervision. Our submission won the Best Student Entry in the COCO 2017 challenge. Code will be made available at http://bit.ly/2yXVg4c.
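The selective back-propagation idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the pyramid scales, the valid size ranges, and the helper names (`PYRAMID`, `box_size`, `valid_mask`, `masked_loss`) are all assumptions chosen for illustration, and the treatment of anchors and proposals that overlap ignored objects is left out. The sketch only shows the core rule: at each pyramid scale, an object contributes to the loss (and hence to the gradient) only if its resized size falls inside that scale's valid range.

```python
import math

# Hypothetical image pyramid: (scale factor, min size, max size).
# Sizes are in pixels of the *resized* image; the values are illustrative only.
PYRAMID = [
    (0.5, 64.0, math.inf),  # low resolution: keep only large objects
    (1.0, 32.0, 128.0),     # medium resolution: keep medium objects
    (2.0, 0.0, 96.0),       # high resolution: keep only small objects
]


def box_size(box):
    """Square root of the box area for a box given as (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    return math.sqrt(max(x2 - x1, 0.0) * max(y2 - y1, 0.0))


def valid_mask(boxes, scale, lo, hi):
    """True for each box whose size, after resizing by `scale`, lies in [lo, hi]."""
    return [lo <= box_size(b) * scale <= hi for b in boxes]


def masked_loss(per_box_losses, mask):
    """Average the loss over valid boxes only; masked boxes contribute no gradient."""
    kept = [loss for loss, keep in zip(per_box_losses, mask) if keep]
    return sum(kept) / len(kept) if kept else 0.0


# Three ground-truth boxes in original-image coordinates: small, medium, large.
boxes = [(0, 0, 20, 20), (0, 0, 100, 100), (0, 0, 400, 400)]
masks = [valid_mask(boxes, s, lo, hi) for s, lo, hi in PYRAMID]
```

With these assumed ranges, each of the three example boxes is trained at exactly one pyramid level, so every object is seen by the network at a roughly normalized size. In a real detector the same mask would be applied to the per-instance classification and regression losses before back-propagation.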

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Object Detection | COCO test-dev | AP50 | 67.3 | D-RFCN + SNIP (DPN-98 with flip, multi-scale) |
| Object Detection | COCO test-dev | AP75 | 51.1 | D-RFCN + SNIP (DPN-98 with flip, multi-scale) |
| Object Detection | COCO test-dev | APL | 57.1 | D-RFCN + SNIP (DPN-98 with flip, multi-scale) |
| Object Detection | COCO test-dev | APM | 48.8 | D-RFCN + SNIP (DPN-98 with flip, multi-scale) |
| Object Detection | COCO test-dev | APS | 29.3 | D-RFCN + SNIP (DPN-98 with flip, multi-scale) |
| Object Detection | COCO test-dev | box mAP | 45.7 | D-RFCN + SNIP (DPN-98 with flip, multi-scale) |
| Object Detection | COCO test-dev | AP50 | 65.5 | D-RFCN + SNIP (ResNet-101, multi-scale) |
| Object Detection | COCO test-dev | AP75 | 48.4 | D-RFCN + SNIP (ResNet-101, multi-scale) |
| Object Detection | COCO test-dev | APL | 54.9 | D-RFCN + SNIP (ResNet-101, multi-scale) |
| Object Detection | COCO test-dev | APM | 46.5 | D-RFCN + SNIP (ResNet-101, multi-scale) |
| Object Detection | COCO test-dev | APS | 27.2 | D-RFCN + SNIP (ResNet-101, multi-scale) |
| Object Detection | COCO test-dev | box mAP | 43.4 | D-RFCN + SNIP (ResNet-101, multi-scale) |

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection (2025-07-17)
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis (2025-07-17)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping (2025-07-15)
ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge (2025-07-08)
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations (2025-07-07)