TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Feature Pyramid Networks for Object Detection

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie

2016-12-09CVPR 2017 7Semantic SegmentationPedestrian DetectionObject Detection
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But recent deep learning object detectors have avoided pyramid representations, in part because they are compute and memory intensive. In this paper, we exploit the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost. A top-down architecture with lateral connections is developed for building high-level semantic feature maps at all scales. This architecture, called a Feature Pyramid Network (FPN), shows significant improvement as a generic feature extractor in several applications. Using FPN in a basic Faster R-CNN system, our method achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles, surpassing all existing single-model entries including those from the COCO 2016 challenge winners. In addition, our method can run at 5 FPS on a GPU and thus is a practical and accurate solution to multi-scale object detection. Code will be made publicly available.

Results

TaskDatasetMetricValueModel
Autonomous VehiclesTJU-Ped-trafficALL (miss rate)37.78FPN
Autonomous VehiclesTJU-Ped-trafficHO (miss rate)60.3FPN
Autonomous VehiclesTJU-Ped-trafficR (miss rate)22.3FPN
Autonomous VehiclesTJU-Ped-trafficR+HO (miss rate)26.71FPN
Autonomous VehiclesTJU-Ped-trafficRS (miss rate)35.19FPN
Semantic Segmentation US3DmIoU72.51FPN
Semantic Segmentation PotsdammIoU82.99FPN
Semantic SegmentationVaihingenmIoU74.86FPN
Object DetectionCOCO test-devbox mAP36.2Faster R-CNN + FPN
Object DetectionCOCO minivalAP5061.3FPN+
Object DetectionCOCO minivalAP7543.3FPN+
Object DetectionCOCO minivalAPL52.6FPN+
Object DetectionCOCO minivalAPM43.3FPN+
Object DetectionCOCO minivalAPS22.9FPN+
Object DetectionCOCO minivalbox AP39.8FPN+
3DCOCO test-devbox mAP36.2Faster R-CNN + FPN
3DCOCO minivalAP5061.3FPN+
3DCOCO minivalAP7543.3FPN+
3DCOCO minivalAPL52.6FPN+
3DCOCO minivalAPM43.3FPN+
3DCOCO minivalAPS22.9FPN+
3DCOCO minivalbox AP39.8FPN+
2D ClassificationCOCO test-devbox mAP36.2Faster R-CNN + FPN
2D ClassificationCOCO minivalAP5061.3FPN+
2D ClassificationCOCO minivalAP7543.3FPN+
2D ClassificationCOCO minivalAPL52.6FPN+
2D ClassificationCOCO minivalAPM43.3FPN+
2D ClassificationCOCO minivalAPS22.9FPN+
2D ClassificationCOCO minivalbox AP39.8FPN+
Pedestrian DetectionTJU-Ped-trafficALL (miss rate)37.78FPN
Pedestrian DetectionTJU-Ped-trafficHO (miss rate)60.3FPN
Pedestrian DetectionTJU-Ped-trafficR (miss rate)22.3FPN
Pedestrian DetectionTJU-Ped-trafficR+HO (miss rate)26.71FPN
Pedestrian DetectionTJU-Ped-trafficRS (miss rate)35.19FPN
2D Object DetectionCOCO test-devbox mAP36.2Faster R-CNN + FPN
2D Object DetectionCOCO minivalAP5061.3FPN+
2D Object DetectionCOCO minivalAP7543.3FPN+
2D Object DetectionCOCO minivalAPL52.6FPN+
2D Object DetectionCOCO minivalAPM43.3FPN+
2D Object DetectionCOCO minivalAPS22.9FPN+
2D Object DetectionCOCO minivalbox AP39.8FPN+
10-shot image generation US3DmIoU72.51FPN
10-shot image generation PotsdammIoU82.99FPN
10-shot image generationVaihingenmIoU74.86FPN
16kCOCO test-devbox mAP36.2Faster R-CNN + FPN
16kCOCO minivalAP5061.3FPN+
16kCOCO minivalAP7543.3FPN+
16kCOCO minivalAPL52.6FPN+
16kCOCO minivalAPM43.3FPN+
16kCOCO minivalAPS22.9FPN+
16kCOCO minivalbox AP39.8FPN+

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17