Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie
Feature pyramids are a basic component in recognition systems for detecting objects at different scales. But recent deep learning object detectors have avoided pyramid representations, in part because they are compute and memory intensive. In this paper, we exploit the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost. A top-down architecture with lateral connections is developed for building high-level semantic feature maps at all scales. This architecture, called a Feature Pyramid Network (FPN), shows significant improvement as a generic feature extractor in several applications. Using FPN in a basic Faster R-CNN system, our method achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles, surpassing all existing single-model entries including those from the COCO 2016 challenge winners. In addition, our method can run at 5 FPS on a GPU and thus is a practical and accurate solution to multi-scale object detection. Code will be made publicly available.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Autonomous Vehicles | TJU-Ped-traffic | ALL (miss rate) | 37.78 | FPN |
| Autonomous Vehicles | TJU-Ped-traffic | HO (miss rate) | 60.3 | FPN |
| Autonomous Vehicles | TJU-Ped-traffic | R (miss rate) | 22.3 | FPN |
| Autonomous Vehicles | TJU-Ped-traffic | R+HO (miss rate) | 26.71 | FPN |
| Autonomous Vehicles | TJU-Ped-traffic | RS (miss rate) | 35.19 | FPN |
| Semantic Segmentation | US3D | mIoU | 72.51 | FPN |
| Semantic Segmentation | Potsdam | mIoU | 82.99 | FPN |
| Semantic Segmentation | Vaihingen | mIoU | 74.86 | FPN |
| Object Detection | COCO test-dev | box mAP | 36.2 | Faster R-CNN + FPN |
| Object Detection | COCO minival | AP50 | 61.3 | FPN+ |
| Object Detection | COCO minival | AP75 | 43.3 | FPN+ |
| Object Detection | COCO minival | APL | 52.6 | FPN+ |
| Object Detection | COCO minival | APM | 43.3 | FPN+ |
| Object Detection | COCO minival | APS | 22.9 | FPN+ |
| Object Detection | COCO minival | box AP | 39.8 | FPN+ |
| 3D | COCO test-dev | box mAP | 36.2 | Faster R-CNN + FPN |
| 3D | COCO minival | AP50 | 61.3 | FPN+ |
| 3D | COCO minival | AP75 | 43.3 | FPN+ |
| 3D | COCO minival | APL | 52.6 | FPN+ |
| 3D | COCO minival | APM | 43.3 | FPN+ |
| 3D | COCO minival | APS | 22.9 | FPN+ |
| 3D | COCO minival | box AP | 39.8 | FPN+ |
| 2D Classification | COCO test-dev | box mAP | 36.2 | Faster R-CNN + FPN |
| 2D Classification | COCO minival | AP50 | 61.3 | FPN+ |
| 2D Classification | COCO minival | AP75 | 43.3 | FPN+ |
| 2D Classification | COCO minival | APL | 52.6 | FPN+ |
| 2D Classification | COCO minival | APM | 43.3 | FPN+ |
| 2D Classification | COCO minival | APS | 22.9 | FPN+ |
| 2D Classification | COCO minival | box AP | 39.8 | FPN+ |
| Pedestrian Detection | TJU-Ped-traffic | ALL (miss rate) | 37.78 | FPN |
| Pedestrian Detection | TJU-Ped-traffic | HO (miss rate) | 60.3 | FPN |
| Pedestrian Detection | TJU-Ped-traffic | R (miss rate) | 22.3 | FPN |
| Pedestrian Detection | TJU-Ped-traffic | R+HO (miss rate) | 26.71 | FPN |
| Pedestrian Detection | TJU-Ped-traffic | RS (miss rate) | 35.19 | FPN |
| 2D Object Detection | COCO test-dev | box mAP | 36.2 | Faster R-CNN + FPN |
| 2D Object Detection | COCO minival | AP50 | 61.3 | FPN+ |
| 2D Object Detection | COCO minival | AP75 | 43.3 | FPN+ |
| 2D Object Detection | COCO minival | APL | 52.6 | FPN+ |
| 2D Object Detection | COCO minival | APM | 43.3 | FPN+ |
| 2D Object Detection | COCO minival | APS | 22.9 | FPN+ |
| 2D Object Detection | COCO minival | box AP | 39.8 | FPN+ |
| 10-shot image generation | US3D | mIoU | 72.51 | FPN |
| 10-shot image generation | Potsdam | mIoU | 82.99 | FPN |
| 10-shot image generation | Vaihingen | mIoU | 74.86 | FPN |
| 16k | COCO test-dev | box mAP | 36.2 | Faster R-CNN + FPN |
| 16k | COCO minival | AP50 | 61.3 | FPN+ |
| 16k | COCO minival | AP75 | 43.3 | FPN+ |
| 16k | COCO minival | APL | 52.6 | FPN+ |
| 16k | COCO minival | APM | 43.3 | FPN+ |
| 16k | COCO minival | APS | 22.9 | FPN+ |
| 16k | COCO minival | box AP | 39.8 | FPN+ |