Siyuan Qiao, Liang-Chieh Chen, Alan Yuille
Many modern object detectors demonstrate outstanding performances by using the mechanism of looking and thinking twice. In this paper, we explore this mechanism in the backbone design for object detection. At the macro level, we propose Recursive Feature Pyramid, which incorporates extra feedback connections from Feature Pyramid Networks into the bottom-up backbone layers. At the micro level, we propose Switchable Atrous Convolution, which convolves the features with different atrous rates and gathers the results using switch functions. Combining them results in DetectoRS, which significantly improves the performances of object detection. On COCO test-dev, DetectoRS achieves state-of-the-art 55.7% box AP for object detection, 48.5% mask AP for instance segmentation, and 50.0% PQ for panoptic segmentation. The code is made publicly available.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | COCO test-dev | PQ | 50 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Semantic Segmentation | COCO test-dev | PQst | 37.2 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Semantic Segmentation | COCO test-dev | PQth | 58.5 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Object Detection | COCO test-dev | AP50 | 74.2 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Object Detection | COCO test-dev | AP75 | 61.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Object Detection | COCO test-dev | APL | 68.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Object Detection | COCO test-dev | APM | 58.4 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Object Detection | COCO test-dev | APS | 37.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Object Detection | COCO test-dev | box mAP | 55.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Object Detection | COCO test-dev | AP50 | 73.5 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Object Detection | COCO test-dev | AP75 | 60.1 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Object Detection | COCO test-dev | APL | 66.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Object Detection | COCO test-dev | APM | 57.3 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Object Detection | COCO test-dev | APS | 37.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Object Detection | COCO test-dev | box mAP | 54.7 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Object Detection | COCO test-dev | AP50 | 71.6 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| Object Detection | COCO test-dev | AP75 | 58.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| Object Detection | COCO test-dev | APL | 66.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| Object Detection | COCO test-dev | APM | 56.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| Object Detection | COCO test-dev | APS | 33.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| Object Detection | COCO test-dev | box mAP | 53.3 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| Object Detection | AI-TOD | AP | 14.8 | DetectoRS (ResNet-50-FPN) |
| Object Detection | AI-TOD | AP50 | 32.8 | DetectoRS (ResNet-50-FPN) |
| Object Detection | AI-TOD | AP75 | 11.4 | DetectoRS (ResNet-50-FPN) |
| Object Detection | AI-TOD | APm | 28 | DetectoRS (ResNet-50-FPN) |
| Object Detection | AI-TOD | APs | 28.3 | DetectoRS (ResNet-50-FPN) |
| Object Detection | AI-TOD | APt | 10.8 | DetectoRS (ResNet-50-FPN) |
| 3D | COCO test-dev | AP50 | 74.2 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 3D | COCO test-dev | AP75 | 61.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 3D | COCO test-dev | APL | 68.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 3D | COCO test-dev | APM | 58.4 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 3D | COCO test-dev | APS | 37.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 3D | COCO test-dev | box mAP | 55.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 3D | COCO test-dev | AP50 | 73.5 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 3D | COCO test-dev | AP75 | 60.1 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 3D | COCO test-dev | APL | 66.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 3D | COCO test-dev | APM | 57.3 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 3D | COCO test-dev | APS | 37.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 3D | COCO test-dev | box mAP | 54.7 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 3D | COCO test-dev | AP50 | 71.6 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 3D | COCO test-dev | AP75 | 58.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 3D | COCO test-dev | APL | 66.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 3D | COCO test-dev | APM | 56.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 3D | COCO test-dev | APS | 33.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 3D | COCO test-dev | box mAP | 53.3 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 3D | AI-TOD | AP | 14.8 | DetectoRS (ResNet-50-FPN) |
| 3D | AI-TOD | AP50 | 32.8 | DetectoRS (ResNet-50-FPN) |
| 3D | AI-TOD | AP75 | 11.4 | DetectoRS (ResNet-50-FPN) |
| 3D | AI-TOD | APm | 28 | DetectoRS (ResNet-50-FPN) |
| 3D | AI-TOD | APs | 28.3 | DetectoRS (ResNet-50-FPN) |
| 3D | AI-TOD | APt | 10.8 | DetectoRS (ResNet-50-FPN) |
| Instance Segmentation | COCO test-dev | AP50 | 72 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | AP75 | 53.3 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | APL | 61.5 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | APM | 50.9 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | APS | 31.6 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | mask AP | 48.5 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | AP50 | 71.1 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | AP75 | 51.6 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | APL | 59.6 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | APM | 49.5 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | APS | 30.3 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| Instance Segmentation | COCO test-dev | mask AP | 47.1 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Classification | COCO test-dev | AP50 | 74.2 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Classification | COCO test-dev | AP75 | 61.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Classification | COCO test-dev | APL | 68.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Classification | COCO test-dev | APM | 58.4 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Classification | COCO test-dev | APS | 37.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Classification | COCO test-dev | box mAP | 55.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Classification | COCO test-dev | AP50 | 73.5 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Classification | COCO test-dev | AP75 | 60.1 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Classification | COCO test-dev | APL | 66.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Classification | COCO test-dev | APM | 57.3 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Classification | COCO test-dev | APS | 37.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Classification | COCO test-dev | box mAP | 54.7 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Classification | COCO test-dev | AP50 | 71.6 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Classification | COCO test-dev | AP75 | 58.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Classification | COCO test-dev | APL | 66.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Classification | COCO test-dev | APM | 56.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Classification | COCO test-dev | APS | 33.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Classification | COCO test-dev | box mAP | 53.3 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Classification | AI-TOD | AP | 14.8 | DetectoRS (ResNet-50-FPN) |
| 2D Classification | AI-TOD | AP50 | 32.8 | DetectoRS (ResNet-50-FPN) |
| 2D Classification | AI-TOD | AP75 | 11.4 | DetectoRS (ResNet-50-FPN) |
| 2D Classification | AI-TOD | APm | 28 | DetectoRS (ResNet-50-FPN) |
| 2D Classification | AI-TOD | APs | 28.3 | DetectoRS (ResNet-50-FPN) |
| 2D Classification | AI-TOD | APt | 10.8 | DetectoRS (ResNet-50-FPN) |
| 2D Object Detection | COCO test-dev | AP50 | 74.2 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | AP75 | 61.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | APL | 68.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | APM | 58.4 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | APS | 37.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | box mAP | 55.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | AP50 | 73.5 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | AP75 | 60.1 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | APL | 66.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | APM | 57.3 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | APS | 37.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | box mAP | 54.7 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 2D Object Detection | COCO test-dev | AP50 | 71.6 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Object Detection | COCO test-dev | AP75 | 58.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Object Detection | COCO test-dev | APL | 66.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Object Detection | COCO test-dev | APM | 56.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Object Detection | COCO test-dev | APS | 33.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Object Detection | COCO test-dev | box mAP | 53.3 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 2D Object Detection | AI-TOD | AP | 14.8 | DetectoRS (ResNet-50-FPN) |
| 2D Object Detection | AI-TOD | AP50 | 32.8 | DetectoRS (ResNet-50-FPN) |
| 2D Object Detection | AI-TOD | AP75 | 11.4 | DetectoRS (ResNet-50-FPN) |
| 2D Object Detection | AI-TOD | APm | 28 | DetectoRS (ResNet-50-FPN) |
| 2D Object Detection | AI-TOD | APs | 28.3 | DetectoRS (ResNet-50-FPN) |
| 2D Object Detection | AI-TOD | APt | 10.8 | DetectoRS (ResNet-50-FPN) |
| 10-shot image generation | COCO test-dev | PQ | 50 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 10-shot image generation | COCO test-dev | PQst | 37.2 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 10-shot image generation | COCO test-dev | PQth | 58.5 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Panoptic Segmentation | COCO test-dev | PQ | 50 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Panoptic Segmentation | COCO test-dev | PQst | 37.2 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| Panoptic Segmentation | COCO test-dev | PQth | 58.5 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 16k | COCO test-dev | AP50 | 74.2 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 16k | COCO test-dev | AP75 | 61.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 16k | COCO test-dev | APL | 68.1 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 16k | COCO test-dev | APM | 58.4 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 16k | COCO test-dev | APS | 37.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 16k | COCO test-dev | box mAP | 55.7 | DetectoRS (ResNeXt-101-64x4d, multi-scale) |
| 16k | COCO test-dev | AP50 | 73.5 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 16k | COCO test-dev | AP75 | 60.1 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 16k | COCO test-dev | APL | 66.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 16k | COCO test-dev | APM | 57.3 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 16k | COCO test-dev | APS | 37.4 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 16k | COCO test-dev | box mAP | 54.7 | DetectoRS (ResNeXt-101-32x4d, multi-scale) |
| 16k | COCO test-dev | AP50 | 71.6 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 16k | COCO test-dev | AP75 | 58.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 16k | COCO test-dev | APL | 66.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 16k | COCO test-dev | APM | 56.5 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 16k | COCO test-dev | APS | 33.9 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 16k | COCO test-dev | box mAP | 53.3 | DetectoRS (ResNeXt-101-32x4d, single-scale) |
| 16k | AI-TOD | AP | 14.8 | DetectoRS (ResNet-50-FPN) |
| 16k | AI-TOD | AP50 | 32.8 | DetectoRS (ResNet-50-FPN) |
| 16k | AI-TOD | AP75 | 11.4 | DetectoRS (ResNet-50-FPN) |
| 16k | AI-TOD | APm | 28 | DetectoRS (ResNet-50-FPN) |
| 16k | AI-TOD | APs | 28.3 | DetectoRS (ResNet-50-FPN) |
| 16k | AI-TOD | APt | 10.8 | DetectoRS (ResNet-50-FPN) |