Kemal Oksuz, Baris Can Cam, Emre Akbas, Sinan Kalkan
We propose average Localisation-Recall-Precision (aLRP), a unified, bounded, balanced and ranking-based loss function for both classification and localisation tasks in object detection. aLRP extends the Localisation-Recall-Precision (LRP) performance metric (Oksuz et al., 2018) inspired from how Average Precision (AP) Loss extends precision to a ranking-based loss function for classification (Chen et al., 2020). aLRP has the following distinct advantages: (i) aLRP is the first ranking-based loss function for both classification and localisation tasks. (ii) Thanks to using ranking for both tasks, aLRP naturally enforces high-quality localisation for high-precision classification. (iii) aLRP provides provable balance between positives and negatives. (iv) Compared to on average $\sim$6 hyperparameters in the loss functions of state-of-the-art detectors, aLRP Loss has only one hyperparameter, which we did not tune in practice. On the COCO dataset, aLRP Loss improves its ranking-based predecessor, AP Loss, up to around $5$ AP points, achieves $48.9$ AP without test time augmentation and outperforms all one-stage detectors. Code available at: https://github.com/kemaloksuz/aLRPLoss .
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Object Detection | COCO test-dev | AP50 | 70.3 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| Object Detection | COCO test-dev | AP75 | 53.9 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| Object Detection | COCO test-dev | APL | 63 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| Object Detection | COCO test-dev | APM | 53.1 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| Object Detection | COCO test-dev | APS | 32 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| Object Detection | COCO test-dev | box mAP | 50.2 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| Object Detection | COCO test-dev | AP50 | 69.3 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| Object Detection | COCO test-dev | AP75 | 52.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| Object Detection | COCO test-dev | APL | 62.1 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| Object Detection | COCO test-dev | APM | 51.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| Object Detection | COCO test-dev | APS | 30.8 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| Object Detection | COCO test-dev | box mAP | 48.9 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| Object Detection | COCO test-dev | AP50 | 68.4 | aLRP Loss (ResNext-101-64x4d, single scale) |
| Object Detection | COCO test-dev | AP75 | 51.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| Object Detection | COCO test-dev | APL | 59.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| Object Detection | COCO test-dev | APM | 50.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| Object Detection | COCO test-dev | APS | 30.2 | aLRP Loss (ResNext-101-64x4d, single scale) |
| Object Detection | COCO test-dev | box mAP | 47.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| Object Detection | COCO test-dev | AP50 | 65 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| Object Detection | COCO test-dev | AP75 | 47.5 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| Object Detection | COCO test-dev | APL | 58.3 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| Object Detection | COCO test-dev | APM | 48.1 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| Object Detection | COCO test-dev | APS | 24.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| Object Detection | COCO test-dev | box mAP | 44.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| Object Detection | COCO minival | AP50 | 60.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| Object Detection | COCO minival | AP75 | 43.3 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| Object Detection | COCO minival | box AP | 40.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| Object Detection | COCO minival | AP50 | 60.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| Object Detection | COCO minival | AP75 | 42.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| Object Detection | COCO minival | box AP | 40.2 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| Object Detection | COCO minival | AP50 | 58.8 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| Object Detection | COCO minival | AP75 | 41.5 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| Object Detection | COCO minival | box AP | 39.7 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 3D | COCO test-dev | AP50 | 70.3 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 3D | COCO test-dev | AP75 | 53.9 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 3D | COCO test-dev | APL | 63 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 3D | COCO test-dev | APM | 53.1 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 3D | COCO test-dev | APS | 32 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 3D | COCO test-dev | box mAP | 50.2 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 3D | COCO test-dev | AP50 | 69.3 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 3D | COCO test-dev | AP75 | 52.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 3D | COCO test-dev | APL | 62.1 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 3D | COCO test-dev | APM | 51.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 3D | COCO test-dev | APS | 30.8 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 3D | COCO test-dev | box mAP | 48.9 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 3D | COCO test-dev | AP50 | 68.4 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 3D | COCO test-dev | AP75 | 51.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 3D | COCO test-dev | APL | 59.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 3D | COCO test-dev | APM | 50.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 3D | COCO test-dev | APS | 30.2 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 3D | COCO test-dev | box mAP | 47.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 3D | COCO test-dev | AP50 | 65 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 3D | COCO test-dev | AP75 | 47.5 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 3D | COCO test-dev | APL | 58.3 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 3D | COCO test-dev | APM | 48.1 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 3D | COCO test-dev | APS | 24.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 3D | COCO test-dev | box mAP | 44.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 3D | COCO minival | AP50 | 60.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 3D | COCO minival | AP75 | 43.3 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 3D | COCO minival | box AP | 40.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 3D | COCO minival | AP50 | 60.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 3D | COCO minival | AP75 | 42.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 3D | COCO minival | box AP | 40.2 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 3D | COCO minival | AP50 | 58.8 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 3D | COCO minival | AP75 | 41.5 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 3D | COCO minival | box AP | 39.7 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 2D Classification | COCO test-dev | AP50 | 70.3 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Classification | COCO test-dev | AP75 | 53.9 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Classification | COCO test-dev | APL | 63 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Classification | COCO test-dev | APM | 53.1 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Classification | COCO test-dev | APS | 32 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Classification | COCO test-dev | box mAP | 50.2 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Classification | COCO test-dev | AP50 | 69.3 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Classification | COCO test-dev | AP75 | 52.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Classification | COCO test-dev | APL | 62.1 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Classification | COCO test-dev | APM | 51.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Classification | COCO test-dev | APS | 30.8 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Classification | COCO test-dev | box mAP | 48.9 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Classification | COCO test-dev | AP50 | 68.4 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Classification | COCO test-dev | AP75 | 51.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Classification | COCO test-dev | APL | 59.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Classification | COCO test-dev | APM | 50.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Classification | COCO test-dev | APS | 30.2 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Classification | COCO test-dev | box mAP | 47.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Classification | COCO test-dev | AP50 | 65 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Classification | COCO test-dev | AP75 | 47.5 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Classification | COCO test-dev | APL | 58.3 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Classification | COCO test-dev | APM | 48.1 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Classification | COCO test-dev | APS | 24.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Classification | COCO test-dev | box mAP | 44.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Classification | COCO minival | AP50 | 60.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 2D Classification | COCO minival | AP75 | 43.3 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 2D Classification | COCO minival | box AP | 40.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 2D Classification | COCO minival | AP50 | 60.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 2D Classification | COCO minival | AP75 | 42.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 2D Classification | COCO minival | box AP | 40.2 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 2D Classification | COCO minival | AP50 | 58.8 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 2D Classification | COCO minival | AP75 | 41.5 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 2D Classification | COCO minival | box AP | 39.7 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 2D Object Detection | COCO test-dev | AP50 | 70.3 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Object Detection | COCO test-dev | AP75 | 53.9 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Object Detection | COCO test-dev | APL | 63 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Object Detection | COCO test-dev | APM | 53.1 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Object Detection | COCO test-dev | APS | 32 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Object Detection | COCO test-dev | box mAP | 50.2 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 2D Object Detection | COCO test-dev | AP50 | 69.3 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Object Detection | COCO test-dev | AP75 | 52.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Object Detection | COCO test-dev | APL | 62.1 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Object Detection | COCO test-dev | APM | 51.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Object Detection | COCO test-dev | APS | 30.8 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Object Detection | COCO test-dev | box mAP | 48.9 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 2D Object Detection | COCO test-dev | AP50 | 68.4 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Object Detection | COCO test-dev | AP75 | 51.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Object Detection | COCO test-dev | APL | 59.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Object Detection | COCO test-dev | APM | 50.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Object Detection | COCO test-dev | APS | 30.2 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Object Detection | COCO test-dev | box mAP | 47.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 2D Object Detection | COCO test-dev | AP50 | 65 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Object Detection | COCO test-dev | AP75 | 47.5 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Object Detection | COCO test-dev | APL | 58.3 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Object Detection | COCO test-dev | APM | 48.1 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Object Detection | COCO test-dev | APS | 24.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Object Detection | COCO test-dev | box mAP | 44.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 2D Object Detection | COCO minival | AP50 | 60.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 2D Object Detection | COCO minival | AP75 | 43.3 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 2D Object Detection | COCO minival | box AP | 40.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 2D Object Detection | COCO minival | AP50 | 60.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 2D Object Detection | COCO minival | AP75 | 42.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 2D Object Detection | COCO minival | box AP | 40.2 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 2D Object Detection | COCO minival | AP50 | 58.8 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 2D Object Detection | COCO minival | AP75 | 41.5 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 2D Object Detection | COCO minival | box AP | 39.7 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 16k | COCO test-dev | AP50 | 70.3 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 16k | COCO test-dev | AP75 | 53.9 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 16k | COCO test-dev | APL | 63 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 16k | COCO test-dev | APM | 53.1 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 16k | COCO test-dev | APS | 32 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 16k | COCO test-dev | box mAP | 50.2 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) |
| 16k | COCO test-dev | AP50 | 69.3 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 16k | COCO test-dev | AP75 | 52.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 16k | COCO test-dev | APL | 62.1 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 16k | COCO test-dev | APM | 51.5 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 16k | COCO test-dev | APS | 30.8 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 16k | COCO test-dev | box mAP | 48.9 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) |
| 16k | COCO test-dev | AP50 | 68.4 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 16k | COCO test-dev | AP75 | 51.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 16k | COCO test-dev | APL | 59.1 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 16k | COCO test-dev | APM | 50.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 16k | COCO test-dev | APS | 30.2 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 16k | COCO test-dev | box mAP | 47.8 | aLRP Loss (ResNext-101-64x4d, single scale) |
| 16k | COCO test-dev | AP50 | 65 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 16k | COCO test-dev | AP75 | 47.5 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 16k | COCO test-dev | APL | 58.3 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 16k | COCO test-dev | APM | 48.1 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 16k | COCO test-dev | APS | 24.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 16k | COCO test-dev | box mAP | 44.6 | aLRP Loss (ResNext-101, DCN, 500 scale) |
| 16k | COCO minival | AP50 | 60.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 16k | COCO minival | AP75 | 43.3 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 16k | COCO minival | box AP | 40.7 | Faster R-CNN+aLRP Loss (ResNet-50, 500 scale) |
| 16k | COCO minival | AP50 | 60.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 16k | COCO minival | AP75 | 42.3 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 16k | COCO minival | box AP | 40.2 | RetinaNet+aLRP Loss (ResNet-50, 500 scale) |
| 16k | COCO minival | AP50 | 58.8 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 16k | COCO minival | AP75 | 41.5 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |
| 16k | COCO minival | box AP | 39.7 | FoveaBox+aLRP Loss (ResNet-50, 500 scale) |