Rishit Dagli, Ali Mustufa Shaikh
We present a new challenging dataset, CPPE - 5 (Medical Personal Protective Equipment), with the goal to allow the study of subordinate categorization of medical personal protective equipments, which is not possible with other popular data sets that focus on broad-level categories (such as PASCAL VOC, ImageNet, Microsoft COCO, OpenImages, etc). To make it easy for models trained on this dataset to be used in practical scenarios in complex scenes, our dataset mainly contains images that show complex scenes with several objects in each scene in their natural context. The image collection for this dataset focuses on: obtaining as many non-iconic images as possible and making sure all the images are real-life images, unlike other existing datasets in this area. Our dataset includes 5 object categories (coveralls, face shields, gloves, masks, and goggles), and each image is annotated with a set of bounding boxes and positive labels. We present a detailed analysis of the dataset in comparison to other popular broad category datasets as well as datasets focusing on personal protective equipments, we also find that at present there exist no such publicly available datasets. Finally, we also analyze performance and compare model complexities on baseline and state-of-the-art models for bounding box results. Our code, data, and trained models are available at https://git.io/cppe5-dataset.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Object Detection | CPPE-5 | AP50 | 85.1 | TridentNet |
| Object Detection | CPPE-5 | AP75 | 58.3 | TridentNet |
| Object Detection | CPPE-5 | APL | 62.6 | TridentNet |
| Object Detection | CPPE-5 | APM | 41.3 | TridentNet |
| Object Detection | CPPE-5 | APS | 42.6 | TridentNet |
| Object Detection | CPPE-5 | box AP | 52.9 | TridentNet |
| Object Detection | CPPE-5 | AP50 | 86.5 | Empirical Attention |
| Object Detection | CPPE-5 | AP75 | 54.1 | Empirical Attention |
| Object Detection | CPPE-5 | APL | 61 | Empirical Attention |
| Object Detection | CPPE-5 | APM | 43.4 | Empirical Attention |
| Object Detection | CPPE-5 | APS | 38.7 | Empirical Attention |
| Object Detection | CPPE-5 | box AP | 52.5 | Empirical Attention |
| Object Detection | CPPE-5 | AP50 | 87.3 | Double Heads |
| Object Detection | CPPE-5 | AP75 | 55.2 | Double Heads |
| Object Detection | CPPE-5 | APL | 60.8 | Double Heads |
| Object Detection | CPPE-5 | APM | 41 | Double Heads |
| Object Detection | CPPE-5 | APS | 38.6 | Double Heads |
| Object Detection | CPPE-5 | box AP | 52 | Double Heads |
| Object Detection | CPPE-5 | AP50 | 87.1 | Deformable Convolutional Network |
| Object Detection | CPPE-5 | AP75 | 55.9 | Deformable Convolutional Network |
| Object Detection | CPPE-5 | APL | 61.3 | Deformable Convolutional Network |
| Object Detection | CPPE-5 | APM | 41.4 | Deformable Convolutional Network |
| Object Detection | CPPE-5 | APS | 36.3 | Deformable Convolutional Network |
| Object Detection | CPPE-5 | box AP | 51.6 | Deformable Convolutional Network |
| Object Detection | CPPE-5 | AP50 | 85.3 | RegNet |
| Object Detection | CPPE-5 | AP75 | 51.8 | RegNet |
| Object Detection | CPPE-5 | APL | 60.5 | RegNet |
| Object Detection | CPPE-5 | APM | 41.1 | RegNet |
| Object Detection | CPPE-5 | APS | 35.7 | RegNet |
| Object Detection | CPPE-5 | box AP | 51.3 | RegNet |
| Object Detection | CPPE-5 | AP50 | 82.6 | VarifocalNet |
| Object Detection | CPPE-5 | AP75 | 56.7 | VarifocalNet |
| Object Detection | CPPE-5 | APL | 58.8 | VarifocalNet |
| Object Detection | CPPE-5 | APM | 42.1 | VarifocalNet |
| Object Detection | CPPE-5 | APS | 39 | VarifocalNet |
| Object Detection | CPPE-5 | box AP | 51 | VarifocalNet |
| Object Detection | CPPE-5 | AP50 | 76.5 | Localization Distillation |
| Object Detection | CPPE-5 | AP75 | 58.8 | Localization Distillation |
| Object Detection | CPPE-5 | APL | 59.4 | Localization Distillation |
| Object Detection | CPPE-5 | APM | 43 | Localization Distillation |
| Object Detection | CPPE-5 | APS | 45.8 | Localization Distillation |
| Object Detection | CPPE-5 | box AP | 50.9 | Localization Distillation |
| Object Detection | CPPE-5 | AP50 | 84.7 | FSAF |
| Object Detection | CPPE-5 | AP75 | 48.2 | FSAF |
| Object Detection | CPPE-5 | APL | 56.7 | FSAF |
| Object Detection | CPPE-5 | APM | 39.6 | FSAF |
| Object Detection | CPPE-5 | APS | 45.3 | FSAF |
| Object Detection | CPPE-5 | box AP | 49.2 | FSAF |
| Object Detection | CPPE-5 | AP50 | 76.9 | Deformable DETR |
| Object Detection | CPPE-5 | AP75 | 52.8 | Deformable DETR |
| Object Detection | CPPE-5 | APL | 53.9 | Deformable DETR |
| Object Detection | CPPE-5 | APM | 35.2 | Deformable DETR |
| Object Detection | CPPE-5 | APS | 36.4 | Deformable DETR |
| Object Detection | CPPE-5 | box AP | 48 | Deformable DETR |
| Object Detection | CPPE-5 | AP50 | 77.9 | Grid RCNN |
| Object Detection | CPPE-5 | AP75 | 50.6 | Grid RCNN |
| Object Detection | CPPE-5 | APL | 54.4 | Grid RCNN |
| Object Detection | CPPE-5 | APM | 37.2 | Grid RCNN |
| Object Detection | CPPE-5 | APS | 43.4 | Grid RCNN |
| Object Detection | CPPE-5 | box AP | 47.5 | Grid RCNN |
| Object Detection | CPPE-5 | AP50 | 79.5 | FCOS |
| Object Detection | CPPE-5 | AP75 | 45.9 | FCOS |
| Object Detection | CPPE-5 | APL | 51.7 | FCOS |
| Object Detection | CPPE-5 | APM | 39.2 | FCOS |
| Object Detection | CPPE-5 | APS | 36.7 | FCOS |
| Object Detection | CPPE-5 | box AP | 44.4 | FCOS |
| Object Detection | CPPE-5 | AP50 | 73.8 | Faster RCNN |
| Object Detection | CPPE-5 | AP75 | 47.8 | Faster RCNN |
| Object Detection | CPPE-5 | APL | 52.5 | Faster RCNN |
| Object Detection | CPPE-5 | APM | 34.7 | Faster RCNN |
| Object Detection | CPPE-5 | APS | 30 | Faster RCNN |
| Object Detection | CPPE-5 | box AP | 44 | Faster RCNN |
| Object Detection | CPPE-5 | AP50 | 69.6 | Sparse RCNN |
| Object Detection | CPPE-5 | AP75 | 44.6 | Sparse RCNN |
| Object Detection | CPPE-5 | APL | 54.7 | Sparse RCNN |
| Object Detection | CPPE-5 | APM | 30.6 | Sparse RCNN |
| Object Detection | CPPE-5 | APS | 30 | Sparse RCNN |
| Object Detection | CPPE-5 | box AP | 44 | Sparse RCNN |
| Object Detection | CPPE-5 | AP50 | 75.9 | RepPoints |
| Object Detection | CPPE-5 | AP75 | 40.1 | RepPoints |
| Object Detection | CPPE-5 | APL | 48 | RepPoints |
| Object Detection | CPPE-5 | APM | 36.7 | RepPoints |
| Object Detection | CPPE-5 | APS | 27.3 | RepPoints |
| Object Detection | CPPE-5 | box AP | 43 | RepPoints |
| Object Detection | CPPE-5 | AP50 | 79.4 | YOLOv3 |
| Object Detection | CPPE-5 | AP75 | 35.3 | YOLOv3 |
| Object Detection | CPPE-5 | APL | 49 | YOLOv3 |
| Object Detection | CPPE-5 | APS | 23.1 | YOLOv3 |
| Object Detection | CPPE-5 | box AP | 38.5 | YOLOv3 |
| Object Detection | CPPE-5 | AP50 | 57 | SSD |
| Object Detection | CPPE-5 | AP75 | 24.9 | SSD |
| Object Detection | CPPE-5 | APL | 34.6 | SSD |
| Object Detection | CPPE-5 | APM | 23.1 | SSD |
| Object Detection | CPPE-5 | APS | 32.1 | SSD |
| Object Detection | CPPE-5 | box AP | 29.5 | SSD |
| 3D | CPPE-5 | AP50 | 85.1 | TridentNet |
| 3D | CPPE-5 | AP75 | 58.3 | TridentNet |
| 3D | CPPE-5 | APL | 62.6 | TridentNet |
| 3D | CPPE-5 | APM | 41.3 | TridentNet |
| 3D | CPPE-5 | APS | 42.6 | TridentNet |
| 3D | CPPE-5 | box AP | 52.9 | TridentNet |
| 3D | CPPE-5 | AP50 | 86.5 | Empirical Attention |
| 3D | CPPE-5 | AP75 | 54.1 | Empirical Attention |
| 3D | CPPE-5 | APL | 61 | Empirical Attention |
| 3D | CPPE-5 | APM | 43.4 | Empirical Attention |
| 3D | CPPE-5 | APS | 38.7 | Empirical Attention |
| 3D | CPPE-5 | box AP | 52.5 | Empirical Attention |
| 3D | CPPE-5 | AP50 | 87.3 | Double Heads |
| 3D | CPPE-5 | AP75 | 55.2 | Double Heads |
| 3D | CPPE-5 | APL | 60.8 | Double Heads |
| 3D | CPPE-5 | APM | 41 | Double Heads |
| 3D | CPPE-5 | APS | 38.6 | Double Heads |
| 3D | CPPE-5 | box AP | 52 | Double Heads |
| 3D | CPPE-5 | AP50 | 87.1 | Deformable Convolutional Network |
| 3D | CPPE-5 | AP75 | 55.9 | Deformable Convolutional Network |
| 3D | CPPE-5 | APL | 61.3 | Deformable Convolutional Network |
| 3D | CPPE-5 | APM | 41.4 | Deformable Convolutional Network |
| 3D | CPPE-5 | APS | 36.3 | Deformable Convolutional Network |
| 3D | CPPE-5 | box AP | 51.6 | Deformable Convolutional Network |
| 3D | CPPE-5 | AP50 | 85.3 | RegNet |
| 3D | CPPE-5 | AP75 | 51.8 | RegNet |
| 3D | CPPE-5 | APL | 60.5 | RegNet |
| 3D | CPPE-5 | APM | 41.1 | RegNet |
| 3D | CPPE-5 | APS | 35.7 | RegNet |
| 3D | CPPE-5 | box AP | 51.3 | RegNet |
| 3D | CPPE-5 | AP50 | 82.6 | VarifocalNet |
| 3D | CPPE-5 | AP75 | 56.7 | VarifocalNet |
| 3D | CPPE-5 | APL | 58.8 | VarifocalNet |
| 3D | CPPE-5 | APM | 42.1 | VarifocalNet |
| 3D | CPPE-5 | APS | 39 | VarifocalNet |
| 3D | CPPE-5 | box AP | 51 | VarifocalNet |
| 3D | CPPE-5 | AP50 | 76.5 | Localization Distillation |
| 3D | CPPE-5 | AP75 | 58.8 | Localization Distillation |
| 3D | CPPE-5 | APL | 59.4 | Localization Distillation |
| 3D | CPPE-5 | APM | 43 | Localization Distillation |
| 3D | CPPE-5 | APS | 45.8 | Localization Distillation |
| 3D | CPPE-5 | box AP | 50.9 | Localization Distillation |
| 3D | CPPE-5 | AP50 | 84.7 | FSAF |
| 3D | CPPE-5 | AP75 | 48.2 | FSAF |
| 3D | CPPE-5 | APL | 56.7 | FSAF |
| 3D | CPPE-5 | APM | 39.6 | FSAF |
| 3D | CPPE-5 | APS | 45.3 | FSAF |
| 3D | CPPE-5 | box AP | 49.2 | FSAF |
| 3D | CPPE-5 | AP50 | 76.9 | Deformable DETR |
| 3D | CPPE-5 | AP75 | 52.8 | Deformable DETR |
| 3D | CPPE-5 | APL | 53.9 | Deformable DETR |
| 3D | CPPE-5 | APM | 35.2 | Deformable DETR |
| 3D | CPPE-5 | APS | 36.4 | Deformable DETR |
| 3D | CPPE-5 | box AP | 48 | Deformable DETR |
| 3D | CPPE-5 | AP50 | 77.9 | Grid RCNN |
| 3D | CPPE-5 | AP75 | 50.6 | Grid RCNN |
| 3D | CPPE-5 | APL | 54.4 | Grid RCNN |
| 3D | CPPE-5 | APM | 37.2 | Grid RCNN |
| 3D | CPPE-5 | APS | 43.4 | Grid RCNN |
| 3D | CPPE-5 | box AP | 47.5 | Grid RCNN |
| 3D | CPPE-5 | AP50 | 79.5 | FCOS |
| 3D | CPPE-5 | AP75 | 45.9 | FCOS |
| 3D | CPPE-5 | APL | 51.7 | FCOS |
| 3D | CPPE-5 | APM | 39.2 | FCOS |
| 3D | CPPE-5 | APS | 36.7 | FCOS |
| 3D | CPPE-5 | box AP | 44.4 | FCOS |
| 3D | CPPE-5 | AP50 | 73.8 | Faster RCNN |
| 3D | CPPE-5 | AP75 | 47.8 | Faster RCNN |
| 3D | CPPE-5 | APL | 52.5 | Faster RCNN |
| 3D | CPPE-5 | APM | 34.7 | Faster RCNN |
| 3D | CPPE-5 | APS | 30 | Faster RCNN |
| 3D | CPPE-5 | box AP | 44 | Faster RCNN |
| 3D | CPPE-5 | AP50 | 69.6 | Sparse RCNN |
| 3D | CPPE-5 | AP75 | 44.6 | Sparse RCNN |
| 3D | CPPE-5 | APL | 54.7 | Sparse RCNN |
| 3D | CPPE-5 | APM | 30.6 | Sparse RCNN |
| 3D | CPPE-5 | APS | 30 | Sparse RCNN |
| 3D | CPPE-5 | box AP | 44 | Sparse RCNN |
| 3D | CPPE-5 | AP50 | 75.9 | RepPoints |
| 3D | CPPE-5 | AP75 | 40.1 | RepPoints |
| 3D | CPPE-5 | APL | 48 | RepPoints |
| 3D | CPPE-5 | APM | 36.7 | RepPoints |
| 3D | CPPE-5 | APS | 27.3 | RepPoints |
| 3D | CPPE-5 | box AP | 43 | RepPoints |
| 3D | CPPE-5 | AP50 | 79.4 | YOLOv3 |
| 3D | CPPE-5 | AP75 | 35.3 | YOLOv3 |
| 3D | CPPE-5 | APL | 49 | YOLOv3 |
| 3D | CPPE-5 | APS | 23.1 | YOLOv3 |
| 3D | CPPE-5 | box AP | 38.5 | YOLOv3 |
| 3D | CPPE-5 | AP50 | 57 | SSD |
| 3D | CPPE-5 | AP75 | 24.9 | SSD |
| 3D | CPPE-5 | APL | 34.6 | SSD |
| 3D | CPPE-5 | APM | 23.1 | SSD |
| 3D | CPPE-5 | APS | 32.1 | SSD |
| 3D | CPPE-5 | box AP | 29.5 | SSD |
| 2D Classification | CPPE-5 | AP50 | 85.1 | TridentNet |
| 2D Classification | CPPE-5 | AP75 | 58.3 | TridentNet |
| 2D Classification | CPPE-5 | APL | 62.6 | TridentNet |
| 2D Classification | CPPE-5 | APM | 41.3 | TridentNet |
| 2D Classification | CPPE-5 | APS | 42.6 | TridentNet |
| 2D Classification | CPPE-5 | box AP | 52.9 | TridentNet |
| 2D Classification | CPPE-5 | AP50 | 86.5 | Empirical Attention |
| 2D Classification | CPPE-5 | AP75 | 54.1 | Empirical Attention |
| 2D Classification | CPPE-5 | APL | 61 | Empirical Attention |
| 2D Classification | CPPE-5 | APM | 43.4 | Empirical Attention |
| 2D Classification | CPPE-5 | APS | 38.7 | Empirical Attention |
| 2D Classification | CPPE-5 | box AP | 52.5 | Empirical Attention |
| 2D Classification | CPPE-5 | AP50 | 87.3 | Double Heads |
| 2D Classification | CPPE-5 | AP75 | 55.2 | Double Heads |
| 2D Classification | CPPE-5 | APL | 60.8 | Double Heads |
| 2D Classification | CPPE-5 | APM | 41 | Double Heads |
| 2D Classification | CPPE-5 | APS | 38.6 | Double Heads |
| 2D Classification | CPPE-5 | box AP | 52 | Double Heads |
| 2D Classification | CPPE-5 | AP50 | 87.1 | Deformable Convolutional Network |
| 2D Classification | CPPE-5 | AP75 | 55.9 | Deformable Convolutional Network |
| 2D Classification | CPPE-5 | APL | 61.3 | Deformable Convolutional Network |
| 2D Classification | CPPE-5 | APM | 41.4 | Deformable Convolutional Network |
| 2D Classification | CPPE-5 | APS | 36.3 | Deformable Convolutional Network |
| 2D Classification | CPPE-5 | box AP | 51.6 | Deformable Convolutional Network |
| 2D Classification | CPPE-5 | AP50 | 85.3 | RegNet |
| 2D Classification | CPPE-5 | AP75 | 51.8 | RegNet |
| 2D Classification | CPPE-5 | APL | 60.5 | RegNet |
| 2D Classification | CPPE-5 | APM | 41.1 | RegNet |
| 2D Classification | CPPE-5 | APS | 35.7 | RegNet |
| 2D Classification | CPPE-5 | box AP | 51.3 | RegNet |
| 2D Classification | CPPE-5 | AP50 | 82.6 | VarifocalNet |
| 2D Classification | CPPE-5 | AP75 | 56.7 | VarifocalNet |
| 2D Classification | CPPE-5 | APL | 58.8 | VarifocalNet |
| 2D Classification | CPPE-5 | APM | 42.1 | VarifocalNet |
| 2D Classification | CPPE-5 | APS | 39 | VarifocalNet |
| 2D Classification | CPPE-5 | box AP | 51 | VarifocalNet |
| 2D Classification | CPPE-5 | AP50 | 76.5 | Localization Distillation |
| 2D Classification | CPPE-5 | AP75 | 58.8 | Localization Distillation |
| 2D Classification | CPPE-5 | APL | 59.4 | Localization Distillation |
| 2D Classification | CPPE-5 | APM | 43 | Localization Distillation |
| 2D Classification | CPPE-5 | APS | 45.8 | Localization Distillation |
| 2D Classification | CPPE-5 | box AP | 50.9 | Localization Distillation |
| 2D Classification | CPPE-5 | AP50 | 84.7 | FSAF |
| 2D Classification | CPPE-5 | AP75 | 48.2 | FSAF |
| 2D Classification | CPPE-5 | APL | 56.7 | FSAF |
| 2D Classification | CPPE-5 | APM | 39.6 | FSAF |
| 2D Classification | CPPE-5 | APS | 45.3 | FSAF |
| 2D Classification | CPPE-5 | box AP | 49.2 | FSAF |
| 2D Classification | CPPE-5 | AP50 | 76.9 | Deformable DETR |
| 2D Classification | CPPE-5 | AP75 | 52.8 | Deformable DETR |
| 2D Classification | CPPE-5 | APL | 53.9 | Deformable DETR |
| 2D Classification | CPPE-5 | APM | 35.2 | Deformable DETR |
| 2D Classification | CPPE-5 | APS | 36.4 | Deformable DETR |
| 2D Classification | CPPE-5 | box AP | 48 | Deformable DETR |
| 2D Classification | CPPE-5 | AP50 | 77.9 | Grid RCNN |
| 2D Classification | CPPE-5 | AP75 | 50.6 | Grid RCNN |
| 2D Classification | CPPE-5 | APL | 54.4 | Grid RCNN |
| 2D Classification | CPPE-5 | APM | 37.2 | Grid RCNN |
| 2D Classification | CPPE-5 | APS | 43.4 | Grid RCNN |
| 2D Classification | CPPE-5 | box AP | 47.5 | Grid RCNN |
| 2D Classification | CPPE-5 | AP50 | 79.5 | FCOS |
| 2D Classification | CPPE-5 | AP75 | 45.9 | FCOS |
| 2D Classification | CPPE-5 | APL | 51.7 | FCOS |
| 2D Classification | CPPE-5 | APM | 39.2 | FCOS |
| 2D Classification | CPPE-5 | APS | 36.7 | FCOS |
| 2D Classification | CPPE-5 | box AP | 44.4 | FCOS |
| 2D Classification | CPPE-5 | AP50 | 73.8 | Faster RCNN |
| 2D Classification | CPPE-5 | AP75 | 47.8 | Faster RCNN |
| 2D Classification | CPPE-5 | APL | 52.5 | Faster RCNN |
| 2D Classification | CPPE-5 | APM | 34.7 | Faster RCNN |
| 2D Classification | CPPE-5 | APS | 30 | Faster RCNN |
| 2D Classification | CPPE-5 | box AP | 44 | Faster RCNN |
| 2D Classification | CPPE-5 | AP50 | 69.6 | Sparse RCNN |
| 2D Classification | CPPE-5 | AP75 | 44.6 | Sparse RCNN |
| 2D Classification | CPPE-5 | APL | 54.7 | Sparse RCNN |
| 2D Classification | CPPE-5 | APM | 30.6 | Sparse RCNN |
| 2D Classification | CPPE-5 | APS | 30 | Sparse RCNN |
| 2D Classification | CPPE-5 | box AP | 44 | Sparse RCNN |
| 2D Classification | CPPE-5 | AP50 | 75.9 | RepPoints |
| 2D Classification | CPPE-5 | AP75 | 40.1 | RepPoints |
| 2D Classification | CPPE-5 | APL | 48 | RepPoints |
| 2D Classification | CPPE-5 | APM | 36.7 | RepPoints |
| 2D Classification | CPPE-5 | APS | 27.3 | RepPoints |
| 2D Classification | CPPE-5 | box AP | 43 | RepPoints |
| 2D Classification | CPPE-5 | AP50 | 79.4 | YOLOv3 |
| 2D Classification | CPPE-5 | AP75 | 35.3 | YOLOv3 |
| 2D Classification | CPPE-5 | APL | 49 | YOLOv3 |
| 2D Classification | CPPE-5 | APS | 23.1 | YOLOv3 |
| 2D Classification | CPPE-5 | box AP | 38.5 | YOLOv3 |
| 2D Classification | CPPE-5 | AP50 | 57 | SSD |
| 2D Classification | CPPE-5 | AP75 | 24.9 | SSD |
| 2D Classification | CPPE-5 | APL | 34.6 | SSD |
| 2D Classification | CPPE-5 | APM | 23.1 | SSD |
| 2D Classification | CPPE-5 | APS | 32.1 | SSD |
| 2D Classification | CPPE-5 | box AP | 29.5 | SSD |
| 2D Object Detection | CPPE-5 | AP50 | 85.1 | TridentNet |
| 2D Object Detection | CPPE-5 | AP75 | 58.3 | TridentNet |
| 2D Object Detection | CPPE-5 | APL | 62.6 | TridentNet |
| 2D Object Detection | CPPE-5 | APM | 41.3 | TridentNet |
| 2D Object Detection | CPPE-5 | APS | 42.6 | TridentNet |
| 2D Object Detection | CPPE-5 | box AP | 52.9 | TridentNet |
| 2D Object Detection | CPPE-5 | AP50 | 86.5 | Empirical Attention |
| 2D Object Detection | CPPE-5 | AP75 | 54.1 | Empirical Attention |
| 2D Object Detection | CPPE-5 | APL | 61 | Empirical Attention |
| 2D Object Detection | CPPE-5 | APM | 43.4 | Empirical Attention |
| 2D Object Detection | CPPE-5 | APS | 38.7 | Empirical Attention |
| 2D Object Detection | CPPE-5 | box AP | 52.5 | Empirical Attention |
| 2D Object Detection | CPPE-5 | AP50 | 87.3 | Double Heads |
| 2D Object Detection | CPPE-5 | AP75 | 55.2 | Double Heads |
| 2D Object Detection | CPPE-5 | APL | 60.8 | Double Heads |
| 2D Object Detection | CPPE-5 | APM | 41 | Double Heads |
| 2D Object Detection | CPPE-5 | APS | 38.6 | Double Heads |
| 2D Object Detection | CPPE-5 | box AP | 52 | Double Heads |
| 2D Object Detection | CPPE-5 | AP50 | 87.1 | Deformable Convolutional Network |
| 2D Object Detection | CPPE-5 | AP75 | 55.9 | Deformable Convolutional Network |
| 2D Object Detection | CPPE-5 | APL | 61.3 | Deformable Convolutional Network |
| 2D Object Detection | CPPE-5 | APM | 41.4 | Deformable Convolutional Network |
| 2D Object Detection | CPPE-5 | APS | 36.3 | Deformable Convolutional Network |
| 2D Object Detection | CPPE-5 | box AP | 51.6 | Deformable Convolutional Network |
| 2D Object Detection | CPPE-5 | AP50 | 85.3 | RegNet |
| 2D Object Detection | CPPE-5 | AP75 | 51.8 | RegNet |
| 2D Object Detection | CPPE-5 | APL | 60.5 | RegNet |
| 2D Object Detection | CPPE-5 | APM | 41.1 | RegNet |
| 2D Object Detection | CPPE-5 | APS | 35.7 | RegNet |
| 2D Object Detection | CPPE-5 | box AP | 51.3 | RegNet |
| 2D Object Detection | CPPE-5 | AP50 | 82.6 | VarifocalNet |
| 2D Object Detection | CPPE-5 | AP75 | 56.7 | VarifocalNet |
| 2D Object Detection | CPPE-5 | APL | 58.8 | VarifocalNet |
| 2D Object Detection | CPPE-5 | APM | 42.1 | VarifocalNet |
| 2D Object Detection | CPPE-5 | APS | 39 | VarifocalNet |
| 2D Object Detection | CPPE-5 | box AP | 51 | VarifocalNet |
| 2D Object Detection | CPPE-5 | AP50 | 76.5 | Localization Distillation |
| 2D Object Detection | CPPE-5 | AP75 | 58.8 | Localization Distillation |
| 2D Object Detection | CPPE-5 | APL | 59.4 | Localization Distillation |
| 2D Object Detection | CPPE-5 | APM | 43 | Localization Distillation |
| 2D Object Detection | CPPE-5 | APS | 45.8 | Localization Distillation |
| 2D Object Detection | CPPE-5 | box AP | 50.9 | Localization Distillation |
| 2D Object Detection | CPPE-5 | AP50 | 84.7 | FSAF |
| 2D Object Detection | CPPE-5 | AP75 | 48.2 | FSAF |
| 2D Object Detection | CPPE-5 | APL | 56.7 | FSAF |
| 2D Object Detection | CPPE-5 | APM | 39.6 | FSAF |
| 2D Object Detection | CPPE-5 | APS | 45.3 | FSAF |
| 2D Object Detection | CPPE-5 | box AP | 49.2 | FSAF |
| 2D Object Detection | CPPE-5 | AP50 | 76.9 | Deformable DETR |
| 2D Object Detection | CPPE-5 | AP75 | 52.8 | Deformable DETR |
| 2D Object Detection | CPPE-5 | APL | 53.9 | Deformable DETR |
| 2D Object Detection | CPPE-5 | APM | 35.2 | Deformable DETR |
| 2D Object Detection | CPPE-5 | APS | 36.4 | Deformable DETR |
| 2D Object Detection | CPPE-5 | box AP | 48 | Deformable DETR |
| 2D Object Detection | CPPE-5 | AP50 | 77.9 | Grid RCNN |
| 2D Object Detection | CPPE-5 | AP75 | 50.6 | Grid RCNN |
| 2D Object Detection | CPPE-5 | APL | 54.4 | Grid RCNN |
| 2D Object Detection | CPPE-5 | APM | 37.2 | Grid RCNN |
| 2D Object Detection | CPPE-5 | APS | 43.4 | Grid RCNN |
| 2D Object Detection | CPPE-5 | box AP | 47.5 | Grid RCNN |
| 2D Object Detection | CPPE-5 | AP50 | 79.5 | FCOS |
| 2D Object Detection | CPPE-5 | AP75 | 45.9 | FCOS |
| 2D Object Detection | CPPE-5 | APL | 51.7 | FCOS |
| 2D Object Detection | CPPE-5 | APM | 39.2 | FCOS |
| 2D Object Detection | CPPE-5 | APS | 36.7 | FCOS |
| 2D Object Detection | CPPE-5 | box AP | 44.4 | FCOS |
| 2D Object Detection | CPPE-5 | AP50 | 73.8 | Faster RCNN |
| 2D Object Detection | CPPE-5 | AP75 | 47.8 | Faster RCNN |
| 2D Object Detection | CPPE-5 | APL | 52.5 | Faster RCNN |
| 2D Object Detection | CPPE-5 | APM | 34.7 | Faster RCNN |
| 2D Object Detection | CPPE-5 | APS | 30 | Faster RCNN |
| 2D Object Detection | CPPE-5 | box AP | 44 | Faster RCNN |
| 2D Object Detection | CPPE-5 | AP50 | 69.6 | Sparse RCNN |
| 2D Object Detection | CPPE-5 | AP75 | 44.6 | Sparse RCNN |
| 2D Object Detection | CPPE-5 | APL | 54.7 | Sparse RCNN |
| 2D Object Detection | CPPE-5 | APM | 30.6 | Sparse RCNN |
| 2D Object Detection | CPPE-5 | APS | 30 | Sparse RCNN |
| 2D Object Detection | CPPE-5 | box AP | 44 | Sparse RCNN |
| 2D Object Detection | CPPE-5 | AP50 | 75.9 | RepPoints |
| 2D Object Detection | CPPE-5 | AP75 | 40.1 | RepPoints |
| 2D Object Detection | CPPE-5 | APL | 48 | RepPoints |
| 2D Object Detection | CPPE-5 | APM | 36.7 | RepPoints |
| 2D Object Detection | CPPE-5 | APS | 27.3 | RepPoints |
| 2D Object Detection | CPPE-5 | box AP | 43 | RepPoints |
| 2D Object Detection | CPPE-5 | AP50 | 79.4 | YOLOv3 |
| 2D Object Detection | CPPE-5 | AP75 | 35.3 | YOLOv3 |
| 2D Object Detection | CPPE-5 | APL | 49 | YOLOv3 |
| 2D Object Detection | CPPE-5 | APS | 23.1 | YOLOv3 |
| 2D Object Detection | CPPE-5 | box AP | 38.5 | YOLOv3 |
| 2D Object Detection | CPPE-5 | AP50 | 57 | SSD |
| 2D Object Detection | CPPE-5 | AP75 | 24.9 | SSD |
| 2D Object Detection | CPPE-5 | APL | 34.6 | SSD |
| 2D Object Detection | CPPE-5 | APM | 23.1 | SSD |
| 2D Object Detection | CPPE-5 | APS | 32.1 | SSD |
| 2D Object Detection | CPPE-5 | box AP | 29.5 | SSD |
| 16k | CPPE-5 | AP50 | 85.1 | TridentNet |
| 16k | CPPE-5 | AP75 | 58.3 | TridentNet |
| 16k | CPPE-5 | APL | 62.6 | TridentNet |
| 16k | CPPE-5 | APM | 41.3 | TridentNet |
| 16k | CPPE-5 | APS | 42.6 | TridentNet |
| 16k | CPPE-5 | box AP | 52.9 | TridentNet |
| 16k | CPPE-5 | AP50 | 86.5 | Empirical Attention |
| 16k | CPPE-5 | AP75 | 54.1 | Empirical Attention |
| 16k | CPPE-5 | APL | 61 | Empirical Attention |
| 16k | CPPE-5 | APM | 43.4 | Empirical Attention |
| 16k | CPPE-5 | APS | 38.7 | Empirical Attention |
| 16k | CPPE-5 | box AP | 52.5 | Empirical Attention |
| 16k | CPPE-5 | AP50 | 87.3 | Double Heads |
| 16k | CPPE-5 | AP75 | 55.2 | Double Heads |
| 16k | CPPE-5 | APL | 60.8 | Double Heads |
| 16k | CPPE-5 | APM | 41 | Double Heads |
| 16k | CPPE-5 | APS | 38.6 | Double Heads |
| 16k | CPPE-5 | box AP | 52 | Double Heads |
| 16k | CPPE-5 | AP50 | 87.1 | Deformable Convolutional Network |
| 16k | CPPE-5 | AP75 | 55.9 | Deformable Convolutional Network |
| 16k | CPPE-5 | APL | 61.3 | Deformable Convolutional Network |
| 16k | CPPE-5 | APM | 41.4 | Deformable Convolutional Network |
| 16k | CPPE-5 | APS | 36.3 | Deformable Convolutional Network |
| 16k | CPPE-5 | box AP | 51.6 | Deformable Convolutional Network |
| 16k | CPPE-5 | AP50 | 85.3 | RegNet |
| 16k | CPPE-5 | AP75 | 51.8 | RegNet |
| 16k | CPPE-5 | APL | 60.5 | RegNet |
| 16k | CPPE-5 | APM | 41.1 | RegNet |
| 16k | CPPE-5 | APS | 35.7 | RegNet |
| 16k | CPPE-5 | box AP | 51.3 | RegNet |
| 16k | CPPE-5 | AP50 | 82.6 | VarifocalNet |
| 16k | CPPE-5 | AP75 | 56.7 | VarifocalNet |
| 16k | CPPE-5 | APL | 58.8 | VarifocalNet |
| 16k | CPPE-5 | APM | 42.1 | VarifocalNet |
| 16k | CPPE-5 | APS | 39 | VarifocalNet |
| 16k | CPPE-5 | box AP | 51 | VarifocalNet |
| 16k | CPPE-5 | AP50 | 76.5 | Localization Distillation |
| 16k | CPPE-5 | AP75 | 58.8 | Localization Distillation |
| 16k | CPPE-5 | APL | 59.4 | Localization Distillation |
| 16k | CPPE-5 | APM | 43 | Localization Distillation |
| 16k | CPPE-5 | APS | 45.8 | Localization Distillation |
| 16k | CPPE-5 | box AP | 50.9 | Localization Distillation |
| 16k | CPPE-5 | AP50 | 84.7 | FSAF |
| 16k | CPPE-5 | AP75 | 48.2 | FSAF |
| 16k | CPPE-5 | APL | 56.7 | FSAF |
| 16k | CPPE-5 | APM | 39.6 | FSAF |
| 16k | CPPE-5 | APS | 45.3 | FSAF |
| 16k | CPPE-5 | box AP | 49.2 | FSAF |
| 16k | CPPE-5 | AP50 | 76.9 | Deformable DETR |
| 16k | CPPE-5 | AP75 | 52.8 | Deformable DETR |
| 16k | CPPE-5 | APL | 53.9 | Deformable DETR |
| 16k | CPPE-5 | APM | 35.2 | Deformable DETR |
| 16k | CPPE-5 | APS | 36.4 | Deformable DETR |
| 16k | CPPE-5 | box AP | 48 | Deformable DETR |
| 16k | CPPE-5 | AP50 | 77.9 | Grid RCNN |
| 16k | CPPE-5 | AP75 | 50.6 | Grid RCNN |
| 16k | CPPE-5 | APL | 54.4 | Grid RCNN |
| 16k | CPPE-5 | APM | 37.2 | Grid RCNN |
| 16k | CPPE-5 | APS | 43.4 | Grid RCNN |
| 16k | CPPE-5 | box AP | 47.5 | Grid RCNN |
| 16k | CPPE-5 | AP50 | 79.5 | FCOS |
| 16k | CPPE-5 | AP75 | 45.9 | FCOS |
| 16k | CPPE-5 | APL | 51.7 | FCOS |
| 16k | CPPE-5 | APM | 39.2 | FCOS |
| 16k | CPPE-5 | APS | 36.7 | FCOS |
| 16k | CPPE-5 | box AP | 44.4 | FCOS |
| 16k | CPPE-5 | AP50 | 73.8 | Faster RCNN |
| 16k | CPPE-5 | AP75 | 47.8 | Faster RCNN |
| 16k | CPPE-5 | APL | 52.5 | Faster RCNN |
| 16k | CPPE-5 | APM | 34.7 | Faster RCNN |
| 16k | CPPE-5 | APS | 30 | Faster RCNN |
| 16k | CPPE-5 | box AP | 44 | Faster RCNN |
| 16k | CPPE-5 | AP50 | 69.6 | Sparse RCNN |
| 16k | CPPE-5 | AP75 | 44.6 | Sparse RCNN |
| 16k | CPPE-5 | APL | 54.7 | Sparse RCNN |
| 16k | CPPE-5 | APM | 30.6 | Sparse RCNN |
| 16k | CPPE-5 | APS | 30 | Sparse RCNN |
| 16k | CPPE-5 | box AP | 44 | Sparse RCNN |
| 16k | CPPE-5 | AP50 | 75.9 | RepPoints |
| 16k | CPPE-5 | AP75 | 40.1 | RepPoints |
| 16k | CPPE-5 | APL | 48 | RepPoints |
| 16k | CPPE-5 | APM | 36.7 | RepPoints |
| 16k | CPPE-5 | APS | 27.3 | RepPoints |
| 16k | CPPE-5 | box AP | 43 | RepPoints |
| 16k | CPPE-5 | AP50 | 79.4 | YOLOv3 |
| 16k | CPPE-5 | AP75 | 35.3 | YOLOv3 |
| 16k | CPPE-5 | APL | 49 | YOLOv3 |
| 16k | CPPE-5 | APS | 23.1 | YOLOv3 |
| 16k | CPPE-5 | box AP | 38.5 | YOLOv3 |
| 16k | CPPE-5 | AP50 | 57 | SSD |
| 16k | CPPE-5 | AP75 | 24.9 | SSD |
| 16k | CPPE-5 | APL | 34.6 | SSD |
| 16k | CPPE-5 | APM | 23.1 | SSD |
| 16k | CPPE-5 | APS | 32.1 | SSD |
| 16k | CPPE-5 | box AP | 29.5 | SSD |