TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Methodology/2D Classification/COCO-O

2D Classification on COCO-O

Metric: Average mAP (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Average mAP▼AugmentationsPaperDate↕Code
1EVA57.8NoEVA: Exploring the Limits of Masked Visual Repre...2022-11-14Code
2DETA (Swin-L)48.5NoNMS Strikes Back2022-12-12Code
3GLIP-L (Swin-L)48NoGrounded Language-Image Pre-training2021-12-07Code
4GRiT (ViT-H)42.9NoGRiT: A Generative Region-to-text Transformer fo...2022-12-01Code
5DINO (Swin-L)42.1NoDINO: DETR with Improved DeNoising Anchor Boxes ...2022-03-07Code
6CBNetV2 (Swin-L)39NoCBNet: A Composite Backbone Network Architecture...2021-07-01Code
7ConvNeXt-XL (Cascade Mask R-CNN)37.5NoA ConvNet for the 2020s2022-01-10Code
8InternImage-L (Cascade Mask R-CNN)37NoInternImage: Exploring Large-Scale Vision Founda...2022-11-10Code
9DyHead (Swin-L)35.3NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
10ViTDet (ViT-H)34.3NoExploring Plain Vision Transformer Backbones for...2022-03-30Code
11ViT-Adapter (BEiTv2-L)34.25NoVision Transformer Adapter for Dense Predictions2022-05-17Code
12FIBER-B (Swin-B)33.7NoCoarse-to-Fine Vision-Language Pre-training with...2022-06-15Code
13QueryInst (Swin-L)33.2NoInstances as Queries2021-05-05Code
14YOLOv6-L632.5NoYOLOv6: A Single-Stage Object Detection Framewor...2022-09-07Code
15YOLOv7-E6E32NoYOLOv7: Trainable bag-of-freebies sets new state...2022-07-06Code
16MViTV2-H (Cascade Mask R-CNN)30.9NoMViTv2: Improved Multiscale Vision Transformers ...2021-12-02Code
17Det-AdvProp (EfficientNet-B5)30.8NoRobust and Accurate Object Detection via Adversa...2021-03-23Code
18YOLOv4-P630.4NoYOLOv4: Optimal Speed and Accuracy of Object Det...2020-04-23Code
19YOLOX-X30.3NoYOLOX: Exceeding YOLO Series in 20212021-07-18Code
20CenterNet2 (R2-101-DCN)29.5NoProbabilistic two-stage detection2021-03-12Code
21GLIP-T (Swin-T)29.1NoGrounded Language-Image Pre-training2021-12-07Code
22EfficientDet-D5 (EfficientNet-B5)28.5NoEfficientDet: Scalable and Efficient Object Dete...2019-11-20Code
23PVTv2-B5 (Mask R-CNN)28.2NoPVT v2: Improved Baselines with Pyramid Vision T...2021-06-25Code
24VFNet (RX-101-64x4d)28NoVarifocalNet: An IoU-aware Dense Object Detector2020-08-31Code
25GCNet (RX-101-32x4d-DCN)26NoGCNet: Non-local Networks Meet Squeeze-Excitatio...2019-04-25Code
26GFLv2 (R2-101-DCN)25.1NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
27RepPointsV2 (RX-101-64x4d-DCN)24.9NoRepPoints V2: Verification Meets Regression for ...2020-07-16Code
28UniverseNet (R2-101-DCN)24.8NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
29YOLOX-S20.6NoYOLOX: Exceeding YOLO Series in 20212021-07-18Code
30YOLOS-B (ViT-B)20NoYou Only Look at One Sequence: Rethinking Transf...2021-06-01Code
31DyHead (ResNet-50)19.3NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
32HTC (ResNet-50)19.1NoHybrid Task Cascade for Instance Segmentation2019-01-22Code
33Deformable-DETR (ResNet-50)18.5NoDeformable DETR: Deformable Transformers for End...2020-10-08Code
34Cascade R-CNN (ResNet-50)18.2NoCascade R-CNN: High Quality Object Detection and...2019-06-24Code
35Mask R-CNN (ResNet-50)17.1NoMask R-CNN2017-03-20Code
36DETR (ResNet-50)17.1NoEnd-to-End Object Detection with Transformers2020-05-26Code
37ATSS (ResNet-50)16.8NoBridging the Gap Between Anchor-based and Anchor...2019-12-05Code
38FCOS (ResNet-50)16.7NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
39RetinaNet (ResNet-50)16.6NoFocal Loss for Dense Object Detection2017-08-07Code
40Faster R-CNN (ResNet-50-FPN)16.4YesFaster R-CNN: Towards Real-Time Object Detection...2015-06-04Code
41YOLOv3 (DarkNet-53)14.8NoYOLOv3: An Incremental Improvement2018-04-08Code
42SSD (VGG-16)13.6NoSSD: Single Shot MultiBox Detector2015-12-08Code