Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

2D Classification on COCO minival

Metric: APS (average precision on small objects; higher is better)


Results

| # | Model | APS | Augmentations | Paper | Date | Code |
|---|-------|-----|---------------|-------|------|------|
| 1 | Focal-Stable-DINO (Focal-Huge, no TTA) | 50.4 | Yes | A Strong and Reproducible Object Detector with O... | 2023-04-25 | Code |
| 2 | EVA | 49.4 | Yes | EVA: Exploring the Limits of Masked Visual Repre... | 2022-11-14 | Code |
| 3 | UNINEXT-H | 45.1 | Yes | Universal Instance Perception as Object Discover... | 2023-03-12 | Code |
| 4 | DyHead (Swin-L, multi scale) | 44.5 | No | Dynamic Head: Unifying Object Detection Heads wi... | 2021-06-15 | Code |
| 5 | YOLOR-D6 (1280, single-scale, 31 fps) | 40.4 | No | You Only Learn One Representation: Unified Netwo... | 2021-05-10 | Code |
| 6 | QueryInst (single scale) | 40.2 | No | Instances as Queries | 2021-05-05 | Code |
| 7 | EfficientDet-D7x (single-scale) | 40 | No | EfficientDet: Scalable and Efficient Object Dete... | 2019-11-20 | Code |
| 8 | YOLOv4-P7 CSP-P7 (single-scale, 16 fps) | 38.1 | No | Scaled-YOLOv4: Scaling Cross Stage Partial Network | 2020-11-16 | Code |
| 9 | YOLOR-P6 (1280, single-scale, 72 fps) | 37.4 | No | You Only Learn One Representation: Unified Netwo... | 2021-05-10 | Code |
| 10 | UniverseNet-20.08d (Res2Net-101, DCN, multi-scale) | 36.9 | No | USB: Universal-Scale Object Detection Benchmark | 2021-03-25 | Code |
| 11 | ResNeSt-200 (multi-scale) | 36.8 | No | ResNeSt: Split-Attention Networks | 2020-04-19 | Code |
| 12 | DINO-5scale (36 epoch) | 35 | No | DINO: DETR with Improved DeNoising Anchor Boxes ... | 2022-03-07 | Code |
| 13 | Cascade RCNN-RS (SpineNet-143L, single scale) | 34.5 | No | Simple Training Strategies and Model Scaling for... | 2021-06-30 | Code |
| 14 | DINO-5scale (24 epoch) | 34.5 | No | DINO: DETR with Improved DeNoising Anchor Boxes ... | 2022-03-07 | Code |
| 15 | Cascade RCNN-RS (ResNet-200, single scale) | 33.9 | No | Simple Training Strategies and Model Scaling for... | 2021-06-30 | Code |
| 16 | UniverseNet-20.08d (Res2Net-101, DCN, single-scale) | 33.5 | No | USB: Universal-Scale Object Detection Benchmark | 2021-03-25 | Code |
| 17 | ResNeSt-200-DCN (single-scale) | 32.67 | No | ResNeSt: Split-Attention Networks | 2020-04-19 | Code |
| 18 | DN-Deformable-DETR-R50++ | 31.3 | No | DN-DETR: Accelerate DETR Training by Introducing... | 2022-03-02 | Code |
| 19 | UniverseNet-20.08 (Res2Net-50, DCN, single-scale) | 30.6 | No | USB: Universal-Scale Object Detection Benchmark | 2021-03-25 | Code |
| 20 | MAE-Det(MAE-Det-L+GFLV2) | 30.3 | No | MAE-DET: Revisiting Maximum Entropy Principle in... | 2021-11-26 | Code |
| 21 | REGO-Deformable DETR-X101 | 30 | No | Recurrent Glimpse-based Decoder for Detection wi... | 2021-12-09 | Code |
| 22 | HoughNet (HG-104, MS) | 30 | No | HoughNet: Integrating near and long-range eviden... | 2020-07-05 | Code |
| 23 | RetinaNet (ViL-Base, multi-scale, 3x) | 29.9 | No | Multi-Scale Vision Longformer: A New Vision Tran... | 2021-03-29 | Code |
| 24 | CenterMask+VoVNetV2-99 (single-scale) | 29.2 | No | CenterMask : Real-Time Anchor-Free Instance Segm... | 2019-11-15 | Code |
| 25 | RetinaNet (ViL-Base) | 28.9 | No | Multi-Scale Vision Longformer: A New Vision Tran... | 2021-03-29 | Code |
| 26 | HTC (HRNetV2p-W48) | 28.8 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 27 | Res2Net101+HTC | 28.6 | No | Res2Net: A New Multi-scale Backbone Architecture | 2019-04-02 | Code |
| 28 | Mask R-CNN (VoVNetV2-99, single-scale) | 28.5 | No | CenterMask : Real-Time Anchor-Free Instance Segm... | 2019-11-15 | Code |
| 29 | Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN) | 28.3 | No | Sparse R-CNN: End-to-End Object Detection with L... | 2020-11-25 | Code |
| 30 | Pix2seq (R101-DC5) | 28.2 | No | Pix2seq: A Language Modeling Framework for Objec... | 2021-09-22 | Code |
| 31 | DAB-DETR-DC5-R101 | 28.1 | No | DAB-DETR: Dynamic Anchor Boxes are Better Querie... | 2022-01-28 | Code |
| 32 | CenterMask+VoVNetV2-57 (single-scale) | 27.7 | No | CenterMask : Real-Time Anchor-Free Instance Segm... | 2019-11-15 | Code |
| 33 | Mask R-CNN (HRNetV2p-W48, cascade) | 27.5 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 34 | Conditional DETR-DC5-R101 | 27.2 | No | Conditional DETR for Fast Training Convergence | 2021-08-13 | Code |
| 35 | Faster RCNN-R101-FPN+ | 27.2 | No | End-to-End Object Detection with Transformers | 2020-05-26 | Code |
| 36 | HTC (HRNetV2p-W32) | 27 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 37 | R3-CNN (ResNet-50-FPN, GC-Net) | 27 | No | Recursively Refined R-CNN: Instance Segmentation... | 2021-04-03 | Code |
| 38 | Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN) | 26.9 | No | Sparse R-CNN: End-to-End Object Detection with L... | 2020-11-25 | Code |
| 39 | Faster R-CNN (FPN, X-volution) | 26.9 | No | X-volution: On the unification of convolution an... | 2021-06-04 | - |
| 40 | CenterMask+X101-32x8d (single-scale) | 26.7 | No | CenterMask : Real-Time Anchor-Free Instance Segm... | 2019-11-15 | Code |
| 41 | Sparse R-CNN (ResNet-50, FPN) | 26.7 | No | Sparse R-CNN: End-to-End Object Detection with L... | 2020-11-25 | Code |
| 42 | R3-CNN (ResNet-50-FPN, DCN) | 26.6 | No | Recursively Refined R-CNN: Instance Segmentation... | 2021-04-03 | Code |
| 43 | Pix2seq (R50-DC5) | 26.6 | No | Pix2seq: A Language Modeling Framework for Objec... | 2021-09-22 | Code |
| 44 | HTC (HRNetV2p-W18) | 26.6 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 45 | Cascade R-CNN (HRNetV2p-W48) | 26.3 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 46 | Sparse R-CNN (ResNet-101, FPN) | 26.1 | No | Sparse R-CNN: End-to-End Object Detection with L... | 2020-11-25 | Code |
| 47 | PVT-Large (RetinaNet 3x,MS) | 26.1 | No | Pyramid Vision Transformer: A Versatile Backbone... | 2021-02-24 | Code |
| 48 | Mask R-CNN (HRNetV2p-W32, cascade) | 26.1 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 49 | Anchor DETR-DC5-R101 | 25.8 | No | Anchor DETR: Query Design for Transformer-Based ... | 2021-09-15 | Code |
| 50 | PVT-Large (RetinaNet 1x) | 25.8 | No | Pyramid Vision Transformer: A Versatile Backbone... | 2021-02-24 | Code |
| 51 | ExtremeNet (Hourglass-104, multi-scale) | 25.7 | No | Bottom-up Object Detection by Grouping Extreme a... | 2019-01-23 | Code |
| 52 | Cascade R-CNN (HRNetV2p-W32) | 25.6 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 53 | HoughNet (HG-104) | 25.5 | No | HoughNet: Integrating near and long-range eviden... | 2020-07-05 | Code |
| 54 | CornerNet-Saccade (Hourglass-54) | 25.5 | No | CornerNet-Lite: Efficient Keypoint Based Object ... | 2019-04-18 | Code |
| 55 | Mask R-CNN-FPN (ResNeXt-101, GN+WS) | 25.49 | No | Micro-Batch Training with Batch-Channel Normaliz... | 2019-03-25 | Code |
| 56 | PPDet (ResNet-101-FPN) | 25.4 | No | Reducing Label Noise in Anchor-Free Object Detec... | 2020-08-03 | Code |
| 57 | Conditional DETR-DC5-R50 | 25.3 | No | Conditional DETR for Fast Training Convergence | 2021-08-13 | Code |
| 58 | Faster R-CNN (LIP-ResNet-101) | 25.2 | No | LIP: Local Importance-based Pooling | 2019-08-12 | Code |
| 59 | Mask R-CNN (HRNetV2p-W32) | 25 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 60 | TridentNet (ResNet-101) | 24.9 | No | Scale-Aware Trident Networks for Object Detection | 2019-01-07 | Code |
| 61 | Anchor DETR-DC5-R50 | 24.7 | No | Anchor DETR: Query Design for Transformer-Based ... | 2021-09-15 | Code |
| 62 | R3-CNN (ResNet-50-FPN) | 24.5 | No | Recursively Refined R-CNN: Instance Segmentation... | 2021-04-03 | Code |
| 63 | Faster R-CNN (HRNetV2p-W32) | 24.4 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 64 | R3-CNN (ResNet-50-FPN, GRoIE) | 24.4 | No | Recursively Refined R-CNN: Instance Segmentation... | 2021-04-03 | Code |
| 65 | GCnet (ResNet-50-FPN, GRoIE) | 24.2 | No | GCNet: Non-local Networks Meet Squeeze-Excitatio... | 2019-04-25 | Code |
| 66 | DAB-DETR-R101 | 24.1 | No | DAB-DETR: Dynamic Anchor Boxes are Better Querie... | 2022-01-28 | Code |
| 67 | Cascade R-CNN (ResNet-101-FPN+, cascade) | 23.8 | No | Cascade R-CNN: Delving into High Quality Object ... | 2017-12-03 | Code |
| 68 | CornerNet-Saccade (Hourglass-104) | 23.8 | No | CornerNet-Lite: Efficient Keypoint Based Object ... | 2019-04-18 | Code |
| 69 | DETR-DC5 (ResNet-101) | 23.7 | No | End-to-End Object Detection with Transformers | 2020-05-26 | Code |
| 70 | Cascade R-CNN (HRNetV2p-W18) | 23.7 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 71 | Conditional DETR-R101 | 23.6 | No | Conditional DETR for Fast Training Convergence | 2021-08-13 | Code |
| 72 | CenterNet511 (Hourglass-52) | 23.6 | No | CenterNet: Keypoint Triplets for Object Detection | 2019-04-17 | Code |
| 73 | Grid R-CNN (ResNet-101-FPN) | 23.4 | No | Grid R-CNN | 2018-11-29 | Code |
| 74 | Cascade R-CNN (ResNet-50-FPN+) | 22.9 | No | Cascade R-CNN: Delving into High Quality Object ... | 2017-12-03 | Code |
| 75 | FPN+ | 22.9 | No | Feature Pyramid Networks for Object Detection | 2016-12-09 | Code |
| 76 | Libra R-CNN (ResNet-50 FPN) | 22.9 | No | Libra R-CNN: Towards Balanced Learning for Objec... | 2019-04-04 | Code |
| 77 | Mask R-CNN (ResNet-50-FPN, GRoIE) | 22.9 | No | A novel Region of Interest Extraction Layer for ... | 2020-04-28 | Code |
| 78 | Conditional DETR-R50 | 22.7 | No | Conditional DETR for Fast Training Convergence | 2021-08-13 | Code |
| 79 | Grid R-CNN (ResNet-50-FPN) | 22.6 | No | Grid R-CNN | 2018-11-29 | Code |
| 80 | Faster R-CNN (HRNetV2p-W18) | 22.6 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 81 | FoveaBox (ResNet-101-FPN, 800x800) | 22.3 | No | FoveaBox: Beyond Anchor-based Object Detector | 2019-04-08 | Code |
| 82 | FCOS (ResNet-50-FPN + improvements) | 22.3 | No | FCOS: Fully Convolutional One-Stage Object Detec... | 2019-04-02 | Code |
| 83 | Faster R-CNN (ResNet-50-FPN, GRoIE) | 22.3 | No | A novel Region of Interest Extraction Layer for ... | 2020-04-28 | Code |
| 84 | Faster R-CNN (ResNet-101, DCNv2) | 22.2 | No | Deformable ConvNets v2: More Deformable, Better ... | 2018-11-27 | Code |
| 85 | ExtremeNet (Hourglass-104, single-scale) | 21.6 | No | Bottom-up Object Detection by Grouping Extreme a... | 2019-01-23 | Code |
| 86 | HTC (cascade) | 20.3 | No | Hybrid Task Cascade for Instance Segmentation | 2019-01-22 | Code |
| 87 | FSAF (ResNet-50) | 19.8 | No | Feature Selective Anchor-Free Module for Single-... | 2019-03-02 | Code |
| 88 | GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30) | 19.6 | No | Gradient Harmonized Single-stage Detector | 2018-11-13 | Code |
| 89 | FoveaBox (ResNet-101-FPN, 600x600) | 19.5 | No | FoveaBox: Beyond Anchor-based Object Detector | 2019-04-08 | Code |
| 90 | CornerNet511 (Hourglass-104) | 18.6 | No | CornerNet: Detecting Objects as Paired Keypoints | 2018-08-03 | Code |
| 91 | FoveaBox (ResNet-50-FPN, 600x600) | 18.6 | No | FoveaBox: Beyond Anchor-based Object Detector | 2019-04-08 | Code |
| 92 | M2Det (ResNet-101, 320x320) | 15.9 | No | M2Det: A Single-Shot Object Detector based on Mu... | 2018-11-12 | Code |
| 93 | M2Det (VGG-16, 320x320) | 15 | No | M2Det: A Single-Shot Object Detector based on Mu... | 2018-11-12 | Code |
| 94 | Faster R-CNN (Res2Net-50) | 14 | No | Res2Net: A New Multi-scale Backbone Architecture | 2019-04-02 | Code |