TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Methodology/16k/COCO minival

16k on COCO minival

Metric: APL (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕APL▼AugmentationsPaperDate↕Code
1Focal-Stable-DINO (Focal-Huge, no TTA)78.5YesA Strong and Reproducible Object Detector with O...2023-04-25Code
2EVA78.5YesEVA: Exploring the Limits of Masked Visual Repre...2022-11-14Code
3UNINEXT-H75.3YesUniversal Instance Perception as Object Discover...2023-03-12Code
4DyHead (Swin-L, multi scale, self-training)74.2YesDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
5Focal-L (DyHead, multi-scale)73.4NoFocal Self-attention for Local-Global Interactio...2021-07-01Code
6DyHead (Swin-L, multi scale)73.2NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
7SOLQ (Swin-L, single scale)71.9NoSOLQ: Segmenting Objects by Learning Queries2021-06-04Code
8QueryInst (single scale)71.5NoInstances as Queries2021-05-05Code
9Cascade RCNN-RS (SpineNet-143L, single scale)70.6NoSimple Training Strategies and Model Scaling for...2021-06-30Code
10Cascade RCNN-RS (ResNet-200, single scale)70.3NoSimple Training Strategies and Model Scaling for...2021-06-30Code
11YOLOR-D6 (1280, single-scale, 31 fps)68.7NoYou Only Learn One Representation: Unified Netwo...2021-05-10Code
12UniverseNet-20.08d (Res2Net-101, DCN, multi-scale)68.1NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
13EfficientDet-D7x (single-scale)67.9NoEfficientDet: Scalable and Efficient Object Dete...2019-11-20Code
14YOLOv4-P7 CSP-P7 (single-scale, 16 fps)67.4NoScaled-YOLOv4: Scaling Cross Stage Partial Network2020-11-16Code
15DyHead (ResNeXt-64x4d-101-DCN, multi scale)66.3NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
16ResNeSt-200 (multi-scale)66.29NoResNeSt: Split-Attention Networks2020-04-19Code
17ResNeSt-200-DCN (single-scale)65.83NoResNeSt: Split-Attention Networks2020-04-19Code
18DINO-5scale (24 epoch)65.8NoDINO: DETR with Improved DeNoising Anchor Boxes ...2022-03-07Code
19UniverseNet-20.08d (Res2Net-101, DCN, single-scale)65.8NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
20DN-Deformable-DETR-R50++65.4NoDN-DETR: Accelerate DETR Training by Introducing...2022-03-02Code
21DINO-5scale (36 epoch)65.3NoDINO: DETR with Improved DeNoising Anchor Boxes ...2022-03-07Code
22YOLOR-P6 (1280, single-scale, 72 fps)65.2NoYou Only Learn One Representation: Unified Netwo...2021-05-10Code
23REGO-Deformable DETR-X10165NoRecurrent Glimpse-based Decoder for Detection wi...2021-12-09Code
24DAB-DETR-DC5-R10164.1NoDAB-DETR: Dynamic Anchor Boxes are Better Querie...2022-01-28Code
25ResNeSt-200 (single-scale)63.9NoResNeSt: Split-Attention Networks2020-04-19Code
26Conditional DETR-R10163.6NoConditional DETR for Fast Training Convergence2021-08-13Code
27Conditional DETR-DC5-R10163.3NoConditional DETR for Fast Training Convergence2021-08-13Code
28DAB-DETR-R10162.9NoDAB-DETR: Dynamic Anchor Boxes are Better Querie...2022-01-28Code
29UniverseNet-20.08 (Res2Net-50, DCN, single-scale)62.7NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
30DETR-DC5 (ResNet-101)62.3NoEnd-to-End Object Detection with Transformers2020-05-26Code
31HTC (HRNetV2p-W48)62.2NoDeep High-Resolution Representation Learning for...2019-08-20Code
32Conditional DETR-DC5-R5062.2NoConditional DETR for Fast Training Convergence2021-08-13Code
33Res2Net101+HTC62.1NoRes2Net: A New Multi-scale Backbone Architecture2019-04-02Code
34Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)61.6NoSparse R-CNN: End-to-End Object Detection with L...2020-11-25Code
35Anchor DETR-DC5-R10161.6NoAnchor DETR: Query Design for Transformer-Based ...2021-09-15Code
36Conditional DETR-R5061.5NoConditional DETR for Fast Training Convergence2021-08-13Code
37MAE-Det(MAE-Det-L+GFLV2)61.1NoMAE-DET: Revisiting Maximum Entropy Principle in...2021-11-26Code
38Anchor DETR-DC5-R5060.6NoAnchor DETR: Query Design for Transformer-Based ...2021-09-15Code
39Pix2seq (R101-DC5)60.4NoPix2seq: A Language Modeling Framework for Objec...2021-09-22Code
40Mask R-CNN (HRNetV2p-W48, cascade)60.1NoDeep High-Resolution Representation Learning for...2019-08-20Code
41HoughNet (HG-104, MS)59.7NoHoughNet: Integrating near and long-range eviden...2020-07-05Code
42Sparse R-CNN (ResNet-101, FPN)59.7NoSparse R-CNN: End-to-End Object Detection with L...2020-11-25Code
43R3-CNN (ResNet-50-FPN, DCN)59.6NoRecursively Refined R-CNN: Instance Segmentation...2021-04-03Code
44HTC (HRNetV2p-W32)59.5NoDeep High-Resolution Representation Learning for...2019-08-20Code
45Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)59.5NoSparse R-CNN: End-to-End Object Detection with L...2020-11-25Code
46PVT-Large (RetinaNet 3x,MS)59.5NoPyramid Vision Transformer: A Versatile Backbone...2021-02-24Code
47ExtremeNet (Hourglass-104, multi-scale)59.4NoBottom-up Object Detection by Grouping Extreme a...2019-01-23Code
48R3-CNN (ResNet-50-FPN, GC-Net)58.9NoRecursively Refined R-CNN: Instance Segmentation...2021-04-03Code
49CenterMask+VoVNetV2-99 (single-scale)58.8NoCenterMask : Real-Time Anchor-Free Instance Segm...2019-11-15Code
50Faster R-CNN (ResNet-101, DCNv2)58.7NoDeformable ConvNets v2: More Deformable, Better ...2018-11-27Code
51Pix2seq (R50-DC5 )58.6NoPix2seq: A Language Modeling Framework for Objec...2021-09-22Code
52Cascade R-CNN (HRNetV2p-W48)58.5NoDeep High-Resolution Representation Learning for...2019-08-20Code
53PVT-Large (RetinaNet 1x)58.4NoPyramid Vision Transformer: A Versatile Backbone...2021-02-24Code
54CornerNet-Saccade (Hourglass-54)58.4NoCornerNet-Lite: Efficient Keypoint Based Object ...2019-04-18Code
55RetinaNet (ViL-Base)58.3NoMulti-Scale Vision Longformer: A New Vision Tran...2021-03-29Code
56RetinaNet (ViL-Base, multi-scale, 3x)58.1NoMulti-Scale Vision Longformer: A New Vision Tran...2021-03-29Code
57Mask R-CNN (VoVNetV2-99, single-scale)57.7NoCenterMask : Real-Time Anchor-Free Instance Segm...2019-11-15Code
58Sparse R-CNN (ResNet-50, FPN)57.6NoSparse R-CNN: End-to-End Object Detection with L...2020-11-25Code
59Cascade R-CNN (HRNetV2p-W32)57.4NoDeep High-Resolution Representation Learning for...2019-08-20Code
60Cascade R-CNN (ResNet-101-FPN+, cascade)57.4NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
61CenterMask+X101-32x8d (single-scale)57.1NoCenterMask : Real-Time Anchor-Free Instance Segm...2019-11-15Code
62CornerNet-Saccade (Hourglass-104)57.1NoCornerNet-Lite: Efficient Keypoint Based Object ...2019-04-18Code
63TridentNet (ResNet-101)56.9NoScale-Aware Trident Networks for Object Detection2019-01-07Code
64Mask R-CNN-FPN (ResNeXt-101, GN+WS)56.39NoMicro-Batch Training with Batch-Channel Normaliz...2019-03-25Code
65ExtremeNet (Hourglass-104, single-scale)56.1NoBottom-up Object Detection by Grouping Extreme a...2019-01-23Code
66Faster RCNN-R101-FPN+56NoEnd-to-End Object Detection with Transformers2020-05-26Code
67HoughNet (HG-104)55.8NoHoughNet: Integrating near and long-range eviden...2020-07-05Code
68CenterNet511 (Hourglass-52)55.8NoCenterNet: Keypoint Triplets for Object Detection2019-04-17Code
69R3-CNN (ResNet-50-FPN)55.7NoRecursively Refined R-CNN: Instance Segmentation...2021-04-03Code
70Faster R-CNN (FPN, X-volution)55NoX-volution: On the unification of convolution an...2021-06-04-
71Faster R-CNN (HRNetV2p-W48)54.6NoDeep High-Resolution Representation Learning for...2019-08-20Code
72Grid R-CNN (ResNet-101-FPN)54.1NoGrid R-CNN2018-11-29Code
73Cascade R-CNN (HRNetV2p-W18)54.1NoDeep High-Resolution Representation Learning for...2019-08-20Code
74Cascade R-CNN (ResNet-50-FPN+)54.1NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
75Faster R-CNN (HRNetV2p-W32)53.3NoDeep High-Resolution Representation Learning for...2019-08-20Code
76FoveaBox (ResNet-101-FPN, 600x600)52.7NoFoveaBox: Beyond Anchor-based Object Detector2019-04-08Code
77FPN+52.6NoFeature Pyramid Networks for Object Detection2016-12-09Code
78GCnet (ResNet-50-FPN, GRoIE)52.5NoGCNet: Non-local Networks Meet Squeeze-Excitatio...2019-04-25Code
79HTC (cascade)52.3NoHybrid Task Cascade for Instance Segmentation2019-01-22Code
80PPDet (ResNet-101-FPN)52.3NoReducing Label Noise in Anchor-Free Object Detec...2020-08-03Code
81CornerNet511 (Hourglass-104)51.8NoCornerNet: Detecting Objects as Paired Keypoints2018-08-03Code
82FoveaBox (ResNet-101-FPN, 800x800)51.7NoFoveaBox: Beyond Anchor-based Object Detector2019-04-08Code
83Grid R-CNN (ResNet-50-FPN)51.5NoGrid R-CNN2018-11-29Code
84Faster R-CNN (Res2Net-50)51.1NoRes2Net: A New Multi-scale Backbone Architecture2019-04-02Code
85Mask R-CNN (HRNetV2p-W18)51NoDeep High-Resolution Representation Learning for...2019-08-20Code
86Libra R-CNN (ResNet-50 FPN)50.5NoLibra R-CNN: Towards Balanced Learning for Objec...2019-04-04Code
87FoveaBox (ResNet-50-FPN, 600x600)50.5NoFoveaBox: Beyond Anchor-based Object Detector2019-04-08Code
88FCOS (ResNet-50-FPN + improvements)49.8NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
89Mask R-CNN (ResNet-50-FPN, GRoIE)49.7NoA novel Region of Interest Extraction Layer for ...2020-04-28Code
90Faster R-CNN (HRNetV2p-W18)49.6NoDeep High-Resolution Representation Learning for...2019-08-20Code
91M2Det (ResNet-1o1, 320x320)49.3NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
92M2Det (VGG-16, 320x320)49.1NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
93FSAF (ResNet-50)48.2NoFeature Selective Anchor-Free Module for Single-...2019-03-02Code
94Faster R-CNN (ResNet-50-FPN, GRoIE)47.8NoA novel Region of Interest Extraction Layer for ...2020-04-28Code
95GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)46.7NoGradient Harmonized Single-stage Detector2018-11-13Code