TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Object Detection/COCO minival

Object Detection on COCO minival

Metric: APM (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕APM▼Extra DataPaperDate↕Code
1Focal-Stable-DINO (Focal-Huge, no TTA)68.5YesA Strong and Reproducible Object Detector with O...2023-04-25Code
2EVA68.4YesEVA: Exploring the Limits of Masked Visual Repre...2022-11-14Code
3UNINEXT-H64.8YesUniversal Instance Perception as Object Discover...2023-03-12Code
4DyHead (Swin-L, multi scale)62.2NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
5YOLOR-D6 (1280, single-scale, 31 fps)60.1NoYou Only Learn One Representation: Unified Netwo...2021-05-10Code
6QueryInst (single scale)59.8NoInstances as Queries2021-05-05Code
7YOLOv4-P7 CSP-P7 (single-scale, 16 fps)59.5NoScaled-YOLOv4: Scaling Cross Stage Partial Network2020-11-16Code
8EfficientDet-D7x (single-scale)58NoEfficientDet: Scalable and Efficient Object Dete...2019-11-20Code
9UniverseNet-20.08d (Res2Net-101, DCN, multi-scale)57.5NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
10YOLOR-P6 (1280, single-scale, 72 fps)57.3NoYou Only Learn One Representation: Unified Netwo...2021-05-10Code
11Cascade RCNN-RS (SpineNet-143L, single scale)56.7NoSimple Training Strategies and Model Scaling for...2021-06-30Code
12ResNeSt-200 (multi-scale)56.36NoResNeSt: Split-Attention Networks2020-04-19Code
13Cascade RCNN-RS (ResNet-200, single scale)56.2NoSimple Training Strategies and Model Scaling for...2021-06-30Code
14UniverseNet-20.08d (Res2Net-101, DCN, single-scale)55.5NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
15ResNeSt-200-DCN (single-scale)54.66NoResNeSt: Split-Attention Networks2020-04-19Code
16DINO-5scale (36 epoch)54.3NoDINO: DETR with Improved DeNoising Anchor Boxes ...2022-03-07Code
17DINO-5scale (24 epoch)54.2NoDINO: DETR with Improved DeNoising Anchor Boxes ...2022-03-07Code
18ResNeSt-200 (single-scale)54.2NoResNeSt: Split-Attention Networks2020-04-19Code
19UniverseNet-20.08 (Res2Net-50, DCN, single-scale)52.7NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
20DN-Deformable-DETR-R50++52.6NoDN-DETR: Accelerate DETR Training by Introducing...2022-03-02Code
21REGO-Deformable DETR-X10152.6NoRecurrent Glimpse-based Decoder for Detection wi...2021-12-09Code
22MAE-Det(MAE-Det-L+GFLV2)51.9NoMAE-DET: Revisiting Maximum Entropy Principle in...2021-11-26Code
23Res2Net101+HTC51.6NoRes2Net: A New Multi-scale Backbone Architecture2019-04-02Code
24DAB-DETR-DC5-R10150.5NoDAB-DETR: Dynamic Anchor Boxes are Better Querie...2022-01-28Code
25HTC (HRNetV2p-W48)50.3NoDeep High-Resolution Representation Learning for...2019-08-20Code
26Conditional DETR-DC5-R10150.3NoConditional DETR for Fast Training Convergence2021-08-13Code
27DETR-DC5 (ResNet-101)49.5NoEnd-to-End Object Detection with Transformers2020-05-26Code
28Anchor DETR-DC5-R10149.4NoAnchor DETR: Query Design for Transformer-Based ...2021-09-15Code
29Conditional DETR-DC5-R5049NoConditional DETR for Fast Training Convergence2021-08-13Code
30Pix2seq (R101-DC5)48.9NoPix2seq: A Language Modeling Framework for Objec...2021-09-22Code
31HoughNet (HG-104, MS)48.8NoHoughNet: Integrating near and long-range eviden...2020-07-05Code
32HTC (HRNetV2p-W32)48.4NoDeep High-Resolution Representation Learning for...2019-08-20Code
33Conditional DETR-R10148.4NoConditional DETR for Fast Training Convergence2021-08-13Code
34Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)48.3NoSparse R-CNN: End-to-End Object Detection with L...2020-11-25Code
35R3-CNN (ResNet-50-FPN, DCN)48.3NoRecursively Refined R-CNN: Instance Segmentation...2021-04-03Code
36CenterMask+VoVNetV2-57 (single-scale)48.3NoCenterMask : Real-Time Anchor-Free Instance Segm...2019-11-15Code
37Anchor DETR-DC5-R5048.2NoAnchor DETR: Query Design for Transformer-Based ...2021-09-15Code
38DAB-DETR-R10148.2NoDAB-DETR: Dynamic Anchor Boxes are Better Querie...2022-01-28Code
39Cascade R-CNN (HRNetV2p-W48)48.1NoDeep High-Resolution Representation Learning for...2019-08-20Code
40Faster RCNN-R101-FPN+48.1NoEnd-to-End Object Detection with Transformers2020-05-26Code
41RetinaNet (ViL-Base, multi-scale, 3x)48NoMulti-Scale Vision Longformer: A New Vision Tran...2021-03-29Code
42RetinaNet (ViL-Base)47.9NoMulti-Scale Vision Longformer: A New Vision Tran...2021-03-29Code
43Mask R-CNN (HRNetV2p-W32, cascade)47.9NoDeep High-Resolution Representation Learning for...2019-08-20Code
44HoughNet (HG-104)47.6NoHoughNet: Integrating near and long-range eviden...2020-07-05Code
45Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)47.2NoSparse R-CNN: End-to-End Object Detection with L...2020-11-25Code
46Mask R-CNN-FPN (ResNeXt-101, GN+WS)47.19NoMicro-Batch Training with Batch-Channel Normaliz...2019-03-25Code
47R3-CNN (ResNet-50-FPN, GC-Net)47.1NoRecursively Refined R-CNN: Instance Segmentation...2021-04-03Code
48Pix2seq (R50-DC5 )47NoPix2seq: A Language Modeling Framework for Objec...2021-09-22Code
49TridentNet (ResNet-101)47NoScale-Aware Trident Networks for Object Detection2019-01-07Code
50Conditional DETR-R5046.7NoConditional DETR for Fast Training Convergence2021-08-13Code
51ExtremeNet (Hourglass-104, multi-scale)46.6NoBottom-up Object Detection by Grouping Extreme a...2019-01-23Code
52Cascade R-CNN (HRNetV2p-W32)46.5NoDeep High-Resolution Representation Learning for...2019-08-20Code
53Sparse R-CNN (ResNet-101, FPN)46.3NoSparse R-CNN: End-to-End Object Detection with L...2020-11-25Code
54Cascade R-CNN (ResNet-101-FPN+, cascade)46.2NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
55PVT-Large (RetinaNet 3x,MS)46NoPyramid Vision Transformer: A Versatile Backbone...2021-02-24Code
56HTC (HRNetV2p-W18)46NoDeep High-Resolution Representation Learning for...2019-08-20Code
57Faster R-CNN (FPN, X-volution)46NoX-volution: On the unification of convolution an...2021-06-04-
58PVT-Large (RetinaNet 1x)46NoPyramid Vision Transformer: A Versatile Backbone...2021-02-24Code
59Faster R-CNN (LIP-ResNet-101)45.8NoLIP: Local Importance-based Pooling2019-08-12Code
60Faster R-CNN (ResNet-101, DCNv2)45.8NoDeformable ConvNets v2: More Deformable, Better ...2018-11-27Code
61Grid R-CNN (ResNet-101-FPN)45.8NoGrid R-CNN2018-11-29Code
62Mask R-CNN (HRNetV2p-W32)45.4NoDeep High-Resolution Representation Learning for...2019-08-20Code
63R3-CNN (ResNet-50-FPN)45.2NoRecursively Refined R-CNN: Instance Segmentation...2021-04-03Code
64Faster R-CNN (HRNetV2p-W48)44.7NoDeep High-Resolution Representation Learning for...2019-08-20Code
65PPDet (ResNet-101-FPN)44.7NoReducing Label Noise in Anchor-Free Object Detec...2020-08-03Code
66Sparse R-CNN (ResNet-50, FPN)44.6NoSparse R-CNN: End-to-End Object Detection with L...2020-11-25Code
67GCnet (ResNet-50-FPN, GRoIE)44.4NoGCNet: Non-local Networks Meet Squeeze-Excitatio...2019-04-25Code
68CornerNet-Saccade (Hourglass-54)44.3NoCornerNet-Lite: Efficient Keypoint Based Object ...2019-04-18Code
69Cascade R-CNN (HRNetV2p-W18)44.2NoDeep High-Resolution Representation Learning for...2019-08-20Code
70ExtremeNet (Hourglass-104, single-scale)44NoBottom-up Object Detection by Grouping Extreme a...2019-01-23Code
71CenterNet511 (Hourglass-52)43.8NoCenterNet: Keypoint Triplets for Object Detection2019-04-17Code
72Grid R-CNN (ResNet-50-FPN)43.8NoGrid R-CNN2018-11-29Code
73Faster R-CNN (HRNetV2p-W32)43.7NoDeep High-Resolution Representation Learning for...2019-08-20Code
74Cascade R-CNN (ResNet-50-FPN+)43.7NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
75CornerNet-Saccade (Hourglass-104)43.5NoCornerNet-Lite: Efficient Keypoint Based Object ...2019-04-18Code
76FoveaBox (ResNet-101-FPN, 800x800)43.5NoFoveaBox: Beyond Anchor-based Object Detector2019-04-08Code
77FPN+43.3NoFeature Pyramid Networks for Object Detection2016-12-09Code
78FCOS (ResNet-50-FPN + improvements)42.5NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
79Shift-T42.3NoWhen Shift Operation Meets Vision Transformer: A...2022-01-26Code
80FoveaBox (ResNet-101-FPN, 600x600)42.2NoFoveaBox: Beyond Anchor-based Object Detector2019-04-08Code
81Libra R-CNN (ResNet-50 FPN)42.1NoLibra R-CNN: Towards Balanced Learning for Objec...2019-04-04Code
82Mask R-CNN (ResNet-50-FPN, GRoIE)42.1NoA novel Region of Interest Extraction Layer for ...2020-04-28Code
83Mask R-CNN (HRNetV2p-W18)41.7NoDeep High-Resolution Representation Learning for...2019-08-20Code
84Faster R-CNN (ResNet-50-FPN, GRoIE)41.5NoA novel Region of Interest Extraction Layer for ...2020-04-28Code
85HTC (cascade)40.9NoHybrid Task Cascade for Instance Segmentation2019-01-22Code
86Faster R-CNN (HRNetV2p-W18)40.8NoDeep High-Resolution Representation Learning for...2019-08-20Code
87CornerNet511 (Hourglass-104)40.5NoCornerNet: Detecting Objects as Paired Keypoints2018-08-03Code
88FSAF (ResNet-50)39.6NoFeature Selective Anchor-Free Module for Single-...2019-03-02Code
89GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)39.6NoGradient Harmonized Single-stage Detector2018-11-13Code
90M2Det (ResNet-1o1, 320x320)39.5NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
91FoveaBox (ResNet-50-FPN, 600x600)39.4NoFoveaBox: Beyond Anchor-based Object Detector2019-04-08Code
92Faster R-CNN (Res2Net-50)38.3NoRes2Net: A New Multi-scale Backbone Architecture2019-04-02Code
93M2Det (VGG-16, 320x320)38.2NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code