TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Methodology/16k/COCO test-dev

16k on COCO test-dev

Metric: APS (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕APS▼AugmentationsPaperDate↕Code
1Focal-Stable-DINO (Focal-Huge, no TTA)48.6NoA Strong and Reproducible Object Detector with O...2023-04-25Code
2EVA48.5NoEVA: Exploring the Limits of Masked Visual Repre...2022-11-14Code
3Group DETR v248.4NoGroup DETR v2: Strong Object Detector with Encod...2022-11-07-
4Plain-DETR (Swin-L)48.2No--Code
5Relation-DETR (Focal-L)47.2NoRelation DETR: Exploring Explicit Position Relat...2024-07-16Code
6DETA (Swin-L)46.1NoNMS Strikes Back2022-12-12Code
7GLIP (Swin-L, multi-scale)45.3NoGrounded Language-Image Pre-training2021-12-07Code
8PyCenterNet (Swin-L, multi-scale)38.7NoCenterNet++ for Object Detection2022-04-18Code
9CenterNet2 (Res2Net-101-DCN-BiFPN, self-training, 1560 single-scale)38.7NoProbabilistic two-stage detection2021-03-12Code
10DetectoRS (ResNeXt-101-64x4d, multi-scale)37.7NoDetectoRS: Detecting Objects with Recursive Feat...2020-06-03Code
11SOLQ (Swin-L, single scale)37.6NoSOLQ: Segmenting Objects by Learning Queries2021-06-04Code
12QueryInst (single-scale)37.4NoInstances as Queries2021-05-05Code
13DetectoRS (ResNeXt-101-32x4d, multi-scale)37.4NoDetectoRS: Detecting Objects with Recursive Feat...2020-06-03Code
14YOLOv4-P6 CSP-P6 (single-scale, 32 fps)36.6NoScaled-YOLOv4: Scaling Cross Stage Partial Network2020-11-16Code
15PAA (ResNext-152-32x8d + DCN, multi-scale)36NoProbabilistic Anchor Assignment with IoU Predict...2020-07-16Code
16UniverseNet-20.08d (Res2Net-101, DCN, multi-scale)35.8NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
17Mask R-CNN (ResNet-101-FPN, CBN)35.8NoCross-Iteration Batch Normalization2020-02-13Code
18GFLV2 (Res2Net-101, DCN, multiscale)35.7NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
19Cascade Mask R-CNN (Triple-ResNeXt152, multi-scale)35.5NoCBNet: A Novel Composite Backbone Network Archit...2019-09-09Code
20RetinaNet (SpineNet-190, 1280x1280)35.4NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
21LSNet (Res2Net-101+ DCN, multi-scale)35.2NoLocation-Sensitive Visual Recognition with Cross...2021-04-11Code
22ResNeSt-200 (multi-scale)35.1NoResNeSt: Split-Attention Networks2020-04-19Code
23RepPoints v2 (ResNeXt-101, DCN, multi-scale)34.5NoRepPoints V2: Verification Meets Regression for ...2020-07-16Code
24Deformable DETR (ResNeXt-101+DCN)34.4NoDeformable DETR: Deformable Transformers for End...2020-10-08Code
25AC-FPN Cascade R-CNN (X-152-32x8d-FPN-IN5k, multi scale, only CEM)34.2NoAttention-guided Context Feature Pyramid Network...2020-05-23Code
26NAS-FPN (AmoebaNet-D, learned aug)34.2NoLearning Data Augmentation Strategies for Object...2019-06-26Code
27OTA (ResNeXt-101+DCN, multiscale)34.1NoOTA: Optimal Transport Assignment for Object Det...2021-03-26Code
28DetectoRS (ResNeXt-101-32x4d, single-scale)33.9NoDetectoRS: Detecting Objects with Recursive Feat...2020-06-03Code
29TSD(SENet154-DCN,multi-scale)33.8NoRevisiting the Sibling Head in Object Detector2020-03-17Code
30RetinaNet (SpineNet-143, 1280x1280)33.6NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
31PP-YOLOE-x(CSPRepResNet-x, 640x640, single-scale )33.3NoPP-YOLOE: An evolved version of YOLO2022-03-30Code
32ATSS (ResNetXt-64x4d-101+DCN,multi-scale)33.2NoBridging the Gap Between Anchor-based and Anchor...2019-12-05Code
33Dynamic R-CNN (ResNet-101-DCN, multi-scale)32.8NoDynamic R-CNN: Towards High Quality Object Detec...2020-04-13Code
34D2Det (ResNet-101-DCN, multi-scale test)32.7No--Code
35TSD(ResNet-101-Deformable, Image Pyramid)32.7NoRevisiting the Sibling Head in Object Detector2020-03-17Code
36DAT-S (RetinaNet)32.3NoVision Transformer with Deformable Attention2022-01-03Code
37aLRP Loss (ResNext-101-64x4d, DCN, multiscale test)32NoA Ranking-based, Balanced Loss Function Unifying...2020-09-28Code
38RetinaNet (SpineNet-96, 1024x1024)32NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
39TridentNet (ResNet-101-Deformable, Image Pyramid)31.8NoScale-Aware Trident Networks for Object Detection2019-01-07Code
40UniverseNet-20.08d (Res2Net-101, DCN, single-scale)31.7NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
41PP-YOLOE-l(CSPRepResNet-l, 640x640, single-scale )31.4NoPP-YOLOE: An evolved version of YOLO2022-03-30Code
42PPDet (ResNeXt-101-FPN, multiscale)31.4NoReducing Label Noise in Anchor-Free Object Detec...2020-08-03Code
43GFLV2 (Res2Net-101, DCN)31.3NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
44FreeAnchor + SEPC (DCN, ResNext-101-64x4d)31.3NoScale-Equalizing Pyramid Convolution for Object ...2020-05-06Code
45YOLOX-X (Modified CSP v5)31.2NoYOLOX: Exceeding YOLO Series in 20212021-07-18Code
46CPNDet (Hourglass-104, multi-scale)31NoCorner Proposal Network for Anchor-free, Two-sta...2020-07-27Code
47aLRP Loss (ResNext-101-64x4d, DCN, single scale)30.8NoA Ranking-based, Balanced Loss Function Unifying...2020-09-28Code
48RepPoints v2 (ResNeXt-101, DCN)30.3NoRepPoints V2: Verification Meets Regression for ...2020-07-16Code
49RPDet (ResNet-101-DCN, multi-scale)30.3NoRepPoints: Point Set Representation for Object D...2019-04-25Code
50aLRP Loss (ResNext-101-64x4d, single scale)30.2NoA Ranking-based, Balanced Loss Function Unifying...2020-09-28Code
51UniverseNet-20.08 (Res2Net-50, DCN, single-scale)30.1NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
52PANet (ResNeXt-101, multi-scale)30.1NoPath Aggregation Network for Instance Segmentation2018-03-05Code
53GFLV2 (ResNeXt-101, 32x4d, DCN)29.7NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
54MatrixNet Corners (ResNet-152, multi-scale)29.7NoMatrix Nets: A New Deep Architecture for Object ...2019-08-13Code
55FSAF (ResNeXt-101, multi-scale)29.7NoFeature Selective Anchor-Free Module for Single-...2019-03-02Code
56SNIPER (ResNet-101)29.6NoSNIPER: Efficient Multi-Scale Training2018-05-23Code
57M2Det (ResNet-101, multi-scale)29.6NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
58D-RFCN + SNIP (DPN-98 with flip, multi-scale)29.3NoAn Analysis of Scale Invariance in Object Detect...2017-11-22-
59GFL (X-101-32x4d-DCN, single-scale)29.2NoGeneralized Focal Loss: Learning Qualified and D...2020-06-08Code
60M2Det (VGG-16, multi-scale)29.2NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
61RetinaNet (SpineNet-49, 896x896)29.1NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
62HoughNet (MS)29.1NoHoughNet: Integrating near and long-range eviden...2020-07-05Code
63CenterNet511 (Hourglass-104, multi-scale)28.9NoCenterNet: Keypoint Triplets for Object Detection2019-04-17Code
64GFLV2 (ResNet-101-DCN)28.8NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
65ISTR (ResNet101-FPN-3x, single-scale)28.7NoISTR: End-to-End Instance Segmentation with Tran...2021-05-03Code
66PP-YOLOE-m(CSPRepResNet-m, 640x640, single-scale )28.6NoPP-YOLOE: An evolved version of YOLO2022-03-30Code
67SAPD (ResNeXt-101, single-scale)28.1NoSoft Anchor-Point Object Detection2019-11-27Code
68HTC (HRNetV2p-W48)28NoDeep High-Resolution Representation Learning for...2019-08-20Code
69ISTR (ResNet50-FPN-3x, single-scale)27.8NoISTR: End-to-End Instance Segmentation with Tran...2021-05-03Code
70GFLV2 (ResNet-101)27.8NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
71DCNv2 (ResNet-101, multi-scale)27.8NoDeformable ConvNets v2: More Deformable, Better ...2018-11-27Code
72CenterMask+VoVNetV2-99 (single-scale)27.8NoCenterMask : Real-Time Anchor-Free Instance Segm...2019-11-15Code
73FCOS (ResNeXt-64x4d-101-FPN 4 + improvements)27.6NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
74InterNet (ResNet-101-FPN, multi-scale)27.2NoFeature Intertwiner for Object Detection2019-03-28Code
75D-RFCN + SNIP (ResNet-101, multi-scale)27.2NoAn Analysis of Scale Invariance in Object Detect...2017-11-22-
76Mask R-CNN (HRNetV2p-W48 + cascade)27.1NoDeep High-Resolution Representation Learning for...2019-08-20Code
77CenterMask+VoVNet2-57 (single-scale)27.1NoCenterMask : Real-Time Anchor-Free Instance Segm...2019-11-15Code
78YOLOv4 (CD53)27YesScaled-YOLOv4: Scaling Cross Stage Partial Network2020-11-16Code
79FreeAnchor (ResNeXt-101)27NoFreeAnchor: Learning to Match Anchors for Visual...2019-09-05Code
80YOLOv3 @800 + ASFF* (Darknet-53)27YesLearning Spatial Fusion for Single-Shot Object D...2019-11-21Code
81AC-FPN Cascade R-CNN(ResNet-101, single scale)26.9NoAttention-guided Context Feature Pyramid Network...2020-05-23Code
82GFLV2 (ResNet-50)26.8NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
83FoveaBox (ResNeXt-101)26.8NoFoveaBox: Beyond Anchor-based Object Detector2019-04-08Code
84YOLOv4-60826.7YesYOLOv4: Optimal Speed and Accuracy of Object Det...2020-04-23Code
85FCOS (ResNeXt-101-64x4d-FPN)26.5NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
86Cascade R-CNN-FPN (ResNet-101, map-guided)26.3NoInstaBoost: Boosting Instance Segmentation via P...2019-08-21Code
87SNIPER (ResNet-50)26.1NoSNIPER: Efficient Multi-Scale Training2018-05-23Code
88FCOS (ResNeXt-32x8d-101-FPN)26NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
89RetinaNet (SpineNet-49, 640x640)25.9NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
90RefineDet512+ (ResNet-101)25.6NoSingle-Shot Refinement Neural Network for Object...2017-11-18Code
91Faster R-CNN (LIP-ResNet-101-MD w FPN)25.4NoLIP: Local Importance-based Pooling2019-08-12Code
92FCOS (HRNet-W32-5l)25.4NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
93Libra R-CNN (ResNeXt-101-FPN)25.3NoLibra R-CNN: Towards Balanced Learning for Objec...2019-04-04Code
94Grid R-CNN (ResNeXt-101-FPN)25.1NoGrid R-CNN2018-11-29Code
95RPDet (ResNet-101-DCN)24.9NoRepPoints: Point Set Representation for Object D...2019-04-25Code
96Faster R-CNN (HRNetV2p-W48)24.9NoDeep High-Resolution Representation Learning for...2019-08-20Code
97RetinaMask (ResNeXt-101-FPN-GN)24.8NoRetinaMask: Learning to predict masks improves s...2019-01-10Code
98aLRP Loss (ResNext-101, DCN, 500 scale)24.6NoA Ranking-based, Balanced Loss Function Unifying...2020-09-28Code
99ResNet-50-DW-DPN (Deformable Kernels)24.6NoDeformable Kernels: Adapting Effective Receptive...2019-10-07Code
100CornerNet-Saccade (Hourglass-104, multi-scale)24.4NoCornerNet-Lite: Efficient Keypoint Based Object ...2019-04-18Code
101ExtremeNet (Hourglass-104, multi-scale)24.1NoBottom-up Object Detection by Grouping Extreme a...2019-01-23Code
102RetinaNet (ResNeXt-101-FPN)24.1NoFocal Loss for Dense Object Detection2017-08-07Code
103YOLOF-DC524NoYou Only Look One-level Feature2021-03-17Code
104FSAF (ResNet-101, single-scale)24NoFeature Selective Anchor-Free Module for Single-...2019-03-02Code
105TridentNet (ResNet-101)23.9NoScale-Aware Trident Networks for Object Detection2019-01-07Code
106SpineNet-49 (640, RetinaNet, single-scale)23.7NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
107Cascade R-CNN (ResNet-101-FPN+, cascade)23.7NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
108Cascade R-CNN23.7NoCascade R-CNN: High Quality Object Detection and...2019-06-24Code
109RPDet (ResNet-101)23.6NoRepPoints: Point Set Representation for Object D...2019-04-25Code
110FCOS (HRNetV2p-W48)23.4YesDeep High-Resolution Representation Learning for...2019-08-20Code
111RetinaNet (SpineNet-49S, 640x640)23.3NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
112PP-YOLOE-s(CSPRepResNet-s, 640x640, single-scale )23.2NoPP-YOLOE: An evolved version of YOLO2022-03-30Code
113HTC (ResNeXt-101-FPN)22.8NoHybrid Task Cascade for Instance Segmentation2019-01-22Code
114HSD (Rest101, 768x768, single-scale test)22.8No--Code
115RefineDet512+ (VGG-16)22.7NoSingle-Shot Refinement Neural Network for Object...2017-11-18Code
116Cascade R-CNN (ResNet-50-FPN+, cascade)22.6NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
117GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)22.3NoGradient Harmonized Single-stage Detector2018-11-13Code
118CenterNet (HRNetV2-W48)22.2NoDeep High-Resolution Representation Learning for...2019-08-20Code
119M2Det (VGG-16, single-scale)22.1NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
120RDSNet (ResNet-101, RetinaNet, mask, MBRM)22.1NoRDSNet: A New Deep Architecture for Reciprocal O...2019-12-11Code
121Fast R-CNN (Cascade RPN)22.1YesCascade RPN: Delving into High-Quality Region Pr...2019-09-15Code
122Mask R-CNN (ResNeXt-101-FPN)22.1NoMask R-CNN2017-03-20Code
123Faster R-CNN (Cascade RPN)22YesCascade RPN: Delving into High-Quality Region Pr...2019-09-15Code
124RetinaMask (ResNet-50-FPN)21.9NoRetinaMask: Learning to predict masks improves s...2019-01-10Code
125GA-Faster-RCNN21.8NoRegion Proposal by Guided Anchoring2019-01-10Code
126RetinaNet (ResNet-101-FPN)21.8NoFocal Loss for Dense Object Detection2017-08-07Code
127CenterNet-DLA (DLA-34, multi-scale)21.5NoObjects as Points2019-04-16Code
128Cascade R-CNN (ResNet-101-FPN+)21.3NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
129CornerNet511 (Hourglass-104, multi-scale)20.8NoCornerNet: Detecting Objects as Paired Keypoints2018-08-03Code
130M2Det (ResNet-101, single-scale)20.5NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
131ExtremeNet (Hourglass-104, single-scale)20.4NoBottom-up Object Detection by Grouping Extreme a...2019-01-23Code
132Cascade R-CNN (ResNet-50-FPN+)20.3NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
133Mask R-CNN (ResNet-101-FPN)20.1NoMask R-CNN2017-03-20Code
134DeformConv-R-FCN (Aligned-Inception-ResNet)19.4NoDeformable Convolutional Networks2017-03-17Code
135SaccadeNet (DLA-34-DCN)19.2NoSaccadeNet: A Fast and Accurate Object Detector2020-03-26Code
136Faster R-CNN (ImageNet+300M)17.5NoRevisiting Unreasonable Effectiveness of Data in...2017-07-10Code
137CornerNet511 (Hourglass-52, single-scale)17NoCornerNet: Detecting Objects as Paired Keypoints2018-08-03Code
138RefineDet512 (ResNet-101)16.6NoSingle-Shot Refinement Neural Network for Object...2017-11-18Code