Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

2D Object Detection on COCO test-dev

Metric: box mAP (higher is better)
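
For readers unfamiliar with the metric, the sketch below illustrates the IoU thresholds behind box mAP under the COCO convention, where average precision is averaged over ten IoU thresholds from 0.50 to 0.95 in steps of 0.05. It is a simplified illustration, not the official evaluation code: the names `box_iou`, `IOU_THRESHOLDS`, and `thresholds_matched` are hypothetical, boxes are assumed to be `(x1, y1, x2, y2)` tuples, and the example only shows at which thresholds a single prediction would count as a match, not a full precision-recall computation.

```python
# Simplified illustration of the IoU thresholds behind COCO-style box mAP.
# Helper names are hypothetical; this is not the official COCO evaluation code.

def box_iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    # Clamp to zero so disjoint boxes yield an empty intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

# The ten thresholds 0.50, 0.55, ..., 0.95 over which AP is averaged.
IOU_THRESHOLDS = [round(0.50 + 0.05 * i, 2) for i in range(10)]

def thresholds_matched(pred, truth):
    """Return the thresholds at which `pred` counts as a match for `truth`."""
    iou = box_iou(pred, truth)
    return [t for t in IOU_THRESHOLDS if iou >= t]

if __name__ == "__main__":
    truth = (0, 0, 10, 10)
    pred = (0, 0, 10, 8)  # covers 80% of the ground-truth box
    print(box_iou(pred, truth))
    print(thresholds_matched(pred, truth))
```

A prediction with a tighter fit matches at more thresholds, which is why this averaged metric rewards precise localization more than a single IoU cutoff of 0.50 would.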

Results

| # | Model | box mAP | Augmentations | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Co-DETR | 66 | No | DETRs with Collaborative Hybrid Assignments Trai... | 2022-11-22 | Code |
| 2 | InternImage-H (M3I Pre-training) | 65.5 | No | InternImage: Exploring Large-Scale Vision Founda... | 2022-11-10 | Code |
| 3 | M3I Pre-training (InternImage-H) | 65.4 | No | Towards All-in-one Pre-training via Maximizing M... | 2022-11-17 | Code |
| 4 | MoCaE | 65.1 | No | MoCaE: Mixture of Calibrated Experts Significant... | 2023-09-26 | Code |
| 5 | Focal-Stable-DINO (Focal-Huge, no TTA) | 64.8 | No | A Strong and Reproducible Object Detector with O... | 2023-04-25 | Code |
| 6 | Co-DETR (Swin-L) | 64.8 | No | DETRs with Collaborative Hybrid Assignments Trai... | 2022-11-22 | Code |
| 7 | EVA | 64.7 | No | EVA: Exploring the Limits of Masked Visual Repre... | 2022-11-14 | Code |
| 8 | Group DETR v2 | 64.5 | No | Group DETR v2: Strong Object Detector with Encod... | 2022-11-07 | - |
| 9 | FocalNet-H (DINO) | 64.4 | No | Focal Modulation Networks | 2022-03-22 | Code |
| 10 | InternImage-XL | 64.3 | No | InternImage: Exploring Large-Scale Vision Founda... | 2022-11-10 | Code |
| 11 | FD-SwinV2-G | 64.2 | No | Contrastive Learning Rivals Masked Image Modelin... | 2022-05-27 | Code |
| 12 | Plain-DETR (Swin-L) | 63.9 | No | - | - | Code |
| 13 | RevCol-H(DINO) | 63.8 | No | Reversible Column Networks | 2022-12-22 | Code |
| 14 | BEiT-3 | 63.7 | No | Image as a Foreign Language: BEiT Pretraining fo... | 2022-08-22 | Code |
| 15 | Relation-DETR (Focal-L) | 63.5 | No | Relation DETR: Exploring Explicit Position Relat... | 2024-07-16 | Code |
| 16 | DETA (Swin-L) | 63.5 | No | NMS Strikes Back | 2022-12-12 | Code |
| 17 | DINO (Swin-L,multi-scale, TTA) | 63.3 | No | DINO: DETR with Improved DeNoising Anchor Boxes ... | 2022-03-07 | Code |
| 18 | SwinV2-G (HTC++) | 63.1 | No | Swin Transformer V2: Scaling Up Capacity and Res... | 2021-11-18 | Code |
| 19 | Grounding DINO | 63 | No | Grounding DINO: Marrying DINO with Grounded Pre-... | 2023-03-09 | Code |
| 20 | Florence-CoSwin-H | 62.4 | No | Florence: A New Foundation Model for Computer Vi... | 2021-11-22 | Code |
| 21 | GLIPv2 (CoSwin-H, multi-scale) | 62.4 | No | GLIPv2: Unifying Localization and Vision-Languag... | 2022-06-12 | Code |
| 22 | GLEE-Pro | 62.3 | No | General Object Foundation Model for Images and V... | 2023-12-14 | Code |
| 23 | GLIP (Swin-L, multi-scale) | 61.5 | No | Grounded Language-Image Pre-training | 2021-12-07 | Code |
| 24 | Soft Teacher + Swin-L (HTC++, multi-scale) | 61.3 | No | End-to-End Semi-Supervised Object Detection with... | 2021-06-16 | Code |
| 25 | ViT-Adapter-L (HTC++, BEiTv2 pretrain, multi-scale) | 60.9 | No | Vision Transformer Adapter for Dense Predictions | 2022-05-17 | Code |
| 26 | DyHead (Swin-L, multi scale, self-training) | 60.6 | No | Dynamic Head: Unifying Object Detection Heads wi... | 2021-06-15 | Code |
| 27 | GLEE-Plus | 60.6 | No | General Object Foundation Model for Images and V... | 2023-12-14 | Code |
| 28 | ViT-Adapter-L (HTC++, BEiT pretrain, multi-scale) | 60.4 | No | Vision Transformer Adapter for Dense Predictions | 2022-05-17 | Code |
| 29 | GRiT (ViT-H, single-scale testing) | 60.4 | No | GRiT: A Generative Region-to-text Transformer fo... | 2022-12-01 | Code |
| 30 | CBNetV2 (Dual-Swin-L HTC, multi-scale) | 60.1 | No | CBNet: A Composite Backbone Network Architecture... | 2021-07-01 | Code |
| 31 | PIIP-H6B (DINO) | 60 | No | Parameter-Inverted Image Pyramid Networks | 2024-06-06 | Code |
| 32 | CBNetV2 (Dual-Swin-L HTC, single-scale) | 59.4 | No | CBNet: A Composite Backbone Network Architecture... | 2021-07-01 | Code |
| 33 | Focal-L (DyHead, multi-scale) | 58.9 | No | Focal Self-attention for Local-Global Interactio... | 2021-07-01 | Code |
| 34 | DyHead (Swin-L, multi scale) | 58.7 | No | Dynamic Head: Unifying Object Detection Heads wi... | 2021-06-15 | Code |
| 35 | Swin-L (HTC++, multi scale) | 58.7 | No | Swin Transformer: Hierarchical Vision Transforme... | 2021-03-25 | Code |
| 36 | Swin-L (HTC++, single scale) | 57.7 | No | Swin Transformer: Hierarchical Vision Transforme... | 2021-03-25 | Code |
| 37 | Cascade Eff-B7 NAS-FPN (1280, self-training Copy Paste, single-scale) | 57.3 | No | Simple Copy-Paste is a Strong Data Augmentation ... | 2020-12-13 | Code |
| 38 | PyCenterNet (Swin-L, multi-scale) | 57.1 | No | CenterNet++ for Object Detection | 2022-04-18 | Code |
| 39 | dBOT ViT-L (CLIP) | 56.8 | No | Exploring Target Representations for Masked Auto... | 2022-09-08 | Code |
| 40 | YOLOv7-D6 (44 fps) | 56.6 | Yes | YOLOv7: Trainable bag-of-freebies sets new state... | 2022-07-06 | Code |
| 41 | SOLQ (Swin-L, single scale) | 56.5 | No | SOLQ: Segmenting Objects by Learning Queries | 2021-06-04 | Code |
| 42 | CenterNet2 (Res2Net-101-DCN-BiFPN, self-training, 1560 single-scale) | 56.4 | No | Probabilistic two-stage detection | 2021-03-12 | Code |
| 43 | ISTR (ResNet50-FPN-3x, single-scale) | 56.4 | No | ISTR: End-to-End Instance Segmentation with Tran... | 2021-05-03 | Code |
| 44 | QueryInst (single-scale) | 56.1 | No | Instances as Queries | 2021-05-05 | Code |
| 45 | dBOT ViT-L | 56.1 | No | Exploring Target Representations for Masked Auto... | 2022-09-08 | Code |
| 46 | YOLOv7-E6 (56 fps) | 56 | No | YOLOv7: Trainable bag-of-freebies sets new state... | 2022-07-06 | Code |
| 47 | YOLOv4-P7 with TTA | 55.8 | No | Scaled-YOLOv4: Scaling Cross Stage Partial Network | 2020-11-16 | Code |
| 48 | DetectoRS (ResNeXt-101-64x4d, multi-scale) | 55.7 | No | DetectoRS: Detecting Objects with Recursive Feat... | 2020-06-03 | Code |
| 49 | YOLOR-D6 (1280, single-scale, 30 fps) | 55.4 | No | You Only Learn One Representation: Unified Netwo... | 2021-05-10 | Code |
| 50 | YOLOv4-P6 with TTA | 54.9 | No | Scaled-YOLOv4: Scaling Cross Stage Partial Network | 2020-11-16 | Code |
| 51 | YOLOv7-W6 (84 fps) | 54.9 | No | YOLOv7: Trainable bag-of-freebies sets new state... | 2022-07-06 | Code |
| 52 | Cascade Eff-B7 NAS-FPN (1280) | 54.8 | No | Simple Copy-Paste is a Strong Data Augmentation ... | 2020-12-13 | Code |
| 53 | DetectoRS (ResNeXt-101-32x4d, multi-scale) | 54.7 | No | DetectoRS: Detecting Objects with Recursive Feat... | 2020-06-03 | Code |
| 54 | GLEE-Lite | 54.7 | No | General Object Foundation Model for Images and V... | 2023-12-14 | Code |
| 55 | YOLOv4-P6 CSP-P6 (single-scale, 32 fps) | 54.3 | No | Scaled-YOLOv4: Scaling Cross Stage Partial Network | 2020-11-16 | Code |
| 56 | SpineNet-190 (1280, with Self-training on OpenImages, single-scale) | 54.3 | No | Rethinking Pre-training and Self-training | 2020-06-11 | Code |
| 57 | UniverseNet-20.08d (Res2Net-101, DCN, multi-scale) | 54.1 | No | USB: Universal-Scale Object Detection Benchmark | 2021-03-25 | Code |
| 58 | DyHead (ResNeXt-64x4d-101-DCN, multi scale) | 54 | No | Dynamic Head: Unifying Object Detection Heads wi... | 2021-06-15 | Code |
| 59 | dBOT ViT-B (CLIP) | 53.6 | No | Exploring Target Representations for Masked Auto... | 2022-09-08 | Code |
| 60 | PAA (ResNext-152-32x8d + DCN, multi-scale) | 53.5 | No | Probabilistic Anchor Assignment with IoU Predict... | 2020-07-16 | Code |
| 61 | LSNet (Res2Net-101+ DCN, multi-scale) | 53.5 | No | Location-Sensitive Visual Recognition with Cross... | 2021-04-11 | Code |
| 62 | dBOT ViT-B | 53.5 | No | Exploring Target Representations for Masked Auto... | 2022-09-08 | Code |
| 63 | ResNeSt-200 (multi-scale) | 53.3 | No | ResNeSt: Split-Attention Networks | 2020-04-19 | Code |
| 64 | Cascade Mask R-CNN (Triple-ResNeXt152, multi-scale) | 53.3 | No | CBNet: A Novel Composite Backbone Network Archit... | 2019-09-09 | Code |
| 65 | DetectoRS (ResNeXt-101-32x4d, single-scale) | 53.3 | No | DetectoRS: Detecting Objects with Recursive Feat... | 2020-06-03 | Code |
| 66 | GFLV2 (Res2Net-101, DCN, multiscale) | 53.3 | No | Generalized Focal Loss V2: Learning Reliable Loc... | 2020-11-25 | Code |
| 67 | YOLOv7-X (114 fps) | 53.1 | Yes | YOLOv7: Trainable bag-of-freebies sets new state... | 2022-07-06 | Code |
| 68 | RelationNet++ (ResNeXt-64x4d-101-DCN) | 52.7 | No | RelationNet++: Bridging Visual Representations f... | 2020-10-29 | Code |
| 69 | EfficientDet-D7 (1536) | 52.6 | Yes | EfficientDet: Scalable and Efficient Object Dete... | 2019-11-20 | Code |
| 70 | YOLOv4-P5 with TTA | 52.5 | No | Scaled-YOLOv4: Scaling Cross Stage Partial Network | 2020-11-16 | Code |
| 71 | Deformable DETR (ResNeXt-101+DCN) | 52.3 | No | Deformable DETR: Deformable Transformers for End... | 2020-10-08 | Code |
| 72 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) | 52.3 | No | Global Context Networks | 2020-12-24 | Code |
| 73 | PP-YOLOE-x(CSPRepResNet-x, 640x640, single-scale) | 52.2 | No | PP-YOLOE: An evolved version of YOLO | 2022-03-30 | Code |
| 74 | RetinaNet (SpineNet-190, 1280x1280) | 52.1 | No | SpineNet: Learning Scale-Permuted Backbone for R... | 2019-12-10 | Code |
| 75 | RepPoints v2 (ResNeXt-101, DCN, multi-scale) | 52.1 | No | RepPoints V2: Verification Meets Regression for ... | 2020-07-16 | Code |
| 76 | AC-FPN Cascade R-CNN (X-152-32x8d-FPN-IN5k, multi scale, only CEM) | 51.9 | No | Attention-guided Context Feature Pyramid Network... | 2020-05-23 | Code |
| 77 | OTA (ResNeXt-101+DCN, multiscale) | 51.5 | No | OTA: Optimal Transport Assignment for Object Det... | 2021-03-26 | Code |
| 78 | YOLOX-x(Modified CSP v5, 640x640, single-scale) | 51.5 | Yes | YOLOX: Exceeding YOLO Series in 2021 | 2021-07-18 | Code |
| 79 | PP-YOLOE-l(CSPRepResNet-l, 640x640, single-scale) | 51.4 | No | PP-YOLOE: An evolved version of YOLO | 2022-03-30 | Code |
| 80 | YOLOv7 (161 fps) | 51.4 | Yes | YOLOv7: Trainable bag-of-freebies sets new state... | 2022-07-06 | Code |
| 81 | UniverseNet-20.08d (Res2Net-101, DCN, single-scale) | 51.3 | No | USB: Universal-Scale Object Detection Benchmark | 2021-03-25 | Code |
| 82 | TSD(SENet154-DCN,multi-scale) | 51.2 | No | Revisiting the Sibling Head in Object Detector | 2020-03-17 | Code |
| 83 | YOLOX-X (Modified CSP v5) | 51.2 | No | YOLOX: Exceeding YOLO Series in 2021 | 2021-07-18 | Code |
| 84 | iBOT (ViT-B/16) | 51.2 | No | iBOT: Image BERT Pre-Training with Online Tokeni... | 2021-11-15 | Code |
| 85 | RetinaNet (SpineNet-143, 1280x1280) | 50.7 | No | SpineNet: Learning Scale-Permuted Backbone for R... | 2019-12-10 | Code |
| 86 | ATSS (ResNetXt-64x4d-101+DCN,multi-scale) | 50.7 | No | Bridging the Gap Between Anchor-based and Anchor... | 2019-12-05 | Code |
| 87 | NAS-FPN (AmoebaNet-D, learned aug) | 50.7 | No | Learning Data Augmentation Strategies for Object... | 2019-06-26 | Code |
| 88 | Boosting R-CNN* | 50.7 | No | Boosting R-CNN: Reweighting R-CNN Samples by RPN... | 2022-06-28 | Code |
| 89 | GFLV2 (Res2Net-101, DCN) | 50.6 | No | Generalized Focal Loss V2: Learning Reliable Loc... | 2020-11-25 | Code |
| 90 | aLRP Loss (ResNext-101-64x4d, DCN, multiscale test) | 50.2 | No | A Ranking-based, Balanced Loss Function Unifying... | 2020-09-28 | Code |
| 91 | FreeAnchor + SEPC (DCN, ResNext-101-64x4d) | 50.1 | No | Scale-Equalizing Pyramid Convolution for Object ... | 2020-05-06 | Code |
| 92 | D2Det (ResNet-101-DCN, multi-scale test) | 50.1 | No | - | - | Code |
| 93 | Dynamic R-CNN (ResNet-101-DCN, multi-scale) | 50.1 | No | Dynamic R-CNN: Towards High Quality Object Detec... | 2020-04-13 | Code |
| 94 | TSD(ResNet-101-Deformable, Image Pyramid) | 49.4 | No | Revisiting the Sibling Head in Object Detector | 2020-03-17 | Code |
| 95 | RepPoints v2 (ResNeXt-101, DCN) | 49.4 | No | RepPoints V2: Verification Meets Regression for ... | 2020-07-16 | Code |
| 96 | A2MIM (ViT-B) | 49.4 | No | Architecture-Agnostic Masked Image Modeling -- F... | 2022-05-27 | Code |
| 97 | iBOT (ViT-S/16) | 49.4 | No | iBOT: Image BERT Pre-Training with Online Tokeni... | 2021-11-15 | Code |
| 98 | CPNDet (Hourglass-104, multi-scale) | 49.2 | No | Corner Proposal Network for Anchor-free, Two-sta... | 2020-07-27 | Code |
| 99 | GFLV2 (ResNeXt-101, 32x4d, DCN) | 49 | No | Generalized Focal Loss V2: Learning Reliable Loc... | 2020-11-25 | Code |
| 100 | aLRP Loss (ResNext-101-64x4d, DCN, single scale) | 48.9 | No | A Ranking-based, Balanced Loss Function Unifying... | 2020-09-28 | Code |
| 101 | PP-YOLOE-m(CSPRepResNet-m, 640x640, single-scale) | 48.9 | No | PP-YOLOE: An evolved version of YOLO | 2022-03-30 | Code |
| 102 | UniverseNet-20.08 (Res2Net-50, DCN, single-scale) | 48.8 | No | USB: Universal-Scale Object Detection Benchmark | 2021-03-25 | Code |
| 103 | SOLQ (ResNet101, single scale) | 48.7 | No | SOLQ: Segmenting Objects by Learning Queries | 2021-06-04 | Code |
| 104 | RetinaNet (SpineNet-96, 1024x1024) | 48.6 | No | SpineNet: Learning Scale-Permuted Backbone for R... | 2019-12-10 | Code |
| 105 | TridentNet (ResNet-101-Deformable, Image Pyramid) | 48.4 | No | Scale-Aware Trident Networks for Object Detection | 2019-01-07 | Code |
| 106 | GCNet (ResNeXt-101 + DCN + cascade + GC r4) | 48.4 | No | GCNet: Non-local Networks Meet Squeeze-Excitatio... | 2019-04-25 | Code |
| 107 | GFLV2 (ResNet-101-DCN) | 48.3 | No | Generalized Focal Loss V2: Learning Reliable Loc... | 2020-11-25 | Code |
| 108 | Swin-S (RPE w/ GAB) | 48.23 | No | Understanding Gaussian Attention Bias of Vision ... | 2023-05-08 | Code |
| 109 | GFL (X-101-32x4d-DCN, single-scale) | 48.2 | No | Generalized Focal Loss: Learning Qualified and D... | 2020-06-08 | Code |
| 110 | ISTR (ResNet101-FPN-3x, single-scale) | 48.1 | No | ISTR: End-to-End Instance Segmentation with Tran... | 2021-05-03 | Code |
| 111 | YOLOX-Darknet53(Darknet53, 640x640, single-scale) | 48 | Yes | YOLOX: Exceeding YOLO Series in 2021 | 2021-07-18 | Code |
| 112 | DAT-S (RetinaNet) | 47.9 | No | Vision Transformer with Deformable Attention | 2022-01-03 | Code |
| 113 | aLRP Loss (ResNext-101-64x4d, single scale) | 47.8 | No | A Ranking-based, Balanced Loss Function Unifying... | 2020-09-28 | Code |
| 114 | MatrixNet Corners (ResNet-152, multi-scale) | 47.8 | No | Matrix Nets: A New Deep Architecture for Object ... | 2019-08-13 | Code |
| 115 | SOLQ (ResNet50, single scale) | 47.8 | No | SOLQ: Segmenting Objects by Learning Queries | 2021-06-04 | Code |
| 116 | DyHead (ResNeXt-64x4d-101) | 47.7 | No | Dynamic Head: Unifying Object Detection Heads wi... | 2021-06-15 | Code |
| 117 | SAPD (ResNeXt-101, single-scale) | 47.4 | No | Soft Anchor-Point Object Detection | 2019-11-27 | Code |
| 118 | PANet (ResNeXt-101, multi-scale) | 47.4 | No | Path Aggregation Network for Instance Segmentation | 2018-03-05 | Code |
| 119 | HTC (HRNetV2p-W48) | 47.3 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 120 | HTC (ResNeXt-101-FPN) | 47.1 | No | Hybrid Task Cascade for Instance Segmentation | 2019-01-22 | Code |
| 121 | CenterNet511 (Hourglass-104, multi-scale) | 47 | No | CenterNet: Keypoint Triplets for Object Detection | 2019-04-17 | Code |
| 122 | MAL (ResNeXt101, multi-scale) | 47 | No | Multiple Anchor Learning for Visual Object Detec... | 2019-12-04 | Code |
| 123 | ISTR (ResNet50-FPN-3x) | 46.8 | No | ISTR: End-to-End Instance Segmentation with Tran... | 2021-05-03 | Code |
| 124 | RetinaNet (SpineNet-49, 896x896) | 46.7 | No | SpineNet: Learning Scale-Permuted Backbone for R... | 2019-12-10 | Code |
| 125 | RPDet (ResNet-101-DCN, multi-scale) | 46.5 | No | RepPoints: Point Set Representation for Object D... | 2019-04-25 | Code |
| 126 | HoughNet (MS) | 46.4 | No | HoughNet: Integrating near and long-range eviden... | 2020-07-05 | Code |
| 127 | PPDet (ResNeXt-101-FPN, multiscale) | 46.3 | No | Reducing Label Noise in Anchor-Free Object Detec... | 2020-08-03 | Code |
| 128 | GFLV2 (ResNet-101) | 46.2 | No | Generalized Focal Loss V2: Learning Reliable Loc... | 2020-11-25 | Code |
| 129 | SNIPER (ResNet-101) | 46.1 | No | SNIPER: Efficient Multi-Scale Training | 2018-05-23 | Code |
| 130 | Mask R-CNN (HRNetV2p-W48 + cascade) | 46.1 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 131 | ResNeXt-64x4d-101 NAS-FCOS @128-256 w/improvements | 46.1 | No | NAS-FCOS: Fast Neural Architecture Search for Ob... | 2019-06-11 | Code |
| 132 | DCNv2 (ResNet-101, multi-scale) | 46 | No | Deformable ConvNets v2: More Deformable, Better ... | 2018-11-27 | Code |
| 133 | Gaussian-FCOS | 46 | No | Localization Uncertainty Estimation for Anchor-F... | 2020-06-28 | - |
| 134 | Cascade R-CNN-FPN (ResNet-101, map-guided) | 45.9 | No | InstaBoost: Boosting Instance Segmentation via P... | 2019-08-21 | Code |
| 135 | MAL (ResNeXt101, single-scale) | 45.9 | No | Multiple Anchor Learning for Visual Object Detec... | 2019-12-04 | Code |
| 136 | CenterMask+VoVNetV2-99 (single-scale) | 45.8 | No | CenterMask : Real-Time Anchor-Free Instance Segm... | 2019-11-15 | Code |
| 137 | D-RFCN + SNIP (DPN-98 with flip, multi-scale) | 45.7 | No | An Analysis of Scale Invariance in Object Detect... | 2017-11-22 | - |
| 138 | YOLOv4 (CD53) | 45.5 | Yes | Scaled-YOLOv4: Scaling Cross Stage Partial Network | 2020-11-16 | Code |
| 139 | AC-FPN Cascade R-CNN(ResNet-101, single scale) | 45 | No | Attention-guided Context Feature Pyramid Network... | 2020-05-23 | Code |
| 140 | FreeAnchor (ResNeXt-101) | 44.8 | No | FreeAnchor: Learning to Match Anchors for Visual... | 2019-09-05 | Code |
| 141 | FCOS (ResNeXt-64x4d-101-FPN 4 + improvements) | 44.7 | No | FCOS: Fully Convolutional One-Stage Object Detec... | 2019-04-02 | Code |
| 142 | CenterMask+VoVNet2-57 (single-scale) | 44.7 | No | CenterMask : Real-Time Anchor-Free Instance Segm... | 2019-11-15 | Code |
| 143 | FSAF (ResNeXt-101, multi-scale) | 44.6 | No | Feature Selective Anchor-Free Module for Single-... | 2019-03-02 | Code |
| 144 | aLRP Loss (ResNext-101, DCN, 500 scale) | 44.6 | No | A Ranking-based, Balanced Loss Function Unifying... | 2020-09-28 | Code |
| 145 | CenterMask + X-101-32x8d (single-scale) | 44.6 | No | CenterMask : Real-Time Anchor-Free Instance Segm... | 2019-11-15 | Code |
| 146 | RetinaNet (SpineNet-49, 640x640) | 44.3 | No | SpineNet: Learning Scale-Permuted Backbone for R... | 2019-12-10 | Code |
| 147 | YOLOF-DC5 | 44.3 | No | You Only Look One-level Feature | 2021-03-17 | Code |
| 148 | GFLV2 (ResNet-50) | 44.3 | No | Generalized Focal Loss V2: Learning Reliable Loc... | 2020-11-25 | Code |
| 149 | InterNet (ResNet-101-FPN, multi-scale) | 44.2 | No | Feature Intertwiner for Object Detection | 2019-03-28 | Code |
| 150 | M2Det (VGG-16, multi-scale) | 44.2 | No | M2Det: A Single-Shot Object Detector based on Mu... | 2018-11-12 | Code |
| 151 | Faster R-CNN (LIP-ResNet-101-MD w FPN) | 43.9 | No | LIP: Local Importance-based Pooling | 2019-08-12 | Code |
| 152 | M2Det (ResNet-101, multi-scale) | 43.9 | No | M2Det: A Single-Shot Object Detector based on Mu... | 2018-11-12 | Code |
| 153 | YOLOv3 @800 + ASFF* (Darknet-53) | 43.9 | Yes | Learning Spatial Fusion for Single-Shot Object D... | 2019-11-21 | Code |
| 154 | FoveaBox (ResNeXt-101) | 43.9 | No | FoveaBox: Beyond Anchor-based Object Detector | 2019-04-08 | Code |
| 155 | ExtremeNet (Hourglass-104, multi-scale) | 43.7 | No | Bottom-up Object Detection by Grouping Extreme a... | 2019-01-23 | Code |
| 156 | YOLOv4-608 | 43.5 | Yes | YOLOv4: Optimal Speed and Accuracy of Object Det... | 2020-04-23 | Code |
| 157 | SNIPER (ResNet-50) | 43.5 | No | SNIPER: Efficient Multi-Scale Training | 2018-05-23 | Code |
| 158 | CenterNet (HRNetV2-W48) | 43.5 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 159 | D-RFCN + SNIP (ResNet-101, multi-scale) | 43.4 | No | An Analysis of Scale Invariance in Object Detect... | 2017-11-22 | - |
| 160 | Grid R-CNN (ResNeXt-101-FPN) | 43.2 | No | Grid R-CNN | 2018-11-29 | Code |
| 161 | FCOS (ResNeXt-101-64x4d-FPN) | 43.2 | No | FCOS: Fully Convolutional One-Stage Object Detec... | 2019-04-02 | Code |
| 162 | CornerNet-Saccade (Hourglass-104, multi-scale) | 43.2 | No | CornerNet-Lite: Efficient Keypoint Based Object ... | 2019-04-18 | Code |
| 163 | PP-YOLOE-s(CSPRepResNet-s, 640x640, single-scale) | 43.1 | No | PP-YOLOE: An evolved version of YOLO | 2022-03-30 | Code |
| 164 | Libra R-CNN (ResNeXt-101-FPN) | 43 | No | Libra R-CNN: Towards Balanced Learning for Objec... | 2019-04-04 | Code |
| 165 | DyHead (ResNet-50) | 43 | No | Dynamic Head: Unifying Object Detection Heads wi... | 2021-06-15 | Code |
| 166 | RPDet (ResNet-101-DCN) | 42.8 | No | RepPoints: Point Set Representation for Object D... | 2019-04-25 | Code |
| 167 | SpineNet-49 (640, RetinaNet, single-scale) | 42.8 | No | SpineNet: Learning Scale-Permuted Backbone for R... | 2019-12-10 | Code |
| 168 | Cascade R-CNN (ResNet-101-FPN+, cascade) | 42.8 | No | Cascade R-CNN: Delving into High Quality Object ... | 2017-12-03 | Code |
| 169 | Cascade R-CNN | 42.8 | No | Cascade R-CNN: High Quality Object Detection and... | 2019-06-24 | Code |
| 170 | TridentNet (ResNet-101) | 42.7 | No | Scale-Aware Trident Networks for Object Detection | 2019-01-07 | Code |
| 171 | FCOS (ResNeXt-32x8d-101-FPN) | 42.7 | No | FCOS: Fully Convolutional One-Stage Object Detec... | 2019-04-02 | Code |
| 172 | RetinaMask (ResNeXt-101-FPN-GN) | 42.6 | No | RetinaMask: Learning to predict masks improves s... | 2019-01-10 | Code |
| 173 | TAL + TAP | 42.5 | No | TOOD: Task-aligned One-stage Object Detection | 2021-08-17 | Code |
| 174 | Faster R-CNN (HRNetV2p-W48) | 42.4 | No | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 175 | HSD (Rest101, 768x768, single-scale test) | 42.3 | No | - | - | Code |
| 176 | CornerNet511 (Hourglass-104, multi-scale) | 42.1 | No | CornerNet: Detecting Objects as Paired Keypoints | 2018-08-03 | Code |
| 177 | FoveaBox (ResNeXt-101) | 42.1 | No | FoveaBox: Beyond Anchor-based Object Detector | 2019-04-08 | Code |
| 178 | FCOS (HRNet-W32-5l) | 42 | No | FCOS: Fully Convolutional One-Stage Object Detec... | 2019-04-02 | Code |
| 179 | FoveaBox (ResNeXt-101) | 41.9 | No | FoveaBox: Beyond Anchor-based Object Detector | 2019-04-08 | Code |
| 180 | RefineDet512+ (ResNet-101) | 41.8 | No | Single-Shot Refinement Neural Network for Object... | 2017-11-18 | Code |
| 181 | GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101) | 41.6 | No | Gradient Harmonized Single-stage Detector | 2018-11-13 | Code |
| 182 | CenterNet-DLA (DLA-34, multi-scale) | 41.6 | No | Objects as Points | 2019-04-16 | Code |
| 183 | RetinaNet (SpineNet-49S, 640x640) | 41.5 | No | SpineNet: Learning Scale-Permuted Backbone for R... | 2019-12-10 | Code |
| 184 | RPDet (ResNet-101) | 41 | No | RepPoints: Point Set Representation for Object D... | 2019-04-25 | Code |
| 185 | M2Det (VGG-16, single-scale) | 41 | No | M2Det: A Single-Shot Object Detector based on Mu... | 2018-11-12 | Code |
| 186 | LeYOLO (Large@768) | 41 | No | LeYOLO, New Scalable and Efficient CNN Architect... | 2024-06-20 | Code |
| 187 | FSAF (ResNet-101, single-scale) | 40.9 | No | Feature Selective Anchor-Free Module for Single-... | 2019-03-02 | Code |
| 188 | RetinaNet (ResNeXt-101-FPN) | 40.8 | No | Focal Loss for Dense Object Detection | 2017-08-07 | Code |
| 189 | Cascade R-CNN (ResNet-50-FPN+, cascade) | 40.6 | No | Cascade R-CNN: Delving into High Quality Object ... | 2017-12-03 | Code |
| 190 | Faster R-CNN (Cascade RPN) | 40.6 | Yes | Cascade RPN: Delving into High-Quality Region Pr... | 2019-09-15 | Code |
| 191 | ResNet-50-DW-DPN (Deformable Kernels) | 40.6 | No | Deformable Kernels: Adapting Effective Receptive... | 2019-10-07 | Code |
| 192 | IoU-Net | 40.6 | No | Acquisition of Localization Confidence for Accur... | 2018-07-30 | Code |
| 193 | FCOS (HRNetV2p-W48) | 40.5 | Yes | Deep High-Resolution Representation Learning for... | 2019-08-20 | Code |
| 194 | ResNet-50-FPN Mask R-CNN + KL Loss + var voting + soft-NMS | 40.4 | No | Bounding Box Regression with Uncertainty for Acc... | 2018-09-23 | Code |
| 195 | RDSNet (ResNet-101, RetinaNet, mask, MBRM) | 40.3 | No | RDSNet: A New Deep Architecture for Reciprocal O... | 2019-12-11 | Code |
| 196 | ExtremeNet (Hourglass-104, single-scale) | 40.2 | No | Bottom-up Object Detection by Grouping Extreme a... | 2019-01-23 | Code |
| 197 | Mask R-CNN (ResNet-101-FPN, CBN) | 40.1 | No | Cross-Iteration Batch Normalization | 2020-02-13 | Code |
| 198 | Fast R-CNN (Cascade RPN) | 40.1 | Yes | Cascade RPN: Delving into High-Quality Region Pr... | 2019-09-15 | Code |
| 199 | Mask R-CNN (ResNeXt-101-FPN) | 39.8 | No | Mask R-CNN | 2017-03-20 | Code |
| 200 | GA-Faster-RCNN | 39.8 | No | Region Proposal by Guided Anchoring | 2019-01-10 | Code |
| 201 | ResNet-50 NAS-FCOS @256 | 39.8 | No | NAS-FCOS: Fast Neural Architecture Search for Ob... | 2019-06-11 | Code |
| 202 | A2MIM (ResNet-50 2x) | 39.8 | No | Architecture-Agnostic Masked Image Modeling -- F... | 2022-05-27 | Code |
| 203 | FPN (ResNet101 backbone) | 39.5 | No | ChainerCV: a Library for Deep Learning in Comput... | 2017-08-28 | Code |
| 204 | RetinaMask (ResNet-50-FPN) | 39.4 | No | RetinaMask: Learning to predict masks improves s... | 2019-01-10 | Code |
| 205 | LeYOLO (Medium@640) | 39.3 | No | LeYOLO, New Scalable and Efficient CNN Architect... | 2024-06-20 | Code |
| 206 | AA-ResNet-10 + RetinaNet | 39.2 | No | Attention Augmented Convolutional Networks | 2019-04-22 | Code |
| 207 | MAL (ResNet50, single-scale) | 39.2 | No | Multiple Anchor Learning for Visual Object Detec... | 2019-12-04 | Code |
| 208 | RetinaNet (ResNet-101-FPN) | 39.1 | No | Focal Loss for Dense Object Detection | 2017-08-07 | Code |
| 209 | Cascade R-CNN (ResNet-101-FPN+) | 38.8 | No | Cascade R-CNN: Delving into High Quality Object ... | 2017-12-03 | Code |
| 210 | M2Det (ResNet-101, single-scale) | 38.8 | No | M2Det: A Single-Shot Object Detector based on Mu... | 2018-11-12 | Code |
| 211 | SaccadeNet (DLA-34-DCN) | 38.5 | No | SaccadeNet: A Fast and Accurate Object Detector | 2020-03-26 | Code |
| 212 | Mask R-CNN (ResNet-101-FPN) | 38.2 | No | Mask R-CNN | 2017-03-20 | Code |
| 213 | LeYOLO (Small@640) | 38.2 | No | LeYOLO, New Scalable and Efficient CNN Architect... | 2024-06-20 | Code |
| 214 | WSMA-Seg | 38.1 | No | Segmentation is All You Need | 2019-04-30 | - |
| 215 | Faster R-CNN + FPN + CGD | 37.9 | No | Compact Global Descriptor for Neural Networks | 2019-07-23 | Code |
| 216 | CornerNet511 (Hourglass-52, single-scale) | 37.8 | No | CornerNet: Detecting Objects as Paired Keypoints | 2018-08-03 | Code |
| 217 | RefineDet512+ (VGG-16) | 37.6 | No | Single-Shot Refinement Neural Network for Object... | 2017-11-18 | Code |
| 218 | DeformConv-R-FCN (Aligned-Inception-ResNet) | 37.5 | No | Deformable Convolutional Networks | 2017-03-17 | Code |
| 219 | Faster R-CNN (ImageNet+300M) | 37.4 | No | Revisiting Unreasonable Effectiveness of Data in... | 2017-07-10 | Code |
| 220 | Mask R-CNN (Bottleneck-injected ResNet-50, FPN) | 36.9 | No | torchdistill: A Modular, Configuration-Driven Fr... | 2020-11-25 | Code |
| 221 | Faster R-CNN + TDM | 36.8 | No | Beyond Skip Connections: Top-Down Modulation for... | 2016-12-20 | Code |
| 222 | Cascade R-CNN (ResNet-50-FPN+) | 36.5 | No | Cascade R-CNN: Delving into High Quality Object ... | 2017-12-03 | Code |
| 223 | RefineDet512 (ResNet-101) | 36.4 | No | Single-Shot Refinement Neural Network for Object... | 2017-11-18 | Code |
| 224 | Faster R-CNN + FPN | 36.2 | Yes | Feature Pyramid Networks for Object Detection | 2016-12-09 | Code |
| 225 | Faster R-CNN (Bottleneck-injected ResNet-50 and FPN) | 35.9 | No | torchdistill: A Modular, Configuration-Driven Fr... | 2020-11-25 | Code |