TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Methodology/3D/COCO test-dev

3D on COCO test-dev

Metric: AP50 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕AP50▼AugmentationsPaperDate↕Code
1ViTPose (ViTAE-G, ensemble)95NoViTPose: Simple Vision Transformer Baselines for...2022-04-26Code
2ViTPose (ViTAE-G)94.8NoViTPose: Simple Vision Transformer Baselines for...2022-04-26Code
34xRSN-50 (ensemble)94.4NoLearning Delicate Local Representations for Mult...2020-03-09Code
44xRSN-5094.3NoLearning Delicate Local Representations for Mult...2020-03-09Code
5CCM+93.8NoTowards High Performance Human Keypoint Detection2020-02-03Code
6UDP-Pose-PSA(384x288)93.6NoPolarized Self-Attention: Towards High-quality P...2021-07-02Code
7UDP-Pose-PSA(256x192)93.6NoPolarized Self-Attention: Towards High-quality P...2021-07-02Code
8SCIO (HRNet-48)93.5NoSelf-Constrained Inference Optimization on Struc...2022-07-06-
9MSPN93.4NoRethinking on Multi-Stage Networks for Human Pos...2019-01-01Code
10MSPN93.4NoRethinking on Multi-Stage Networks for Human Pos...2019-01-01Code
11HRNet-W48 + extra data92.7NoDeep High-Resolution Representation Learning for...2019-02-25Code
12HRNet-W48+UDP92.7NoThe Devil is in the Details: Delving into Unbias...2019-11-18Code
13HRFormer-B92.7NoHRFormer: High-Resolution Transformer for Dense ...2021-10-18Code
14HRNet*92.7NoDeep High-Resolution Representation Learning for...2019-02-25Code
15HRNet-W48+DARK92.6NoDistribution-Aware Coordinate Representation for...2019-10-14Code
16OmniPose (WASPv2)92.6NoOmniPose: A Multi-Scale Framework for Multi-Pers...2021-03-18Code
17EvoPose2D-L92.5NoEvoPose2D: Pushing the Boundaries of 2D Human Po...2020-11-17Code
18HRNet92.5NoDeep High-Resolution Representation Learning for...2019-02-25Code
19MIPNet92.4NoMulti-Instance Pose Networks: Rethinking Top-Dow...2021-01-27Code
20Simple Base+*92.4NoSimple Baselines for Human Pose Estimation and T...2018-04-17Code
21TransPose-H-A692.2NoTransPose: Keypoint Localization via Transformer2020-12-28Code
22PoseBH-H91.9YesPoseBH: Prototypical Multi-Dataset Training Beyo...2025-05-23Code
23DPIT-L91.9NoDPIT: Dual-Pipeline Integrated Transformer for H...2022-09-02-
24Flow-based (ResNet-152)91.9NoSimple Baselines for Human Pose Estimation and T...2018-04-17Code
25Simple Base91.9NoSimple Baselines for Human Pose Estimation and T...2018-04-17Code
26S-ViPNAS-HRNetW3291.7NoViPNAS: Efficient Video Pose Estimation via Neur...2021-05-21Code
27CPN+ [6, 9]91.7NoCascaded Pyramid Network for Multi-Person Pose E...2017-11-20Code
28CPN+91.7NoCascaded Pyramid Network for Multi-Person Pose E...2017-11-20Code
29CPN91.4NoCascaded Pyramid Network for Multi-Person Pose E...2017-11-20Code
30CPN91.4NoCascaded Pyramid Network for Multi-Person Pose E...2017-11-20Code
31PoseFix91.2NoPoseFix: Model-agnostic General Human Pose Refin...2018-12-10Code
32KAPAO-L91.2NoRethinking Keypoint Representations: Modeling Ke...2021-11-16Code
33TFPose (ND=6 ResNet-50)90.9NoTFPose: Direct Human Pose Estimation with Transf...2021-03-29-
34Dite-HRNet-3090.8NoDite-HRNet: Dynamic Lightweight High-Resolution ...2022-04-22Code
35S-ViPNAS-Res5090.7NoViPNAS: Efficient Video Pose Estimation via Neur...2021-05-21Code
36Lite-HRNet-3090.7YesLite-HRNet: A Lightweight High-Resolution Network2021-04-13Code
37KAPAO-M90.5NoRethinking Keypoint Representations: Modeling Ke...2021-11-16Code
38PPE (ResNeXt-101)90.3NoDeep Multi-Task Networks For Occluded Pedestrian...2022-06-15-
39yolopose90.3NoYOLO-Pose: Enhancing YOLO for Multi Person Pose ...2022-04-14Code
40HigherHRNet (ScaleNet_P4)90.3NoScaleNAS: One-Shot Learning of Scale-Aware Repre...2020-11-30-
41SMPR (HR-Net-32)89.7NoSMPR: Single-Stage Multi-Person Pose Regression2020-06-28Code
42Lite-HRNet-1889.4NoLite-HRNet: A Lightweight High-Resolution Network2021-04-13Code
43HigherHRNet (HR-Net-48)89.3NoHigherHRNet: Scale-Aware Representation Learning...2019-08-27Code
44RMPE++89.2NoRMPE: Regional Multi-person Pose Estimation2016-12-01Code
45PersonLab89NoPersonLab: Person Pose Estimation and Instance S...2018-03-22Code
46SPM88.5NoSingle-Stage Multi-Person Pose Machines2019-08-24Code
47KAPAO-S88.4NoRethinking Keypoint Representations: Modeling Ke...2021-11-16Code
48DirectPose (ResNet-101)87.8NoDirectPose: Direct End-to-End Multi-Person Pose ...2019-11-18Code
49Mask-RCNN87.3NoMask R-CNN2017-03-20Code
50Mask R-CNN87.3NoMask R-CNN2017-03-20Code
51AE86.8NoAssociative Embedding: End-to-End Learning for J...2016-11-16Code
52DirectPose (ResNet-101)86.7NoDirectPose: Direct End-to-End Multi-Person Pose ...2019-11-18Code
53OpenPose86.2YesOpenPose: Realtime Multi-Person 2D Pose Estimati...2018-12-18Code
54Faster R-CNN (ImageNet+300M)85.7YesRevisiting Unreasonable Effectiveness of Data in...2017-07-10Code
55G-RMI85.5NoTowards Accurate Multi-person Pose Estimation in...2017-01-06-
56G-RMI85.5NoTowards Accurate Multi-person Pose Estimation in...2017-01-06-
57G-RMI85.5NoTowards Accurate Multi-person Pose Estimation in...2017-01-06-
58CMU-Pose84.9NoRealtime Multi-Person 2D Pose Estimation using P...2016-11-24Code
59CMU Pose84.9NoRealtime Multi-Person 2D Pose Estimation using P...2016-11-24Code
60CMU-Pose84.9NoRealtime Multi-Person 2D Pose Estimation using P...2016-11-24Code
61RMPE83.7NoRMPE: Regional Multi-person Pose Estimation2016-12-01Code
62RMPE83.7NoRMPE: Regional Multi-person Pose Estimation2016-12-01Code
63Plain-DETR (Swin-L)82.1No--Code
64EVA81.9NoEVA: Exploring the Limits of Masked Visual Repre...2022-11-14Code
65Group DETR v281.8NoGroup DETR v2: Strong Object Detector with Encod...2022-11-07-
66Focal-Stable-DINO (Focal-Huge, no TTA)81.7NoA Strong and Reproducible Object Detector with O...2023-04-25Code
67Relation-DETR (Focal-L)80.8NoRelation DETR: Exploring Explicit Position Relat...2024-07-16Code
68DETA (Swin-L)80.4NoNMS Strikes Back2022-12-12Code
69GLIP (Swin-L, multi-scale)79.5NoGrounded Language-Image Pre-training2021-12-07Code
70PIIP-H6B (DINO)79NoParameter-Inverted Image Pyramid Networks2024-06-06Code
71DyHead (Swin-L, multi scale, self-training)78.5NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
72DyHead (Swin-L, multi scale)77.1NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
73QueryInst (single-scale)75.9NoInstances as Queries2021-05-05Code
74SOLQ (Swin-L, single scale)74.6NoSOLQ: Segmenting Objects by Learning Queries2021-06-04Code
75DetectoRS (ResNeXt-101-64x4d, multi-scale)74.2NoDetectoRS: Detecting Objects with Recursive Feat...2020-06-03Code
76CenterNet2 (Res2Net-101-DCN-BiFPN, self-training, 1560 single-scale)74NoProbabilistic two-stage detection2021-03-12Code
77PyCenterNet (Swin-L, multi-scale)73.7NoCenterNet++ for Object Detection2022-04-18Code
78DetectoRS (ResNeXt-101-32x4d, multi-scale)73.5NoDetectoRS: Detecting Objects with Recursive Feat...2020-06-03Code
79YOLOR-D6 (1280, single-scale, 30 fps)73.3NoYou Only Learn One Representation: Unified Netwo...2021-05-10Code
80YOLOv4-P7 with TTA73.2NoScaled-YOLOv4: Scaling Cross Stage Partial Network2020-11-16Code
81YOLOv4-P6 with TTA72.6NoScaled-YOLOv4: Scaling Cross Stage Partial Network2020-11-16Code
82YOLOv4-P6 CSP-P6 (single-scale, 32 fps)72.3NoScaled-YOLOv4: Scaling Cross Stage Partial Network2020-11-16Code
83DyHead (ResNeXt-64x4d-101-DCN, multi scale)72.1NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
84ResNeSt-200 (multi-scale)72NoResNeSt: Split-Attention Networks2020-04-19Code
85Cascade Mask R-CNN (Triple-ResNeXt152, multi-scale)71.9NoCBNet: A Novel Composite Backbone Network Archit...2019-09-09Code
86Deformable DETR (ResNeXt-101+DCN)71.9NoDeformable DETR: Deformable Transformers for End...2020-10-08Code
87TSD(SENet154-DCN,multi-scale)71.9NoRevisiting the Sibling Head in Object Detector2020-03-17Code
88RetinaNet (SpineNet-190, 1280x1280)71.8NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
89UniverseNet-20.08d (Res2Net-101, DCN, multi-scale)71.6NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
90PAA (ResNext-152-32x8d + DCN, multi-scale)71.6NoProbabilistic Anchor Assignment with IoU Predict...2020-07-16Code
91DetectoRS (ResNeXt-101-32x4d, single-scale)71.6NoDetectoRS: Detecting Objects with Recursive Feat...2020-06-03Code
92EfficientDet-D7 (1536)71.6YesEfficientDet: Scalable and Efficient Object Dete...2019-11-20Code
93LSNet (Res2Net-101+ DCN, multi-scale)71.1NoLocation-Sensitive Visual Recognition with Cross...2021-04-11Code
94GFLV2 (Res2Net-101, DCN, multiscale)70.9NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
95GCNet (ResNeXt-101 + DCN + cascade + GC r4)70.9NoGlobal Context Networks2020-12-24Code
96AC-FPN Cascade R-CNN (X-152-32x8d-FPN-IN5k, multi scale, only CEM)70.4NoAttention-guided Context Feature Pyramid Network...2020-05-23Code
97RetinaNet (SpineNet-143, 1280x1280)70.4NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
98YOLOv4-P5 with TTA70.3NoScaled-YOLOv4: Scaling Cross Stage Partial Network2020-11-16Code
99aLRP Loss (ResNext-101-64x4d, DCN, multiscale test)70.3NoA Ranking-based, Balanced Loss Function Unifying...2020-09-28Code
100RepPoints v2 (ResNeXt-101, DCN, multi-scale)70.1NoRepPoints V2: Verification Meets Regression for ...2020-07-16Code
101UniverseNet-20.08d (Res2Net-101, DCN, single-scale)70NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
102PP-YOLOE-x(CSPRepResNet-x, 640x640, single-scale )69.9NoPP-YOLOE: An evolved version of YOLO2022-03-30Code
103FreeAnchor + SEPC (DCN, ResNext-101-64x4d)69.8NoScale-Equalizing Pyramid Convolution for Object ...2020-05-06Code
104TridentNet (ResNet-101-Deformable, Image Pyramid)69.7NoScale-Aware Trident Networks for Object Detection2019-01-07Code
105YOLOX-X (Modified CSP v5)69.6NoYOLOX: Exceeding YOLO Series in 20212021-07-18Code
106TSD(ResNet-101-Deformable, Image Pyramid)69.6NoRevisiting the Sibling Head in Object Detector2020-03-17Code
107DAT-S (RetinaNet)69.6NoVision Transformer with Deformable Attention2022-01-03Code
108D2Det (ResNet-101-DCN, multi-scale test)69.4No--Code
109aLRP Loss (ResNext-101-64x4d, DCN, single scale)69.3NoA Ranking-based, Balanced Loss Function Unifying...2020-09-28Code
110GFLV2 (Res2Net-101, DCN)69NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
111PP-YOLOE-l(CSPRepResNet-l, 640x640, single-scale )68.9NoPP-YOLOE: An evolved version of YOLO2022-03-30Code
112ATSS (ResNetXt-64x4d-101+DCN,multi-scale)68.9NoBridging the Gap Between Anchor-based and Anchor...2019-12-05Code
113RepPoints v2 (ResNeXt-101, DCN)68.9NoRepPoints V2: Verification Meets Regression for ...2020-07-16Code
114OTA (ResNeXt-101+DCN, multiscale)68.6NoOTA: Optimal Transport Assignment for Object Det...2021-03-26Code
115RetinaNet (SpineNet-96, 1024x1024)68.4NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
116aLRP Loss (ResNext-101-64x4d, single scale)68.4NoA Ranking-based, Balanced Loss Function Unifying...2020-09-28Code
117Dynamic R-CNN (ResNet-101-DCN, multi-scale)68.3NoDynamic R-CNN: Towards High Quality Object Detec...2020-04-13Code
118DCNv2 (ResNet-101, multi-scale)67.9NoDeformable ConvNets v2: More Deformable, Better ...2018-11-27Code
119GFLV2 (ResNeXt-101, 32x4d, DCN)67.6NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
120GCNet (ResNeXt-101 + DCN + cascade + GC r4)67.6NoGCNet: Non-local Networks Meet Squeeze-Excitatio...2019-04-25Code
121UniverseNet-20.08 (Res2Net-50, DCN, single-scale)67.5NoUSB: Universal-Scale Object Detection Benchmark2021-03-25Code
122InterNet (ResNet-101-FPN, multi-scale)67.5NoFeature Intertwiner for Object Detection2019-03-28Code
123GFL (X-101-32x4d-DCN, single-scale)67.4NoGeneralized Focal Loss: Learning Qualified and D...2020-06-08Code
124SAPD (ResNeXt-101, single-scale)67.4NoSoft Anchor-Point Object Detection2019-11-27Code
125RPDet (ResNet-101-DCN, multi-scale)67.4NoRepPoints: Point Set Representation for Object D...2019-04-25Code
126CPNDet (Hourglass-104, multi-scale)67.3NoCorner Proposal Network for Anchor-free, Two-sta...2020-07-27Code
127D-RFCN + SNIP (DPN-98 with flip, multi-scale)67.3NoAn Analysis of Scale Invariance in Object Detect...2017-11-22-
128PANet (ResNeXt-101, multi-scale)67.2NoPath Aggregation Network for Instance Segmentation2018-03-05Code
129SNIPER (ResNet-101)67NoSNIPER: Efficient Multi-Scale Training2018-05-23Code
130PP-YOLOE-m(CSPRepResNet-m, 640x640, single-scale )66.5NoPP-YOLOE: An evolved version of YOLO2022-03-30Code
131GFLV2 (ResNet-101-DCN)66.5NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
132RetinaNet (SpineNet-49, 896x896)66.3NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
133MatrixNet Corners (ResNet-152, multi-scale)66.2NoMatrix Nets: A New Deep Architecture for Object ...2019-08-13Code
134HTC (HRNetV2p-W48)65.9NoDeep High-Resolution Representation Learning for...2019-08-20Code
135DyHead (ResNeXt-64x4d-101)65.7NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
136Faster R-CNN (LIP-ResNet-101-MD w FPN)65.7NoLIP: Local Importance-based Pooling2019-08-12Code
137YOLOv4-60865.7YesYOLOv4: Optimal Speed and Accuracy of Object Det...2020-04-23Code
138D-RFCN + SNIP (ResNet-101, multi-scale)65.5NoAn Analysis of Scale Invariance in Object Detect...2017-11-22-
139FSAF (ResNeXt-101, multi-scale)65.2NoFeature Selective Anchor-Free Module for Single-...2019-03-02Code
140HoughNet (MS)65.1NoHoughNet: Integrating near and long-range eviden...2020-07-05Code
141aLRP Loss (ResNext-101, DCN, 500 scale)65NoA Ranking-based, Balanced Loss Function Unifying...2020-09-28Code
142SNIPER (ResNet-50)65NoSNIPER: Efficient Multi-Scale Training2018-05-23Code
143RPDet (ResNet-101-DCN)65NoRepPoints: Point Set Representation for Object D...2019-04-25Code
144PPDet (ResNeXt-101-FPN, multiscale)64.8NoReducing Label Noise in Anchor-Free Object Detec...2020-08-03Code
145M2Det (VGG-16, multi-scale)64.6NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
146CenterNet511 (Hourglass-104, multi-scale)64.5NoCenterNet: Keypoint Triplets for Object Detection2019-04-17Code
147CenterMask+VoVNetV2-99 (single-scale)64.5NoCenterMask : Real-Time Anchor-Free Instance Segm...2019-11-15Code
148AC-FPN Cascade R-CNN(ResNet-101, single scale)64.4NoAttention-guided Context Feature Pyramid Network...2020-05-23Code
149M2Det (ResNet-101, multi-scale)64.4NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
150GFLV2 (ResNet-101)64.3NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
151FreeAnchor (ResNeXt-101)64.3NoFreeAnchor: Learning to Match Anchors for Visual...2019-09-05Code
152Cascade R-CNN-FPN (ResNet-101, map-guided)64.2NoInstaBoost: Boosting Instance Segmentation via P...2019-08-21Code
153YOLOv4 (CD53)64.1YesScaled-YOLOv4: Scaling Cross Stage Partial Network2020-11-16Code
154FCOS (ResNeXt-64x4d-101-FPN 4 + improvements)64.1NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
155YOLOv3 @800 + ASFF* (Darknet-53)64.1YesLearning Spatial Fusion for Single-Shot Object D...2019-11-21Code
156Mask R-CNN (HRNetV2p-W48 + cascade)64NoDeep High-Resolution Representation Learning for...2019-08-20Code
157Libra R-CNN (ResNeXt-101-FPN)64NoLibra R-CNN: Towards Balanced Learning for Objec...2019-04-04Code
158HTC (ResNeXt-101-FPN)63.9NoHybrid Task Cascade for Instance Segmentation2019-01-22Code
159RetinaNet (SpineNet-49, 640x640)63.8NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
160TridentNet (ResNet-101)63.6NoScale-Aware Trident Networks for Object Detection2019-01-07Code
161Faster R-CNN (HRNetV2p-W48)63.6NoDeep High-Resolution Representation Learning for...2019-08-20Code
162FoveaBox (ResNeXt-101)63.5NoFoveaBox: Beyond Anchor-based Object Detector2019-04-08Code
163CenterMask + X-101-32x8d (single-scale)63.4NoCenterMask : Real-Time Anchor-Free Instance Segm...2019-11-15Code
164CenterMask+VoVNet2-57 (single-scale)63.1NoCenterMask : Real-Time Anchor-Free Instance Segm...2019-11-15Code
165Grid R-CNN (ResNeXt-101-FPN)63NoGrid R-CNN2018-11-29Code
166YOLOF-DC562.9NoYou Only Look One-level Feature2021-03-17Code
167RefineDet512+ (ResNet-101)62.9NoSingle-Shot Refinement Neural Network for Object...2017-11-18Code
168RPDet (ResNet-101)62.9NoRepPoints: Point Set Representation for Object D...2019-04-25Code
169FCOS (ResNeXt-101-64x4d-FPN)62.8NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
170GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)62.8NoGradient Harmonized Single-stage Detector2018-11-13Code
171RetinaMask (ResNeXt-101-FPN-GN)62.5NoRetinaMask: Learning to predict masks improves s...2019-01-10Code
172GFLV2 (ResNet-50)62.3NoGeneralized Focal Loss V2: Learning Reliable Loc...2020-11-25Code
173SpineNet-49 (640, RetinaNet, single-scale)62.3NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
174Mask R-CNN (ResNeXt-101-FPN)62.3NoMask R-CNN2017-03-20Code
175FCOS (ResNeXt-32x8d-101-FPN)62.2NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
176Cascade R-CNN (ResNet-101-FPN+, cascade)62.1NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
177Cascade R-CNN62.1NoCascade R-CNN: High Quality Object Detection and...2019-06-24Code
178FSAF (ResNet-101, single-scale)61.5NoFeature Selective Anchor-Free Module for Single-...2019-03-02Code
179HSD (Rest101, 768x768, single-scale test)61.2No--Code
180RetinaNet (ResNeXt-101-FPN)61.1NoFocal Loss for Dense Object Detection2017-08-07Code
181Cascade R-CNN (ResNet-101-FPN+)61.1NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
182DyHead (ResNet-50)60.7NoDynamic Head: Unifying Object Detection Heads wi...2021-06-15Code
183ExtremeNet (Hourglass-104, multi-scale)60.5NoBottom-up Object Detection by Grouping Extreme a...2019-01-23Code
184PP-YOLOE-s(CSPRepResNet-s, 640x640, single-scale )60.5NoPP-YOLOE: An evolved version of YOLO2022-03-30Code
185RetinaNet (SpineNet-49S, 640x640)60.5NoSpineNet: Learning Scale-Permuted Backbone for R...2019-12-10Code
186Mask R-CNN (ResNet-101-FPN, CBN)60.5NoCross-Iteration Batch Normalization2020-02-13Code
187FCOS (HRNet-W32-5l)60.4NoFCOS: Fully Convolutional One-Stage Object Detec...2019-04-02Code
188TAL + TAP60.3NoTOOD: Task-aligned One-stage Object Detection2021-08-17Code
189Mask R-CNN (ResNet-101-FPN)60.3NoMask R-CNN2017-03-20Code
190RDSNet (ResNet-101, RetinaNet, mask, MBRM)60.1NoRDSNet: A New Deep Architecture for Reciprocal O...2019-12-11Code
191Cascade R-CNN (ResNet-50-FPN+, cascade)59.9NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
192M2Det (VGG-16, single-scale)59.7NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
193Fast R-CNN (Cascade RPN)59.4YesCascade RPN: Delving into High-Quality Region Pr...2019-09-15Code
194M2Det (ResNet-101, single-scale)59.4NoM2Det: A Single-Shot Object Detector based on Mu...2018-11-12Code
195FCOS (HRNetV2p-W48)59.3YesDeep High-Resolution Representation Learning for...2019-08-20Code
196GA-Faster-RCNN59.2NoRegion Proposal by Guided Anchoring2019-01-10Code
197RetinaNet (ResNet-101-FPN)59.1NoFocal Loss for Dense Object Detection2017-08-07Code
198Cascade R-CNN (ResNet-50-FPN+)59NoCascade R-CNN: Delving into High Quality Object ...2017-12-03Code
199Faster R-CNN (Cascade RPN)58.9YesCascade RPN: Delving into High-Quality Region Pr...2019-09-15Code
200RefineDet512+ (VGG-16)58.7NoSingle-Shot Refinement Neural Network for Object...2017-11-18Code
201RetinaMask (ResNet-50-FPN)58.6NoRetinaMask: Learning to predict masks improves s...2019-01-10Code
202DeformConv-R-FCN (Aligned-Inception-ResNet)58NoDeformable Convolutional Networks2017-03-17Code
203Faster R-CNN (ImageNet+300M)58NoRevisiting Unreasonable Effectiveness of Data in...2017-07-10Code
204CornerNet511 (Hourglass-104, multi-scale)57.8NoCornerNet: Detecting Objects as Paired Keypoints2018-08-03Code
205RefineDet512 (ResNet-101)57.5NoSingle-Shot Refinement Neural Network for Object...2017-11-18Code
206SaccadeNet (DLA-34-DCN)55.6NoSaccadeNet: A Fast and Accurate Object Detector2020-03-26Code
207ExtremeNet (Hourglass-104, single-scale)55.5NoBottom-up Object Detection by Grouping Extreme a...2019-01-23Code
208CornerNet511 (Hourglass-52, single-scale)53.7NoCornerNet: Detecting Objects as Paired Keypoints2018-08-03Code
209wetectron(single-model, VGG16)24.8NoInstance-aware, Context-focused, and Memory-effi...2020-04-09Code
210WSGARN+SSD13.6NoWeakly Supervised Object Discovery by Generative...2017-11-22-
211WCCN12.3NoWeakly Supervised Cascaded Convolutional Networks2016-11-24-
212WSDDN11.5NoWeakly Supervised Deep Detection Networks2015-11-09Code
213PCT (256x256)9NoHuman Pose as Compositional Tokens2023-03-21Code