3D on COCO test-dev

Metric: APM (higher is better)

LeaderboardDataset

Loading chart...

Results

Hide augmentations

Sort:

#	Model↕	APM▼	Augmentations	Paper	Date↕	Code
1	4xRSN-50 (ensemble)	83.8	No	Learning Delicate Local Representations for Mult...	2020-03-09	Code
2	4xRSN-50	83.3	No	Learning Delicate Local Representations for Mult...	2020-03-09	Code
3	HRNet-W48+UDP	82.4	No	The Devil is in the Details: Delving into Unbias...	2019-11-18	Code
4	PPE (ResNeXt-101)	80.7	No	Deep Multi-Task Networks For Occluded Pedestrian...	2022-06-15	-
5	ViTPose (ViTAE-G, ensemble)	77.8	No	ViTPose: Simple Vision Transformer Baselines for...	2022-04-26	Code
6	ViTPose (ViTAE-G)	77.5	No	ViTPose: Simple Vision Transformer Baselines for...	2022-04-26	Code
7	UDP-Pose-PSA(384x288)	76.3	No	Polarized Self-Attention: Towards High-quality P...	2021-07-02	Code
8	UDP-Pose-PSA(256x192)	76.1	No	Polarized Self-Attention: Towards High-quality P...	2021-07-02	Code
9	PoseBH-H	75.9	Yes	PoseBH: Prototypical Multi-Dataset Training Beyo...	2025-05-23	Code
10	CCM+	75	No	Towards High Performance Human Keypoint Detection	2020-02-03	Code
11	SCIO (HRNet-48)	74.1	No	Self-Constrained Inference Optimization on Struc...	2022-07-06	-
12	HRNet-W48+DARK	73.6	No	Distribution-Aware Coordinate Representation for...	2019-10-14	Code
13	EvoPose2D-L	73.5	No	EvoPose2D: Pushing the Boundaries of 2D Human Po...	2020-11-17	Code
14	HRNet-W48 + extra data	73.4	No	Deep High-Resolution Representation Learning for...	2019-02-25	Code
15	HRNet*	73.4	No	Deep High-Resolution Representation Learning for...	2019-02-25	Code
16	Simple Base+*	73	No	Simple Baselines for Human Pose Estimation and T...	2018-04-17	Code
17	OmniPose (WASPv2)	72.6	No	OmniPose: A Multi-Scale Framework for Multi-Pers...	2021-03-18	Code
18	HRFormer-B	72.5	No	HRFormer: High-Resolution Transformer for Dense ...	2021-10-18	Code
19	MSPN	72.3	No	Rethinking on Multi-Stage Networks for Human Pos...	2019-01-01	Code
20	MSPN	72.3	No	Rethinking on Multi-Stage Networks for Human Pos...	2019-01-01	Code
21	HRNet	71.9	No	Deep High-Resolution Representation Learning for...	2019-02-25	Code
22	MIPNet	71.4	No	Multi-Instance Pose Networks: Rethinking Top-Dow...	2021-01-27	Code
23	TransPose-H-A6	71.3	No	TransPose: Keypoint Localization via Transformer	2020-12-28	Code
24	DPIT-L	71.3	No	DPIT: Dual-Pipeline Integrated Transformer for H...	2022-09-02	-
25	PoseFix	71.1	No	PoseFix: Model-agnostic General Human Pose Refin...	2018-12-10	Code
26	S-ViPNAS-HRNetW32	70.5	No	ViPNAS: Efficient Video Pose Estimation via Neur...	2021-05-21	Code
27	Flow-based (ResNet-152)	70.3	No	Simple Baselines for Human Pose Estimation and T...	2018-04-17	Code
28	Simple Base	70.3	No	Simple Baselines for Human Pose Estimation and T...	2018-04-17	Code
29	CPN+	69.5	No	Cascaded Pyramid Network for Multi-Person Pose E...	2017-11-20	Code
30	TFPose (ND=6 ResNet-50)	69.1	No	TFPose: Direct Human Pose Estimation with Transf...	2021-03-29	-
31	CPN	68.7	No	Cascaded Pyramid Network for Multi-Person Pose E...	2017-11-20	Code
32	RMPE++	68	No	RMPE: Regional Multi-person Pose Estimation	2016-12-01	Code
33	EVA	67.7	No	EVA: Exploring the Limits of Masked Visual Repre...	2022-11-14	Code
34	Focal-Stable-DINO (Focal-Huge, no TTA)	67.6	No	A Strong and Reproducible Object Detector with O...	2023-04-25	Code
35	HigherHRNet (ScaleNet_P4)	67.5	No	ScaleNAS: One-Shot Learning of Scale-Aware Repre...	2020-11-30	-
36	Dite-HRNet-30	67.4	No	Dite-HRNet: Dynamic Lightweight High-Resolution ...	2022-04-22	Code
37	S-ViPNAS-Res50	67.3	No	ViPNAS: Efficient Video Pose Estimation via Neur...	2021-05-21	Code
38	Group DETR v2	67.2	No	Group DETR v2: Strong Object Detector with Encod...	2022-11-07	-
39	OpenPifPaf	67.1	No	OpenPifPaf: Composite Fields for Semantic Keypoi...	2021-03-03	Code
40	Relation-DETR (Focal-L)	66.9	No	Relation DETR: Exploring Explicit Position Relat...	2024-07-16	Code
41	DETA (Swin-L)	66.9	No	NMS Strikes Back	2022-12-12	Code
42	Lite-HRNet-30	66.9	Yes	Lite-HRNet: A Lightweight High-Resolution Network	2021-04-13	Code
43	Plain-DETR (Swin-L)	66.8	No	-	-	Code
44	Simple Pose	66.8	Yes	Simple Pose: Rethinking and Improving a Bottom-u...	2019-11-24	Code
45	Simple Pose	66.8	No	Simple Pose: Rethinking and Improving a Bottom-u...	2019-11-24	Code
46	Identity Mapping Hourglass	66.8	No	Simple Pose: Rethinking and Improving a Bottom-u...	2019-11-24	Code
47	HigherHRNet (HR-Net-48)	66.6	No	HigherHRNet: Scale-Aware Representation Learning...	2019-08-27	Code
48	KAPAO-L	66.3	No	Rethinking Keypoint Representations: Modeling Ke...	2021-11-16	Code
49	SMPR (HR-Net-32)	65.9	No	SMPR: Single-Stage Multi-Person Pose Regression	2020-06-28	Code
50	GLIP (Swin-L, multi-scale)	64.9	No	Grounded Language-Image Pre-training	2021-12-07	Code
51	KAPAO-M	64.3	No	Rethinking Keypoint Representations: Modeling Ke...	2021-11-16	Code
52	PersonLab	64.1	No	PersonLab: Person Pose Estimation and Instance S...	2018-03-22	Code
53	DyHead (Swin-L, multi scale, self-training)	64	No	Dynamic Head: Unifying Object Detection Heads wi...	2021-06-15	Code
54	Lite-HRNet-18	64	No	Lite-HRNet: A Lightweight High-Resolution Network	2021-04-13	Code
55	Hourglass-104	63.3	No	Greedy Offset-Guided Keypoint Grouping for Human...	2021-07-07	Code
56	PifPaf (single-scale)	62.6	No	PifPaf: Composite Fields for Human Pose Estimation	2019-03-15	Code
57	SPM	62.6	No	Single-Stage Multi-Person Pose Machines	2019-08-24	Code
58	G-RMI	62.3	No	Towards Accurate Multi-person Pose Estimation in...	2017-01-06	-
59	G-RMI	62.3	No	Towards Accurate Multi-person Pose Estimation in...	2017-01-06	-
60	DyHead (Swin-L, multi scale)	62	No	Dynamic Head: Unifying Object Detection Heads wi...	2021-06-15	Code
61	Faster R-CNN (ImageNet+300M)	61.8	Yes	Revisiting Unreasonable Effectiveness of Data in...	2017-07-10	Code
62	OpenPose	61	Yes	OpenPose: Realtime Multi-Person 2D Pose Estimati...	2018-12-18	Code
63	AE	60.6	No	Associative Embedding: End-to-End Learning for J...	2016-11-16	Code
64	DirectPose (ResNet-101)	60.4	No	DirectPose: Direct End-to-End Multi-Person Pose ...	2019-11-18	Code
65	SOLQ (Swin-L, single scale)	60	No	SOLQ: Segmenting Objects by Learning Queries	2021-06-04	Code
66	CenterNet2 (Res2Net-101-DCN-BiFPN, self-training, 1560 single-scale)	59.7	No	Probabilistic two-stage detection	2021-03-12	Code
67	PyCenterNet (Swin-L, multi-scale)	59.2	No	CenterNet++ for Object Detection	2022-04-18	Code
68	QueryInst (single-scale)	58.9	No	Instances as Queries	2021-05-05	Code
69	KAPAO-S	58.6	No	Rethinking Keypoint Representations: Modeling Ke...	2021-11-16	Code
70	RMPE	58.6	No	RMPE: Regional Multi-person Pose Estimation	2016-12-01	Code
71	RMPE	58.6	No	RMPE: Regional Multi-person Pose Estimation	2016-12-01	Code
72	DetectoRS (ResNeXt-101-64x4d, multi-scale)	58.4	No	DetectoRS: Detecting Objects with Recursive Feat...	2020-06-03	Code
73	YOLOv4-P6 CSP-P6 (single-scale, 32 fps)	58.2	No	Scaled-YOLOv4: Scaling Cross Stage Partial Network	2020-11-16	Code
74	DirectPose (ResNet-101)	57.8	No	DirectPose: Direct End-to-End Multi-Person Pose ...	2019-11-18	Code
75	Mask R-CNN	57.8	No	Mask R-CNN	2017-03-20	Code
76	DetectoRS (ResNeXt-101-32x4d, multi-scale)	57.3	No	DetectoRS: Detecting Objects with Recursive Feat...	2020-06-03	Code
77	Mask R-CNN (ResNet-101-FPN, CBN)	57.3	No	Cross-Iteration Batch Normalization	2020-02-13	Code
78	UniverseNet-20.08d (Res2Net-101, DCN, multi-scale)	57.2	No	USB: Universal-Scale Object Detection Benchmark	2021-03-25	Code
79	CMU-Pose	57.1	No	Realtime Multi-Person 2D Pose Estimation using P...	2016-11-24	Code
80	CMU Pose	57.1	No	Realtime Multi-Person 2D Pose Estimation using P...	2016-11-24	Code
81	CMU-Pose	57.1	No	Realtime Multi-Person 2D Pose Estimation using P...	2016-11-24	Code
82	DetectoRS (ResNeXt-101-32x4d, single-scale)	56.5	No	DetectoRS: Detecting Objects with Recursive Feat...	2020-06-03	Code
83	LSNet (Res2Net-101+ DCN, multi-scale)	56.4	No	Location-Sensitive Visual Recognition with Cross...	2021-04-11	Code
84	PAA (ResNext-152-32x8d + DCN, multi-scale)	56.3	No	Probabilistic Anchor Assignment with IoU Predict...	2020-07-16	Code
85	PP-YOLOE-x(CSPRepResNet-x, 640x640, single-scale )	56.3	No	PP-YOLOE: An evolved version of YOLO	2022-03-30	Code
86	ResNeSt-200 (multi-scale)	56.2	No	ResNeSt: Split-Attention Networks	2020-04-19	Code
87	GFLV2 (Res2Net-101, DCN, multiscale)	56.1	No	Generalized Focal Loss V2: Learning Reliable Loc...	2020-11-25	Code
88	YOLOX-X (Modified CSP v5)	56.1	No	YOLOX: Exceeding YOLO Series in 2021	2021-07-18	Code
89	Cascade Mask R-CNN (Triple-ResNeXt152, multi-scale)	55.8	No	CBNet: A Novel Composite Backbone Network Archit...	2019-09-09	Code
90	NAS-FPN (AmoebaNet-D, learned aug)	55.5	No	Learning Data Augmentation Strategies for Object...	2019-06-26	Code
91	PP-YOLOE-l(CSPRepResNet-l, 640x640, single-scale )	55.3	No	PP-YOLOE: An evolved version of YOLO	2022-03-30	Code
92	UniverseNet-20.08d (Res2Net-101, DCN, single-scale)	55.3	No	USB: Universal-Scale Object Detection Benchmark	2021-03-25	Code
93	RetinaNet (SpineNet-190, 1280x1280)	55	No	SpineNet: Learning Scale-Permuted Backbone for R...	2019-12-10	Code
94	AC-FPN Cascade R-CNN (X-152-32x8d-FPN-IN5k, multi scale, only CEM)	54.8	No	Attention-guided Context Feature Pyramid Network...	2020-05-23	Code
95	TSD(SENet154-DCN,multi-scale)	54.8	No	Revisiting the Sibling Head in Object Detector	2020-03-17	Code
96	RepPoints v2 (ResNeXt-101, DCN, multi-scale)	54.6	No	RepPoints V2: Verification Meets Regression for ...	2020-07-16	Code
97	Deformable DETR (ResNeXt-101+DCN)	54.4	No	Deformable DETR: Deformable Transformers for End...	2020-10-08	Code
98	GFLV2 (Res2Net-101, DCN)	54.3	No	Generalized Focal Loss V2: Learning Reliable Loc...	2020-11-25	Code
99	RetinaNet (SpineNet-143, 1280x1280)	53.9	No	SpineNet: Learning Scale-Permuted Backbone for R...	2019-12-10	Code
100	OTA (ResNeXt-101+DCN, multiscale)	53.7	No	OTA: Optimal Transport Assignment for Object Det...	2021-03-26	Code
101	FreeAnchor + SEPC (DCN, ResNext-101-64x4d)	53.3	No	Scale-Equalizing Pyramid Convolution for Object ...	2020-05-06	Code
102	aLRP Loss (ResNext-101-64x4d, DCN, multiscale test)	53.1	No	A Ranking-based, Balanced Loss Function Unifying...	2020-09-28	Code
103	Dynamic R-CNN (ResNet-101-DCN, multi-scale)	53	No	Dynamic R-CNN: Towards High Quality Object Detec...	2020-04-13	Code
104	ATSS (ResNetXt-64x4d-101+DCN,multi-scale)	52.9	No	Bridging the Gap Between Anchor-based and Anchor...	2019-12-05	Code
105	PP-YOLOE-m(CSPRepResNet-m, 640x640, single-scale )	52.9	No	PP-YOLOE: An evolved version of YOLO	2022-03-30	Code
106	D2Det (ResNet-101-DCN, multi-scale test)	52.7	No	-	-	Code
107	TSD(ResNet-101-Deformable, Image Pyramid)	52.5	No	Revisiting the Sibling Head in Object Detector	2020-03-17	Code
108	GFLV2 (ResNeXt-101, 32x4d, DCN)	52.4	No	Generalized Focal Loss V2: Learning Reliable Loc...	2020-11-25	Code
109	UniverseNet-20.08 (Res2Net-50, DCN, single-scale)	52.3	No	USB: Universal-Scale Object Detection Benchmark	2021-03-25	Code
110	RetinaNet (SpineNet-96, 1024x1024)	52.3	No	SpineNet: Learning Scale-Permuted Backbone for R...	2019-12-10	Code
111	RepPoints v2 (ResNeXt-101, DCN)	52.1	No	RepPoints V2: Verification Meets Regression for ...	2020-07-16	Code
112	CPNDet (Hourglass-104, multi-scale)	51.9	No	Corner Proposal Network for Anchor-free, Two-sta...	2020-07-27	Code
113	GFLV2 (ResNet-101-DCN)	51.9	No	Generalized Focal Loss V2: Learning Reliable Loc...	2020-11-25	Code
114	DAT-S (RetinaNet)	51.8	No	Vision Transformer with Deformable Attention	2022-01-03	Code
115	GFL (X-101-32x4d-DCN, single-scale)	51.7	No	Generalized Focal Loss: Learning Qualified and D...	2020-06-08	Code
116	PANet (ResNeXt-101, multi-scale)	51.7	No	Path Aggregation Network for Instance Segmentation	2018-03-05	Code
117	aLRP Loss (ResNext-101-64x4d, DCN, single scale)	51.5	No	A Ranking-based, Balanced Loss Function Unifying...	2020-09-28	Code
118	TridentNet (ResNet-101-Deformable, Image Pyramid)	51.3	No	Scale-Aware Trident Networks for Object Detection	2019-01-07	Code
119	aLRP Loss (ResNext-101-64x4d, single scale)	50.8	No	A Ranking-based, Balanced Loss Function Unifying...	2020-09-28	Code
120	ISTR (ResNet101-FPN-3x, single-scale)	50.4	No	ISTR: End-to-End Instance Segmentation with Tran...	2021-05-03	Code
121	MatrixNet Corners (ResNet-152, multi-scale)	50.4	No	Matrix Nets: A New Deep Architecture for Object ...	2019-08-13	Code
122	SAPD (ResNeXt-101, single-scale)	50.3	No	Soft Anchor-Point Object Detection	2019-11-27	Code
123	InterNet (ResNet-101-FPN, multi-scale)	50.3	No	Feature Intertwiner for Object Detection	2019-03-28	Code
124	RetinaNet (SpineNet-49, 896x896)	50.1	No	SpineNet: Learning Scale-Permuted Backbone for R...	2019-12-10	Code
125	CenterNet511 (Hourglass-104, multi-scale)	49.9	No	CenterNet: Keypoint Triplets for Object Detection	2019-04-17	Code
126	PPDet (ResNeXt-101-FPN, multiscale)	49.9	No	Reducing Label Noise in Anchor-Free Object Detec...	2020-08-03	Code
127	GFLV2 (ResNet-101)	49.9	No	Generalized Focal Loss V2: Learning Reliable Loc...	2020-11-25	Code
128	HTC (HRNetV2p-W48)	49.7	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
129	RPDet (ResNet-101-DCN, multi-scale)	49.7	No	RepPoints: Point Set Representation for Object D...	2019-04-25	Code
130	M2Det (ResNet-101, multi-scale)	49.6	No	M2Det: A Single-Shot Object Detector based on Mu...	2018-11-12	Code
131	DCNv2 (ResNet-101, multi-scale)	49.1	No	Deformable ConvNets v2: More Deformable, Better ...	2018-11-27	Code
132	Cascade R-CNN-FPN (ResNet-101, map-guided)	49	No	InstaBoost: Boosting Instance Segmentation via P...	2019-08-21	Code
133	YOLOv4 (CD53)	49	Yes	Scaled-YOLOv4: Scaling Cross Stage Partial Network	2020-11-16	Code
134	SNIPER (ResNet-101)	48.9	No	SNIPER: Efficient Multi-Scale Training	2018-05-23	Code
135	D-RFCN + SNIP (DPN-98 with flip, multi-scale)	48.8	No	An Analysis of Scale Invariance in Object Detect...	2017-11-22	-
136	ISTR (ResNet50-FPN-3x, single-scale)	48.7	No	ISTR: End-to-End Instance Segmentation with Tran...	2021-05-03	Code
137	Mask R-CNN (HRNetV2p-W48 + cascade)	48.6	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
138	HoughNet (MS)	48.5	No	HoughNet: Integrating near and long-range eviden...	2020-07-05	Code
139	YOLOF-DC5	48.5	No	You Only Look One-level Feature	2021-03-17	Code
140	CenterMask+VoVNetV2-99 (single-scale)	48.3	No	CenterMask : Real-Time Anchor-Free Instance Segm...	2019-11-15	Code
141	aLRP Loss (ResNext-101, DCN, 500 scale)	48.1	No	A Ranking-based, Balanced Loss Function Unifying...	2020-09-28	Code
142	FreeAnchor (ResNeXt-101)	47.9	No	FreeAnchor: Learning to Match Anchors for Visual...	2019-09-05	Code
143	M2Det (VGG-16, multi-scale)	47.9	No	M2Det: A Single-Shot Object Detector based on Mu...	2018-11-12	Code
144	AC-FPN Cascade R-CNN(ResNet-101, single scale)	47.7	No	Attention-guided Context Feature Pyramid Network...	2020-05-23	Code
145	RetinaNet (SpineNet-49, 640x640)	47.7	No	SpineNet: Learning Scale-Permuted Backbone for R...	2019-12-10	Code
146	GFLV2 (ResNet-50)	47.7	No	Generalized Focal Loss V2: Learning Reliable Loc...	2020-11-25	Code
147	FCOS (ResNeXt-64x4d-101-FPN 4 + improvements)	47.5	No	FCOS: Fully Convolutional One-Stage Object Detec...	2019-04-02	Code
148	HSD (Rest101, 768x768, single-scale test)	47.3	No	-	-	Code
149	CenterMask + X-101-32x8d (single-scale)	47.2	No	CenterMask : Real-Time Anchor-Free Instance Segm...	2019-11-15	Code
150	FSAF (ResNeXt-101, multi-scale)	47.1	No	Feature Selective Anchor-Free Module for Single-...	2019-03-02	Code
151	FoveaBox (ResNeXt-101)	46.9	No	FoveaBox: Beyond Anchor-based Object Detector	2019-04-08	Code
152	ExtremeNet (Hourglass-104, multi-scale)	46.9	No	Bottom-up Object Detection by Grouping Extreme a...	2019-01-23	Code
153	Faster R-CNN (LIP-ResNet-101-MD w FPN)	46.7	No	LIP: Local Importance-based Pooling	2019-08-12	Code
154	YOLOv4-608	46.7	Yes	YOLOv4: Optimal Speed and Accuracy of Object Det...	2020-04-23	Code
155	YOLOv3 @800 + ASFF* (Darknet-53)	46.6	Yes	Learning Spatial Fusion for Single-Shot Object D...	2019-11-21	Code
156	TridentNet (ResNet-101)	46.6	No	Scale-Aware Trident Networks for Object Detection	2019-01-07	Code
157	D-RFCN + SNIP (ResNet-101, multi-scale)	46.5	No	An Analysis of Scale Invariance in Object Detect...	2017-11-22	-
158	Grid R-CNN (ResNeXt-101-FPN)	46.5	No	Grid R-CNN	2018-11-29	Code
159	M2Det (VGG-16, single-scale)	46.5	No	M2Det: A Single-Shot Object Detector based on Mu...	2018-11-12	Code
160	PP-YOLOE-s(CSPRepResNet-s, 640x640, single-scale )	46.4	No	PP-YOLOE: An evolved version of YOLO	2022-03-30	Code
161	SNIPER (ResNet-50)	46.3	No	SNIPER: Efficient Multi-Scale Training	2018-05-23	Code
162	FCOS (ResNeXt-101-64x4d-FPN)	46.2	No	FCOS: Fully Convolutional One-Stage Object Detec...	2019-04-02	Code
163	RPDet (ResNet-101-DCN)	46.2	No	RepPoints: Point Set Representation for Object D...	2019-04-25	Code
164	Libra R-CNN (ResNeXt-101-FPN)	45.6	No	Libra R-CNN: Towards Balanced Learning for Objec...	2019-04-04	Code
165	FCOS (ResNeXt-32x8d-101-FPN)	45.6	No	FCOS: Fully Convolutional One-Stage Object Detec...	2019-04-02	Code
166	RetinaMask (ResNeXt-101-FPN-GN)	45.6	No	RetinaMask: Learning to predict masks improves s...	2019-01-10	Code
167	Cascade R-CNN (ResNet-101-FPN+, cascade)	45.5	No	Cascade R-CNN: Delving into High Quality Object ...	2017-12-03	Code
168	Cascade R-CNN	45.5	No	Cascade R-CNN: High Quality Object Detection and...	2019-06-24	Code
169	SpineNet-49 (640, RetinaNet, single-scale)	45.2	No	SpineNet: Learning Scale-Permuted Backbone for R...	2019-12-10	Code
170	RefineDet512+ (ResNet-101)	45.1	No	Single-Shot Refinement Neural Network for Object...	2017-11-18	Code
171	GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)	45.1	No	Gradient Harmonized Single-stage Detector	2018-11-13	Code
172	FCOS (HRNet-W32-5l)	45	No	FCOS: Fully Convolutional One-Stage Object Detec...	2019-04-02	Code
173	RetinaNet (SpineNet-49S, 640x640)	45	No	SpineNet: Learning Scale-Permuted Backbone for R...	2019-12-10	Code
174	CornerNet511 (Hourglass-104, multi-scale)	44.8	No	CornerNet: Detecting Objects as Paired Keypoints	2018-08-03	Code
175	CornerNet-Saccade (Hourglass-104, multi-scale)	44.6	No	CornerNet-Lite: Efficient Keypoint Based Object ...	2019-04-18	Code
176	Faster R-CNN (HRNetV2p-W48)	44.6	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
177	FSAF (ResNet-101, single-scale)	44.2	No	Feature Selective Anchor-Free Module for Single-...	2019-03-02	Code
178	RetinaNet (ResNeXt-101-FPN)	44.2	No	Focal Loss for Dense Object Detection	2017-08-07	Code
179	RPDet (ResNet-101)	44.1	No	RepPoints: Point Set Representation for Object D...	2019-04-25	Code
180	HTC (ResNeXt-101-FPN)	43.9	No	Hybrid Task Cascade for Instance Segmentation	2019-01-22	Code
181	CenterNet-DLA (DLA-34, multi-scale)	43.9	No	Objects as Points	2019-04-16	Code
182	ResNet-50-DW-DPN (Deformable Kernels)	43.9	No	Deformable Kernels: Adapting Effective Receptive...	2019-10-07	Code
183	M2Det (ResNet-101, single-scale)	43.9	No	M2Det: A Single-Shot Object Detector based on Mu...	2018-11-12	Code
184	RDSNet (ResNet-101, RetinaNet, mask, MBRM)	43.5	No	RDSNet: A New Deep Architecture for Reciprocal O...	2019-12-11	Code
185	ExtremeNet (Hourglass-104, single-scale)	43.2	No	Bottom-up Object Detection by Grouping Extreme a...	2019-01-23	Code
186	Mask R-CNN (ResNeXt-101-FPN)	43.2	No	Mask R-CNN	2017-03-20	Code
187	Faster R-CNN (Cascade RPN)	42.8	Yes	Cascade RPN: Delving into High-Quality Region Pr...	2019-09-15	Code
188	Cascade R-CNN (ResNet-50-FPN+, cascade)	42.7	No	Cascade R-CNN: Delving into High Quality Object ...	2017-12-03	Code
189	RetinaNet (ResNet-101-FPN)	42.7	No	Focal Loss for Dense Object Detection	2017-08-07	Code
190	FCOS (HRNetV2p-W48)	42.6	Yes	Deep High-Resolution Representation Learning for...	2019-08-20	Code
191	GA-Faster-RCNN	42.6	No	Region Proposal by Guided Anchoring	2019-01-10	Code
192	Fast R-CNN (Cascade RPN)	42.4	Yes	Cascade RPN: Delving into High-Quality Region Pr...	2019-09-15	Code
193	SaccadeNet (DLA-34-DCN)	42.1	No	SaccadeNet: A Fast and Accurate Object Detector	2020-03-26	Code
194	RetinaMask (ResNet-50-FPN)	42	No	RetinaMask: Learning to predict masks improves s...	2019-01-10	Code
195	Cascade R-CNN (ResNet-101-FPN+)	41.8	No	Cascade R-CNN: Delving into High Quality Object ...	2017-12-03	Code
196	Mask R-CNN (ResNet-101-FPN)	41.1	No	Mask R-CNN	2017-03-20	Code
197	Faster R-CNN (ImageNet+300M)	41.1	No	Revisiting Unreasonable Effectiveness of Data in...	2017-07-10	Code
198	RefineDet512+ (VGG-16)	40.3	No	Single-Shot Refinement Neural Network for Object...	2017-11-18	Code
199	DeformConv-R-FCN (Aligned-Inception-ResNet)	40.1	No	Deformable Convolutional Networks	2017-03-17	Code
200	RefineDet512 (ResNet-101)	39.9	No	Single-Shot Refinement Neural Network for Object...	2017-11-18	Code
201	CornerNet511 (Hourglass-52, single-scale)	39	No	CornerNet: Detecting Objects as Paired Keypoints	2018-08-03	Code
202	Cascade R-CNN (ResNet-50-FPN+)	38.8	No	Cascade R-CNN: Delving into High Quality Object ...	2017-12-03	Code

#14xRSN-50 (ensemble)SOTA
83.8
APM· 2020-03-09
Learning Delicate Local Representations for Multi-Person Pose Estimation Code
#24xRSN-50
83.3
APM· 2020-03-09
Learning Delicate Local Representations for Multi-Person Pose Estimation Code
#3HRNet-W48+UDPSOTA
82.4
APM· 2019-11-18
The Devil is in the Details: Delving into Unbiased Data Processing for Human Pose Estimation Code
#4PPE (ResNeXt-101)
80.7
APM· 2022-06-15
Deep Multi-Task Networks For Occluded Pedestrian Pose Estimation
#5ViTPose (ViTAE-G, ensemble)
77.8
APM· 2022-04-26
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation Code
#6ViTPose (ViTAE-G)
77.5
APM· 2022-04-26
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation Code
#7UDP-Pose-PSA(384x288)
76.3
APM· 2021-07-02
Polarized Self-Attention: Towards High-quality Pixel-wise Regression Code
#8UDP-Pose-PSA(256x192)
76.1
APM· 2021-07-02
Polarized Self-Attention: Towards High-quality Pixel-wise Regression Code
#9PoseBH-H
75.9
APM· Augmentations· 2025-05-23
PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation Code
#10CCM+
75
APM· 2020-02-03
Towards High Performance Human Keypoint Detection Code
#11SCIO (HRNet-48)
74.1
APM· 2022-07-06
Self-Constrained Inference Optimization on Structural Groups for Human Pose Estimation
#12HRNet-W48+DARKSOTA
73.6
APM· 2019-10-14
Distribution-Aware Coordinate Representation for Human Pose Estimation Code
#13EvoPose2D-L
73.5
APM· 2020-11-17
EvoPose2D: Pushing the Boundaries of 2D Human Pose Estimation using Accelerated Neuroevolution with Weight Transfer Code
#14HRNet-W48 + extra dataSOTA
73.4
APM· 2019-02-25
Deep High-Resolution Representation Learning for Human Pose Estimation Code
#15HRNet*
73.4
APM· 2019-02-25
Deep High-Resolution Representation Learning for Human Pose Estimation Code
#16Simple Base+*SOTA
73
APM· 2018-04-17
Simple Baselines for Human Pose Estimation and Tracking Code
#17OmniPose (WASPv2)
72.6
APM· 2021-03-18
OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation Code
#18HRFormer-B
72.5
APM· 2021-10-18
HRFormer: High-Resolution Transformer for Dense Prediction Code
#19MSPN
72.3
APM· 2019-01-01
Rethinking on Multi-Stage Networks for Human Pose Estimation Code
#20MSPN
72.3
APM· 2019-01-01
Rethinking on Multi-Stage Networks for Human Pose Estimation Code
#21HRNet
71.9
APM· 2019-02-25
Deep High-Resolution Representation Learning for Human Pose Estimation Code
#22MIPNet
71.4
APM· 2021-01-27
Multi-Instance Pose Networks: Rethinking Top-Down Pose Estimation Code
#23TransPose-H-A6
71.3
APM· 2020-12-28
TransPose: Keypoint Localization via Transformer Code
#24DPIT-L
71.3
APM· 2022-09-02
DPIT: Dual-Pipeline Integrated Transformer for Human Pose Estimation
#25PoseFix
71.1
APM· 2018-12-10
PoseFix: Model-agnostic General Human Pose Refinement Network Code
#26S-ViPNAS-HRNetW32
70.5
APM· 2021-05-21
ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search Code
#27Flow-based (ResNet-152)
70.3
APM· 2018-04-17
Simple Baselines for Human Pose Estimation and Tracking Code
#28Simple Base
70.3
APM· 2018-04-17
Simple Baselines for Human Pose Estimation and Tracking Code
#29CPN+SOTA
69.5
APM· 2017-11-20
Cascaded Pyramid Network for Multi-Person Pose Estimation Code
#30TFPose (ND=6 ResNet-50)
69.1
APM· 2021-03-29
TFPose: Direct Human Pose Estimation with Transformers
#31CPN
68.7
APM· 2017-11-20
Cascaded Pyramid Network for Multi-Person Pose Estimation Code
#32RMPE++SOTA
68
APM· 2016-12-01
RMPE: Regional Multi-person Pose Estimation Code
#33EVA
67.7
APM· 2022-11-14
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale Code
#34Focal-Stable-DINO (Focal-Huge, no TTA)
67.6
APM· 2023-04-25
A Strong and Reproducible Object Detector with Only Public Datasets Code
#35HigherHRNet (ScaleNet_P4)
67.5
APM· 2020-11-30
ScaleNAS: One-Shot Learning of Scale-Aware Representations for Visual Recognition
#36Dite-HRNet-30
67.4
APM· 2022-04-22
Dite-HRNet: Dynamic Lightweight High-Resolution Network for Human Pose Estimation Code
#37S-ViPNAS-Res50
67.3
APM· 2021-05-21
ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search Code
#38Group DETR v2
67.2
APM· 2022-11-07
Group DETR v2: Strong Object Detector with Encoder-Decoder Pretraining
#39OpenPifPaf
67.1
APM· 2021-03-03
OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association Code
#40Relation-DETR (Focal-L)
66.9
APM· 2024-07-16
Relation DETR: Exploring Explicit Position Relation Prior for Object Detection Code
#41DETA (Swin-L)
66.9
APM· 2022-12-12
NMS Strikes Back Code
#42Lite-HRNet-30
66.9
APM· Augmentations· 2021-04-13
Lite-HRNet: A Lightweight High-Resolution Network Code
#43Plain-DETR (Swin-L)
66.8
APM
No paperCode
#44Simple Pose
66.8
APM· Augmentations· 2019-11-24
Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation Code
#45Simple Pose
66.8
APM· 2019-11-24
Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation Code
#46Identity Mapping Hourglass
66.8
APM· 2019-11-24
Simple Pose: Rethinking and Improving a Bottom-up Approach for Multi-Person Pose Estimation Code
#47HigherHRNet (HR-Net-48)
66.6
APM· 2019-08-27
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation Code
#48KAPAO-L
66.3
APM· 2021-11-16
Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation Code
#49SMPR (HR-Net-32)
65.9
APM· 2020-06-28
SMPR: Single-Stage Multi-Person Pose Regression Code
#50GLIP (Swin-L, multi-scale)
64.9
APM· 2021-12-07
Grounded Language-Image Pre-training Code
#51KAPAO-M
64.3
APM· 2021-11-16
Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation Code
#52PersonLab
64.1
APM· 2018-03-22
PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model Code
#53DyHead (Swin-L, multi scale, self-training)
64
APM· 2021-06-15
Dynamic Head: Unifying Object Detection Heads with Attentions Code
#54Lite-HRNet-18
64
APM· 2021-04-13
Lite-HRNet: A Lightweight High-Resolution Network Code
#55Hourglass-104
63.3
APM· 2021-07-07
Greedy Offset-Guided Keypoint Grouping for Human Pose Estimation Code
#56PifPaf (single-scale)
62.6
APM· 2019-03-15
PifPaf: Composite Fields for Human Pose Estimation Code
#57SPM
62.6
APM· 2019-08-24
Single-Stage Multi-Person Pose Machines Code
#58G-RMI
62.3
APM· 2017-01-06
Towards Accurate Multi-person Pose Estimation in the Wild
#59G-RMI
62.3
APM· 2017-01-06
Towards Accurate Multi-person Pose Estimation in the Wild
#60DyHead (Swin-L, multi scale)
62
APM· 2021-06-15
Dynamic Head: Unifying Object Detection Heads with Attentions Code
#61Faster R-CNN (ImageNet+300M)
61.8
APM· Augmentations· 2017-07-10
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era Code
#62OpenPose
61
APM· Augmentations· 2018-12-18
OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields Code
#63AESOTA
60.6
APM· 2016-11-16
Associative Embedding: End-to-End Learning for Joint Detection and Grouping Code
#64DirectPose (ResNet-101)
60.4
APM· 2019-11-18
DirectPose: Direct End-to-End Multi-Person Pose Estimation Code
#65SOLQ (Swin-L, single scale)
60
APM· 2021-06-04
SOLQ: Segmenting Objects by Learning Queries Code
#66CenterNet2 (Res2Net-101-DCN-BiFPN, self-training, 1560 single-scale)
59.7
APM· 2021-03-12
Probabilistic two-stage detection Code
#67PyCenterNet (Swin-L, multi-scale)
59.2
APM· 2022-04-18
CenterNet++ for Object Detection Code
#68QueryInst (single-scale)
58.9
APM· 2021-05-05
Instances as Queries Code
#69KAPAO-S
58.6
APM· 2021-11-16
Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation Code
#70RMPE
58.6
APM· 2016-12-01
RMPE: Regional Multi-person Pose Estimation Code
#71RMPE
58.6
APM· 2016-12-01
RMPE: Regional Multi-person Pose Estimation Code
#72DetectoRS (ResNeXt-101-64x4d, multi-scale)
58.4
APM· 2020-06-03
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution Code
#73YOLOv4-P6 CSP-P6 (single-scale, 32 fps)
58.2
APM· 2020-11-16
Scaled-YOLOv4: Scaling Cross Stage Partial Network Code
#74DirectPose (ResNet-101)
57.8
APM· 2019-11-18
DirectPose: Direct End-to-End Multi-Person Pose Estimation Code
#75Mask R-CNN
57.8
APM· 2017-03-20
Mask R-CNN Code
#76DetectoRS (ResNeXt-101-32x4d, multi-scale)
57.3
APM· 2020-06-03
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution Code
#77Mask R-CNN (ResNet-101-FPN, CBN)
57.3
APM· 2020-02-13
Cross-Iteration Batch Normalization Code
#78UniverseNet-20.08d (Res2Net-101, DCN, multi-scale)
57.2
APM· 2021-03-25
USB: Universal-Scale Object Detection Benchmark Code
#79CMU-Pose
57.1
APM· 2016-11-24
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields Code
#80CMU Pose
57.1
APM· 2016-11-24
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields Code
#81CMU-Pose
57.1
APM· 2016-11-24
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields Code
#82DetectoRS (ResNeXt-101-32x4d, single-scale)
56.5
APM· 2020-06-03
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution Code
#83LSNet (Res2Net-101+ DCN, multi-scale)
56.4
APM· 2021-04-11
Location-Sensitive Visual Recognition with Cross-IOU Loss Code
#84PAA (ResNext-152-32x8d + DCN, multi-scale)
56.3
APM· 2020-07-16
Probabilistic Anchor Assignment with IoU Prediction for Object Detection Code
#85PP-YOLOE-x(CSPRepResNet-x, 640x640, single-scale )
56.3
APM· 2022-03-30
PP-YOLOE: An evolved version of YOLO Code
#86ResNeSt-200 (multi-scale)
56.2
APM· 2020-04-19
ResNeSt: Split-Attention Networks Code
#87GFLV2 (Res2Net-101, DCN, multiscale)
56.1
APM· 2020-11-25
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection Code
#88YOLOX-X (Modified CSP v5)
56.1
APM· 2021-07-18
YOLOX: Exceeding YOLO Series in 2021 Code
#89Cascade Mask R-CNN (Triple-ResNeXt152, multi-scale)
55.8
APM· 2019-09-09
CBNet: A Novel Composite Backbone Network Architecture for Object Detection Code
#90NAS-FPN (AmoebaNet-D, learned aug)
55.5
APM· 2019-06-26
Learning Data Augmentation Strategies for Object Detection Code
#91PP-YOLOE-l(CSPRepResNet-l, 640x640, single-scale )
55.3
APM· 2022-03-30
PP-YOLOE: An evolved version of YOLO Code
#92UniverseNet-20.08d (Res2Net-101, DCN, single-scale)
55.3
APM· 2021-03-25
USB: Universal-Scale Object Detection Benchmark Code
#93RetinaNet (SpineNet-190, 1280x1280)
55
APM· 2019-12-10
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization Code
#94AC-FPN Cascade R-CNN (X-152-32x8d-FPN-IN5k, multi scale, only CEM)
54.8
APM· 2020-05-23
Attention-guided Context Feature Pyramid Network for Object Detection Code
#95TSD(SENet154-DCN,multi-scale)
54.8
APM· 2020-03-17
Revisiting the Sibling Head in Object Detector Code
#96RepPoints v2 (ResNeXt-101, DCN, multi-scale)
54.6
APM· 2020-07-16
RepPoints V2: Verification Meets Regression for Object Detection Code
#97Deformable DETR (ResNeXt-101+DCN)
54.4
APM· 2020-10-08
Deformable DETR: Deformable Transformers for End-to-End Object Detection Code
#98GFLV2 (Res2Net-101, DCN)
54.3
APM· 2020-11-25
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection Code
#99RetinaNet (SpineNet-143, 1280x1280)
53.9
APM· 2019-12-10
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization Code
#100OTA (ResNeXt-101+DCN, multiscale)
53.7
APM· 2021-03-26
OTA: Optimal Transport Assignment for Object Detection Code
#101FreeAnchor + SEPC (DCN, ResNext-101-64x4d)
53.3
APM· 2020-05-06
Scale-Equalizing Pyramid Convolution for Object Detection Code
#102aLRP Loss (ResNext-101-64x4d, DCN, multiscale test)
53.1
APM· 2020-09-28
A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection Code
#103Dynamic R-CNN (ResNet-101-DCN, multi-scale)
53
APM· 2020-04-13
Dynamic R-CNN: Towards High Quality Object Detection via Dynamic Training Code
#104ATSS (ResNetXt-64x4d-101+DCN,multi-scale)
52.9
APM· 2019-12-05
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection Code
#105PP-YOLOE-m(CSPRepResNet-m, 640x640, single-scale )
52.9
APM· 2022-03-30
PP-YOLOE: An evolved version of YOLO Code
#106D2Det (ResNet-101-DCN, multi-scale test)
52.7
APM
No paperCode
#107TSD(ResNet-101-Deformable, Image Pyramid)
52.5
APM· 2020-03-17
Revisiting the Sibling Head in Object Detector Code
#108GFLV2 (ResNeXt-101, 32x4d, DCN)
52.4
APM· 2020-11-25
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection Code
#109UniverseNet-20.08 (Res2Net-50, DCN, single-scale)
52.3
APM· 2021-03-25
USB: Universal-Scale Object Detection Benchmark Code
#110RetinaNet (SpineNet-96, 1024x1024)
52.3
APM· 2019-12-10
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization Code
#111RepPoints v2 (ResNeXt-101, DCN)
52.1
APM· 2020-07-16
RepPoints V2: Verification Meets Regression for Object Detection Code
#112CPNDet (Hourglass-104, multi-scale)
51.9
APM· 2020-07-27
Corner Proposal Network for Anchor-free, Two-stage Object Detection Code
#113GFLV2 (ResNet-101-DCN)
51.9
APM· 2020-11-25
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection Code
#114DAT-S (RetinaNet)
51.8
APM· 2022-01-03
Vision Transformer with Deformable Attention Code
#115GFL (X-101-32x4d-DCN, single-scale)
51.7
APM· 2020-06-08
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection Code
#116PANet (ResNeXt-101, multi-scale)
51.7
APM· 2018-03-05
Path Aggregation Network for Instance Segmentation Code
#117aLRP Loss (ResNext-101-64x4d, DCN, single scale)
51.5
APM· 2020-09-28
A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection Code
#118TridentNet (ResNet-101-Deformable, Image Pyramid)
51.3
APM· 2019-01-07
Scale-Aware Trident Networks for Object Detection Code
#119aLRP Loss (ResNext-101-64x4d, single scale)
50.8
APM· 2020-09-28
A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection Code
#120ISTR (ResNet101-FPN-3x, single-scale)
50.4
APM· 2021-05-03
ISTR: End-to-End Instance Segmentation with Transformers Code
#121MatrixNet Corners (ResNet-152, multi-scale)
50.4
APM· 2019-08-13
Matrix Nets: A New Deep Architecture for Object Detection Code
#122SAPD (ResNeXt-101, single-scale)
50.3
APM· 2019-11-27
Soft Anchor-Point Object Detection Code
#123InterNet (ResNet-101-FPN, multi-scale)
50.3
APM· 2019-03-28
Feature Intertwiner for Object Detection Code
#124RetinaNet (SpineNet-49, 896x896)
50.1
APM· 2019-12-10
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization Code
#125CenterNet511 (Hourglass-104, multi-scale)
49.9
APM· 2019-04-17
CenterNet: Keypoint Triplets for Object Detection Code
#126PPDet (ResNeXt-101-FPN, multiscale)
49.9
APM· 2020-08-03
Reducing Label Noise in Anchor-Free Object Detection Code
#127GFLV2 (ResNet-101)
49.9
APM· 2020-11-25
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection Code
#128HTC (HRNetV2p-W48)
49.7
APM· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#129RPDet (ResNet-101-DCN, multi-scale)
49.7
APM· 2019-04-25
RepPoints: Point Set Representation for Object Detection Code
#130M2Det (ResNet-101, multi-scale)
49.6
APM· 2018-11-12
M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network Code
#131DCNv2 (ResNet-101, multi-scale)
49.1
APM· 2018-11-27
Deformable ConvNets v2: More Deformable, Better Results Code
#132Cascade R-CNN-FPN (ResNet-101, map-guided)
49
APM· 2019-08-21
InstaBoost: Boosting Instance Segmentation via Probability Map Guided Copy-Pasting Code
#133YOLOv4 (CD53)
49
APM· Augmentations· 2020-11-16
Scaled-YOLOv4: Scaling Cross Stage Partial Network Code
#134SNIPER (ResNet-101)
48.9
APM· 2018-05-23
SNIPER: Efficient Multi-Scale Training Code
#135D-RFCN + SNIP (DPN-98 with flip, multi-scale)
48.8
APM· 2017-11-22
An Analysis of Scale Invariance in Object Detection - SNIP
#136ISTR (ResNet50-FPN-3x, single-scale)
48.7
APM· 2021-05-03
ISTR: End-to-End Instance Segmentation with Transformers Code
#137Mask R-CNN (HRNetV2p-W48 + cascade)
48.6
APM· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#138HoughNet (MS)
48.5
APM· 2020-07-05
HoughNet: Integrating near and long-range evidence for bottom-up object detection Code
#139YOLOF-DC5
48.5
APM· 2021-03-17
You Only Look One-level Feature Code
#140CenterMask+VoVNetV2-99 (single-scale)
48.3
APM· 2019-11-15
CenterMask : Real-Time Anchor-Free Instance Segmentation Code
#141aLRP Loss (ResNext-101, DCN, 500 scale)
48.1
APM· 2020-09-28
A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection Code
#142FreeAnchor (ResNeXt-101)
47.9
APM· 2019-09-05
FreeAnchor: Learning to Match Anchors for Visual Object Detection Code
#143M2Det (VGG-16, multi-scale)
47.9
APM· 2018-11-12
M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network Code
#144AC-FPN Cascade R-CNN(ResNet-101, single scale)
47.7
APM· 2020-05-23
Attention-guided Context Feature Pyramid Network for Object Detection Code
#145RetinaNet (SpineNet-49, 640x640)
47.7
APM· 2019-12-10
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization Code
#146GFLV2 (ResNet-50)
47.7
APM· 2020-11-25
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection Code
#147FCOS (ResNeXt-64x4d-101-FPN 4 + improvements)
47.5
APM· 2019-04-02
FCOS: Fully Convolutional One-Stage Object Detection Code
#148HSD (Rest101, 768x768, single-scale test)
47.3
APM
No paperCode
#149CenterMask + X-101-32x8d (single-scale)
47.2
APM· 2019-11-15
CenterMask : Real-Time Anchor-Free Instance Segmentation Code
#150FSAF (ResNeXt-101, multi-scale)
47.1
APM· 2019-03-02
Feature Selective Anchor-Free Module for Single-Shot Object Detection Code
#151FoveaBox (ResNeXt-101)
46.9
APM· 2019-04-08
FoveaBox: Beyond Anchor-based Object Detector Code
#152ExtremeNet (Hourglass-104, multi-scale)
46.9
APM· 2019-01-23
Bottom-up Object Detection by Grouping Extreme and Center Points Code
#153Faster R-CNN (LIP-ResNet-101-MD w FPN)
46.7
APM· 2019-08-12
LIP: Local Importance-based Pooling Code
#154YOLOv4-608
46.7
APM· Augmentations· 2020-04-23
YOLOv4: Optimal Speed and Accuracy of Object Detection Code
#155YOLOv3 @800 + ASFF* (Darknet-53)
46.6
APM· Augmentations· 2019-11-21
Learning Spatial Fusion for Single-Shot Object Detection Code
#156TridentNet (ResNet-101)
46.6
APM· 2019-01-07
Scale-Aware Trident Networks for Object Detection Code
#157D-RFCN + SNIP (ResNet-101, multi-scale)
46.5
APM· 2017-11-22
An Analysis of Scale Invariance in Object Detection - SNIP
#158Grid R-CNN (ResNeXt-101-FPN)
46.5
APM· 2018-11-29
Grid R-CNN Code
#159M2Det (VGG-16, single-scale)
46.5
APM· 2018-11-12
M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network Code
#160PP-YOLOE-s(CSPRepResNet-s, 640x640, single-scale )
46.4
APM· 2022-03-30
PP-YOLOE: An evolved version of YOLO Code
#161SNIPER (ResNet-50)
46.3
APM· 2018-05-23
SNIPER: Efficient Multi-Scale Training Code
#162FCOS (ResNeXt-101-64x4d-FPN)
46.2
APM· 2019-04-02
FCOS: Fully Convolutional One-Stage Object Detection Code
#163RPDet (ResNet-101-DCN)
46.2
APM· 2019-04-25
RepPoints: Point Set Representation for Object Detection Code
#164Libra R-CNN (ResNeXt-101-FPN)
45.6
APM· 2019-04-04
Libra R-CNN: Towards Balanced Learning for Object Detection Code
#165FCOS (ResNeXt-32x8d-101-FPN)
45.6
APM· 2019-04-02
FCOS: Fully Convolutional One-Stage Object Detection Code
#166RetinaMask (ResNeXt-101-FPN-GN)
45.6
APM· 2019-01-10
RetinaMask: Learning to predict masks improves state-of-the-art single-shot detection for free Code
#167Cascade R-CNN (ResNet-101-FPN+, cascade)
45.5
APM· 2017-12-03
Cascade R-CNN: Delving into High Quality Object Detection Code
#168Cascade R-CNN
45.5
APM· 2019-06-24
Cascade R-CNN: High Quality Object Detection and Instance Segmentation Code
#169SpineNet-49 (640, RetinaNet, single-scale)
45.2
APM· 2019-12-10
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization Code
#170RefineDet512+ (ResNet-101)
45.1
APM· 2017-11-18
Single-Shot Refinement Neural Network for Object Detection Code
#171GHM-C + GHM-R (RetinaNet-FPN-ResNeXt-101)
45.1
APM· 2018-11-13
Gradient Harmonized Single-stage Detector Code
#172FCOS (HRNet-W32-5l)
45
APM· 2019-04-02
FCOS: Fully Convolutional One-Stage Object Detection Code
#173RetinaNet (SpineNet-49S, 640x640)
45
APM· 2019-12-10
SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization Code
#174CornerNet511 (Hourglass-104, multi-scale)
44.8
APM· 2018-08-03
CornerNet: Detecting Objects as Paired Keypoints Code
#175CornerNet-Saccade (Hourglass-104, multi-scale)
44.6
APM· 2019-04-18
CornerNet-Lite: Efficient Keypoint Based Object Detection Code
#176Faster R-CNN (HRNetV2p-W48)
44.6
APM· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#177FSAF (ResNet-101, single-scale)
44.2
APM· 2019-03-02
Feature Selective Anchor-Free Module for Single-Shot Object Detection Code
#178RetinaNet (ResNeXt-101-FPN)
44.2
APM· 2017-08-07
Focal Loss for Dense Object Detection Code
#179RPDet (ResNet-101)
44.1
APM· 2019-04-25
RepPoints: Point Set Representation for Object Detection Code
#180HTC (ResNeXt-101-FPN)
43.9
APM· 2019-01-22
Hybrid Task Cascade for Instance Segmentation Code
#181CenterNet-DLA (DLA-34, multi-scale)
43.9
APM· 2019-04-16
Objects as Points Code
#182ResNet-50-DW-DPN (Deformable Kernels)
43.9
APM· 2019-10-07
Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation Code
#183M2Det (ResNet-101, single-scale)
43.9
APM· 2018-11-12
M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network Code
#184RDSNet (ResNet-101, RetinaNet, mask, MBRM)
43.5
APM· 2019-12-11
RDSNet: A New Deep Architecture for Reciprocal Object Detection and Instance Segmentation Code
#185ExtremeNet (Hourglass-104, single-scale)
43.2
APM· 2019-01-23
Bottom-up Object Detection by Grouping Extreme and Center Points Code
#186Mask R-CNN (ResNeXt-101-FPN)
43.2
APM· 2017-03-20
Mask R-CNN Code
#187Faster R-CNN (Cascade RPN)
42.8
APM· Augmentations· 2019-09-15
Cascade RPN: Delving into High-Quality Region Proposal Network with Adaptive Convolution Code
#188Cascade R-CNN (ResNet-50-FPN+, cascade)
42.7
APM· 2017-12-03
Cascade R-CNN: Delving into High Quality Object Detection Code
#189RetinaNet (ResNet-101-FPN)
42.7
APM· 2017-08-07
Focal Loss for Dense Object Detection Code
#190FCOS (HRNetV2p-W48)
42.6
APM· Augmentations· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#191GA-Faster-RCNN
42.6
APM· 2019-01-10
Region Proposal by Guided Anchoring Code
#192Fast R-CNN (Cascade RPN)
42.4
APM· Augmentations· 2019-09-15
Cascade RPN: Delving into High-Quality Region Proposal Network with Adaptive Convolution Code
#193SaccadeNet (DLA-34-DCN)
42.1
APM· 2020-03-26
SaccadeNet: A Fast and Accurate Object Detector Code
#194RetinaMask (ResNet-50-FPN)
42
APM· 2019-01-10
RetinaMask: Learning to predict masks improves state-of-the-art single-shot detection for free Code
#195Cascade R-CNN (ResNet-101-FPN+)
41.8
APM· 2017-12-03
Cascade R-CNN: Delving into High Quality Object Detection Code
#196Mask R-CNN (ResNet-101-FPN)
41.1
APM· 2017-03-20
Mask R-CNN Code
#197Faster R-CNN (ImageNet+300M)
41.1
APM· 2017-07-10
Revisiting Unreasonable Effectiveness of Data in Deep Learning Era Code
#198RefineDet512+ (VGG-16)
40.3
APM· 2017-11-18
Single-Shot Refinement Neural Network for Object Detection Code
#199DeformConv-R-FCN (Aligned-Inception-ResNet)
40.1
APM· 2017-03-17
Deformable Convolutional Networks Code
#200RefineDet512 (ResNet-101)
39.9
APM· 2017-11-18
Single-Shot Refinement Neural Network for Object Detection Code
#201CornerNet511 (Hourglass-52, single-scale)
39
APM· 2018-08-03
CornerNet: Detecting Objects as Paired Keypoints Code
#202Cascade R-CNN (ResNet-50-FPN+)
38.8
APM· 2017-12-03
Cascade R-CNN: Delving into High Quality Object Detection Code