3D on COCO minival

Metric: APS (higher is better)

LeaderboardDataset

Loading chart...

Results

Hide augmentations

Sort:

#	Model↕	APS▼	Augmentations	Paper	Date↕	Code
1	Focal-Stable-DINO (Focal-Huge, no TTA)	50.4	Yes	A Strong and Reproducible Object Detector with O...	2023-04-25	Code
2	EVA	49.4	Yes	EVA: Exploring the Limits of Masked Visual Repre...	2022-11-14	Code
3	UNINEXT-H	45.1	Yes	Universal Instance Perception as Object Discover...	2023-03-12	Code
4	DyHead (Swin-L, multi scale)	44.5	No	Dynamic Head: Unifying Object Detection Heads wi...	2021-06-15	Code
5	YOLOR-D6 (1280, single-scale, 31 fps)	40.4	No	You Only Learn One Representation: Unified Netwo...	2021-05-10	Code
6	QueryInst (single scale)	40.2	No	Instances as Queries	2021-05-05	Code
7	EfficientDet-D7x (single-scale)	40	No	EfficientDet: Scalable and Efficient Object Dete...	2019-11-20	Code
8	YOLOv4-P7 CSP-P7 (single-scale, 16 fps)	38.1	No	Scaled-YOLOv4: Scaling Cross Stage Partial Network	2020-11-16	Code
9	YOLOR-P6 (1280, single-scale, 72 fps)	37.4	No	You Only Learn One Representation: Unified Netwo...	2021-05-10	Code
10	UniverseNet-20.08d (Res2Net-101, DCN, multi-scale)	36.9	No	USB: Universal-Scale Object Detection Benchmark	2021-03-25	Code
11	ResNeSt-200 (multi-scale)	36.8	No	ResNeSt: Split-Attention Networks	2020-04-19	Code
12	DINO-5scale (36 epoch)	35	No	DINO: DETR with Improved DeNoising Anchor Boxes ...	2022-03-07	Code
13	Cascade RCNN-RS (SpineNet-143L, single scale)	34.5	No	Simple Training Strategies and Model Scaling for...	2021-06-30	Code
14	DINO-5scale (24 epoch)	34.5	No	DINO: DETR with Improved DeNoising Anchor Boxes ...	2022-03-07	Code
15	Cascade RCNN-RS (ResNet-200, single scale)	33.9	No	Simple Training Strategies and Model Scaling for...	2021-06-30	Code
16	UniverseNet-20.08d (Res2Net-101, DCN, single-scale)	33.5	No	USB: Universal-Scale Object Detection Benchmark	2021-03-25	Code
17	ResNeSt-200-DCN (single-scale)	32.67	No	ResNeSt: Split-Attention Networks	2020-04-19	Code
18	DN-Deformable-DETR-R50++	31.3	No	DN-DETR: Accelerate DETR Training by Introducing...	2022-03-02	Code
19	UniverseNet-20.08 (Res2Net-50, DCN, single-scale)	30.6	No	USB: Universal-Scale Object Detection Benchmark	2021-03-25	Code
20	MAE-Det(MAE-Det-L+GFLV2)	30.3	No	MAE-DET: Revisiting Maximum Entropy Principle in...	2021-11-26	Code
21	REGO-Deformable DETR-X101	30	No	Recurrent Glimpse-based Decoder for Detection wi...	2021-12-09	Code
22	HoughNet (HG-104, MS)	30	No	HoughNet: Integrating near and long-range eviden...	2020-07-05	Code
23	RetinaNet (ViL-Base, multi-scale, 3x)	29.9	No	Multi-Scale Vision Longformer: A New Vision Tran...	2021-03-29	Code
24	CenterMask+VoVNetV2-99 (single-scale)	29.2	No	CenterMask : Real-Time Anchor-Free Instance Segm...	2019-11-15	Code
25	RetinaNet (ViL-Base)	28.9	No	Multi-Scale Vision Longformer: A New Vision Tran...	2021-03-29	Code
26	HTC (HRNetV2p-W48)	28.8	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
27	Res2Net101+HTC	28.6	No	Res2Net: A New Multi-scale Backbone Architecture	2019-04-02	Code
28	Mask R-CNN (VoVNetV2-99, single-scale)	28.5	No	CenterMask : Real-Time Anchor-Free Instance Segm...	2019-11-15	Code
29	Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)	28.3	No	Sparse R-CNN: End-to-End Object Detection with L...	2020-11-25	Code
30	Pix2seq (R101-DC5)	28.2	No	Pix2seq: A Language Modeling Framework for Objec...	2021-09-22	Code
31	DAB-DETR-DC5-R101	28.1	No	DAB-DETR: Dynamic Anchor Boxes are Better Querie...	2022-01-28	Code
32	CenterMask+VoVNetV2-57 (single-scale)	27.7	No	CenterMask : Real-Time Anchor-Free Instance Segm...	2019-11-15	Code
33	Mask R-CNN (HRNetV2p-W48, cascade)	27.5	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
34	Conditional DETR-DC5-R101	27.2	No	Conditional DETR for Fast Training Convergence	2021-08-13	Code
35	Faster RCNN-R101-FPN+	27.2	No	End-to-End Object Detection with Transformers	2020-05-26	Code
36	HTC (HRNetV2p-W32)	27	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
37	R3-CNN (ResNet-50-FPN, GC-Net)	27	No	Recursively Refined R-CNN: Instance Segmentation...	2021-04-03	Code
38	Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)	26.9	No	Sparse R-CNN: End-to-End Object Detection with L...	2020-11-25	Code
39	Faster R-CNN (FPN, X-volution)	26.9	No	X-volution: On the unification of convolution an...	2021-06-04	-
40	CenterMask+X101-32x8d (single-scale)	26.7	No	CenterMask : Real-Time Anchor-Free Instance Segm...	2019-11-15	Code
41	Sparse R-CNN (ResNet-50, FPN)	26.7	No	Sparse R-CNN: End-to-End Object Detection with L...	2020-11-25	Code
42	R3-CNN (ResNet-50-FPN, DCN)	26.6	No	Recursively Refined R-CNN: Instance Segmentation...	2021-04-03	Code
43	Pix2seq (R50-DC5 )	26.6	No	Pix2seq: A Language Modeling Framework for Objec...	2021-09-22	Code
44	HTC (HRNetV2p-W18)	26.6	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
45	Cascade R-CNN (HRNetV2p-W48)	26.3	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
46	Sparse R-CNN (ResNet-101, FPN)	26.1	No	Sparse R-CNN: End-to-End Object Detection with L...	2020-11-25	Code
47	PVT-Large (RetinaNet 3x,MS)	26.1	No	Pyramid Vision Transformer: A Versatile Backbone...	2021-02-24	Code
48	Mask R-CNN (HRNetV2p-W32, cascade)	26.1	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
49	Anchor DETR-DC5-R101	25.8	No	Anchor DETR: Query Design for Transformer-Based ...	2021-09-15	Code
50	PVT-Large (RetinaNet 1x)	25.8	No	Pyramid Vision Transformer: A Versatile Backbone...	2021-02-24	Code
51	ExtremeNet (Hourglass-104, multi-scale)	25.7	No	Bottom-up Object Detection by Grouping Extreme a...	2019-01-23	Code
52	Cascade R-CNN (HRNetV2p-W32)	25.6	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
53	HoughNet (HG-104)	25.5	No	HoughNet: Integrating near and long-range eviden...	2020-07-05	Code
54	CornerNet-Saccade (Hourglass-54)	25.5	No	CornerNet-Lite: Efficient Keypoint Based Object ...	2019-04-18	Code
55	Mask R-CNN-FPN (ResNeXt-101, GN+WS)	25.49	No	Micro-Batch Training with Batch-Channel Normaliz...	2019-03-25	Code
56	PPDet (ResNet-101-FPN)	25.4	No	Reducing Label Noise in Anchor-Free Object Detec...	2020-08-03	Code
57	Conditional DETR-DC5-R50	25.3	No	Conditional DETR for Fast Training Convergence	2021-08-13	Code
58	Faster R-CNN (LIP-ResNet-101)	25.2	No	LIP: Local Importance-based Pooling	2019-08-12	Code
59	Mask R-CNN (HRNetV2p-W32)	25	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
60	TridentNet (ResNet-101)	24.9	No	Scale-Aware Trident Networks for Object Detection	2019-01-07	Code
61	Anchor DETR-DC5-R50	24.7	No	Anchor DETR: Query Design for Transformer-Based ...	2021-09-15	Code
62	R3-CNN (ResNet-50-FPN)	24.5	No	Recursively Refined R-CNN: Instance Segmentation...	2021-04-03	Code
63	Faster R-CNN (HRNetV2p-W32)	24.4	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
64	R3-CNN (ResNet-50-FPN, GRoIE)	24.4	No	Recursively Refined R-CNN: Instance Segmentation...	2021-04-03	Code
65	GCnet (ResNet-50-FPN, GRoIE)	24.2	No	GCNet: Non-local Networks Meet Squeeze-Excitatio...	2019-04-25	Code
66	DAB-DETR-R101	24.1	No	DAB-DETR: Dynamic Anchor Boxes are Better Querie...	2022-01-28	Code
67	Cascade R-CNN (ResNet-101-FPN+, cascade)	23.8	No	Cascade R-CNN: Delving into High Quality Object ...	2017-12-03	Code
68	CornerNet-Saccade (Hourglass-104)	23.8	No	CornerNet-Lite: Efficient Keypoint Based Object ...	2019-04-18	Code
69	DETR-DC5 (ResNet-101)	23.7	No	End-to-End Object Detection with Transformers	2020-05-26	Code
70	Cascade R-CNN (HRNetV2p-W18)	23.7	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
71	Conditional DETR-R101	23.6	No	Conditional DETR for Fast Training Convergence	2021-08-13	Code
72	CenterNet511 (Hourglass-52)	23.6	No	CenterNet: Keypoint Triplets for Object Detection	2019-04-17	Code
73	Grid R-CNN (ResNet-101-FPN)	23.4	No	Grid R-CNN	2018-11-29	Code
74	Cascade R-CNN (ResNet-50-FPN+)	22.9	No	Cascade R-CNN: Delving into High Quality Object ...	2017-12-03	Code
75	FPN+	22.9	No	Feature Pyramid Networks for Object Detection	2016-12-09	Code
76	Libra R-CNN (ResNet-50 FPN)	22.9	No	Libra R-CNN: Towards Balanced Learning for Objec...	2019-04-04	Code
77	Mask R-CNN (ResNet-50-FPN, GRoIE)	22.9	No	A novel Region of Interest Extraction Layer for ...	2020-04-28	Code
78	Conditional DETR-R50	22.7	No	Conditional DETR for Fast Training Convergence	2021-08-13	Code
79	Grid R-CNN (ResNet-50-FPN)	22.6	No	Grid R-CNN	2018-11-29	Code
80	Faster R-CNN (HRNetV2p-W18)	22.6	No	Deep High-Resolution Representation Learning for...	2019-08-20	Code
81	FoveaBox (ResNet-101-FPN, 800x800)	22.3	No	FoveaBox: Beyond Anchor-based Object Detector	2019-04-08	Code
82	FCOS (ResNet-50-FPN + improvements)	22.3	No	FCOS: Fully Convolutional One-Stage Object Detec...	2019-04-02	Code
83	Faster R-CNN (ResNet-50-FPN, GRoIE)	22.3	No	A novel Region of Interest Extraction Layer for ...	2020-04-28	Code
84	Faster R-CNN (ResNet-101, DCNv2)	22.2	No	Deformable ConvNets v2: More Deformable, Better ...	2018-11-27	Code
85	ExtremeNet (Hourglass-104, single-scale)	21.6	No	Bottom-up Object Detection by Grouping Extreme a...	2019-01-23	Code
86	HTC (cascade)	20.3	No	Hybrid Task Cascade for Instance Segmentation	2019-01-22	Code
87	FSAF (ResNet-50)	19.8	No	Feature Selective Anchor-Free Module for Single-...	2019-03-02	Code
88	GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)	19.6	No	Gradient Harmonized Single-stage Detector	2018-11-13	Code
89	FoveaBox (ResNet-101-FPN, 600x600)	19.5	No	FoveaBox: Beyond Anchor-based Object Detector	2019-04-08	Code
90	CornerNet511 (Hourglass-104)	18.6	No	CornerNet: Detecting Objects as Paired Keypoints	2018-08-03	Code
91	FoveaBox (ResNet-50-FPN, 600x600)	18.6	No	FoveaBox: Beyond Anchor-based Object Detector	2019-04-08	Code
92	M2Det (ResNet-1o1, 320x320)	15.9	No	M2Det: A Single-Shot Object Detector based on Mu...	2018-11-12	Code
93	M2Det (VGG-16, 320x320)	15	No	M2Det: A Single-Shot Object Detector based on Mu...	2018-11-12	Code
94	Faster R-CNN (Res2Net-50)	14	No	Res2Net: A New Multi-scale Backbone Architecture	2019-04-02	Code

#1Focal-Stable-DINO (Focal-Huge, no TTA)SOTA
50.4
APS· Augmentations· 2023-04-25
A Strong and Reproducible Object Detector with Only Public Datasets Code
#2EVASOTA
49.4
APS· Augmentations· 2022-11-14
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale Code
#3UNINEXT-H
45.1
APS· Augmentations· 2023-03-12
Universal Instance Perception as Object Discovery and Retrieval Code
#4DyHead (Swin-L, multi scale)SOTA
44.5
APS· 2021-06-15
Dynamic Head: Unifying Object Detection Heads with Attentions Code
#5YOLOR-D6 (1280, single-scale, 31 fps)SOTA
40.4
APS· 2021-05-10
You Only Learn One Representation: Unified Network for Multiple Tasks Code
#6QueryInst (single scale)SOTA
40.2
APS· 2021-05-05
Instances as Queries Code
#7EfficientDet-D7x (single-scale)SOTA
40
APS· 2019-11-20
EfficientDet: Scalable and Efficient Object Detection Code
#8YOLOv4-P7 CSP-P7 (single-scale, 16 fps)
38.1
APS· 2020-11-16
Scaled-YOLOv4: Scaling Cross Stage Partial Network Code
#9YOLOR-P6 (1280, single-scale, 72 fps)
37.4
APS· 2021-05-10
You Only Learn One Representation: Unified Network for Multiple Tasks Code
#10UniverseNet-20.08d (Res2Net-101, DCN, multi-scale)
36.9
APS· 2021-03-25
USB: Universal-Scale Object Detection Benchmark Code
#11ResNeSt-200 (multi-scale)
36.8
APS· 2020-04-19
ResNeSt: Split-Attention Networks Code
#12DINO-5scale (36 epoch)
35
APS· 2022-03-07
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection Code
#13Cascade RCNN-RS (SpineNet-143L, single scale)
34.5
APS· 2021-06-30
Simple Training Strategies and Model Scaling for Object Detection Code
#14DINO-5scale (24 epoch)
34.5
APS· 2022-03-07
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection Code
#15Cascade RCNN-RS (ResNet-200, single scale)
33.9
APS· 2021-06-30
Simple Training Strategies and Model Scaling for Object Detection Code
#16UniverseNet-20.08d (Res2Net-101, DCN, single-scale)
33.5
APS· 2021-03-25
USB: Universal-Scale Object Detection Benchmark Code
#17ResNeSt-200-DCN (single-scale)
32.67
APS· 2020-04-19
ResNeSt: Split-Attention Networks Code
#18DN-Deformable-DETR-R50++
31.3
APS· 2022-03-02
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising Code
#19UniverseNet-20.08 (Res2Net-50, DCN, single-scale)
30.6
APS· 2021-03-25
USB: Universal-Scale Object Detection Benchmark Code
#20MAE-Det(MAE-Det-L+GFLV2)
30.3
APS· 2021-11-26
MAE-DET: Revisiting Maximum Entropy Principle in Zero-Shot NAS for Efficient Object Detection Code
#21REGO-Deformable DETR-X101
30
APS· 2021-12-09
Recurrent Glimpse-based Decoder for Detection with Transformer Code
#22HoughNet (HG-104, MS)
30
APS· 2020-07-05
HoughNet: Integrating near and long-range evidence for bottom-up object detection Code
#23RetinaNet (ViL-Base, multi-scale, 3x)
29.9
APS· 2021-03-29
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding Code
#24CenterMask+VoVNetV2-99 (single-scale)SOTA
29.2
APS· 2019-11-15
CenterMask : Real-Time Anchor-Free Instance Segmentation Code
#25RetinaNet (ViL-Base)
28.9
APS· 2021-03-29
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding Code
#26HTC (HRNetV2p-W48)SOTA
28.8
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#27Res2Net101+HTCSOTA
28.6
APS· 2019-04-02
Res2Net: A New Multi-scale Backbone Architecture Code
#28Mask R-CNN (VoVNetV2-99, single-scale)
28.5
APS· 2019-11-15
CenterMask : Real-Time Anchor-Free Instance Segmentation Code
#29Sparse R-CNN (ResNet-101, learnable proposals, random crop aug, FPN)
28.3
APS· 2020-11-25
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals Code
#30Pix2seq (R101-DC5)
28.2
APS· 2021-09-22
Pix2seq: A Language Modeling Framework for Object Detection Code
#31DAB-DETR-DC5-R101
28.1
APS· 2022-01-28
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR Code
#32CenterMask+VoVNetV2-57 (single-scale)
27.7
APS· 2019-11-15
CenterMask : Real-Time Anchor-Free Instance Segmentation Code
#33Mask R-CNN (HRNetV2p-W48, cascade)
27.5
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#34Conditional DETR-DC5-R101
27.2
APS· 2021-08-13
Conditional DETR for Fast Training Convergence Code
#35Faster RCNN-R101-FPN+
27.2
APS· 2020-05-26
End-to-End Object Detection with Transformers Code
#36HTC (HRNetV2p-W32)
27
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#37R3-CNN (ResNet-50-FPN, GC-Net)
27
APS· 2021-04-03
Recursively Refined R-CNN: Instance Segmentation with Self-RoI Rebalancing Code
#38Sparse R-CNN (ResNet-50, learnable proposals, random crop aug, FPN)
26.9
APS· 2020-11-25
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals Code
#39Faster R-CNN (FPN, X-volution)
26.9
APS· 2021-06-04
X-volution: On the unification of convolution and self-attention
#40CenterMask+X101-32x8d (single-scale)
26.7
APS· 2019-11-15
CenterMask : Real-Time Anchor-Free Instance Segmentation Code
#41Sparse R-CNN (ResNet-50, FPN)
26.7
APS· 2020-11-25
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals Code
#42R3-CNN (ResNet-50-FPN, DCN)
26.6
APS· 2021-04-03
Recursively Refined R-CNN: Instance Segmentation with Self-RoI Rebalancing Code
#43Pix2seq (R50-DC5 )
26.6
APS· 2021-09-22
Pix2seq: A Language Modeling Framework for Object Detection Code
#44HTC (HRNetV2p-W18)
26.6
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#45Cascade R-CNN (HRNetV2p-W48)
26.3
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#46Sparse R-CNN (ResNet-101, FPN)
26.1
APS· 2020-11-25
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals Code
#47PVT-Large (RetinaNet 3x,MS)
26.1
APS· 2021-02-24
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions Code
#48Mask R-CNN (HRNetV2p-W32, cascade)
26.1
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#49Anchor DETR-DC5-R101
25.8
APS· 2021-09-15
Anchor DETR: Query Design for Transformer-Based Object Detection Code
#50PVT-Large (RetinaNet 1x)
25.8
APS· 2021-02-24
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions Code
#51ExtremeNet (Hourglass-104, multi-scale)SOTA
25.7
APS· 2019-01-23
Bottom-up Object Detection by Grouping Extreme and Center Points Code
#52Cascade R-CNN (HRNetV2p-W32)
25.6
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#53HoughNet (HG-104)
25.5
APS· 2020-07-05
HoughNet: Integrating near and long-range evidence for bottom-up object detection Code
#54CornerNet-Saccade (Hourglass-54)
25.5
APS· 2019-04-18
CornerNet-Lite: Efficient Keypoint Based Object Detection Code
#55Mask R-CNN-FPN (ResNeXt-101, GN+WS)
25.49
APS· 2019-03-25
Micro-Batch Training with Batch-Channel Normalization and Weight Standardization Code
#56PPDet (ResNet-101-FPN)
25.4
APS· 2020-08-03
Reducing Label Noise in Anchor-Free Object Detection Code
#57Conditional DETR-DC5-R50
25.3
APS· 2021-08-13
Conditional DETR for Fast Training Convergence Code
#58Faster R-CNN (LIP-ResNet-101)
25.2
APS· 2019-08-12
LIP: Local Importance-based Pooling Code
#59Mask R-CNN (HRNetV2p-W32)
25
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#60TridentNet (ResNet-101)SOTA
24.9
APS· 2019-01-07
Scale-Aware Trident Networks for Object Detection Code
#61Anchor DETR-DC5-R50
24.7
APS· 2021-09-15
Anchor DETR: Query Design for Transformer-Based Object Detection Code
#62R3-CNN (ResNet-50-FPN)
24.5
APS· 2021-04-03
Recursively Refined R-CNN: Instance Segmentation with Self-RoI Rebalancing Code
#63Faster R-CNN (HRNetV2p-W32)
24.4
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#64R3-CNN (ResNet-50-FPN, GRoIE)
24.4
APS· 2021-04-03
Recursively Refined R-CNN: Instance Segmentation with Self-RoI Rebalancing Code
#65GCnet (ResNet-50-FPN, GRoIE)
24.2
APS· 2019-04-25
GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond Code
#66DAB-DETR-R101
24.1
APS· 2022-01-28
DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR Code
#67Cascade R-CNN (ResNet-101-FPN+, cascade)SOTA
23.8
APS· 2017-12-03
Cascade R-CNN: Delving into High Quality Object Detection Code
#68CornerNet-Saccade (Hourglass-104)
23.8
APS· 2019-04-18
CornerNet-Lite: Efficient Keypoint Based Object Detection Code
#69DETR-DC5 (ResNet-101)
23.7
APS· 2020-05-26
End-to-End Object Detection with Transformers Code
#70Cascade R-CNN (HRNetV2p-W18)
23.7
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#71Conditional DETR-R101
23.6
APS· 2021-08-13
Conditional DETR for Fast Training Convergence Code
#72CenterNet511 (Hourglass-52)
23.6
APS· 2019-04-17
CenterNet: Keypoint Triplets for Object Detection Code
#73Grid R-CNN (ResNet-101-FPN)
23.4
APS· 2018-11-29
Grid R-CNN Code
#74Cascade R-CNN (ResNet-50-FPN+)
22.9
APS· 2017-12-03
Cascade R-CNN: Delving into High Quality Object Detection Code
#75FPN+SOTA
22.9
APS· 2016-12-09
Feature Pyramid Networks for Object Detection Code
#76Libra R-CNN (ResNet-50 FPN)
22.9
APS· 2019-04-04
Libra R-CNN: Towards Balanced Learning for Object Detection Code
#77Mask R-CNN (ResNet-50-FPN, GRoIE)
22.9
APS· 2020-04-28
A novel Region of Interest Extraction Layer for Instance Segmentation Code
#78Conditional DETR-R50
22.7
APS· 2021-08-13
Conditional DETR for Fast Training Convergence Code
#79Grid R-CNN (ResNet-50-FPN)
22.6
APS· 2018-11-29
Grid R-CNN Code
#80Faster R-CNN (HRNetV2p-W18)
22.6
APS· 2019-08-20
Deep High-Resolution Representation Learning for Visual Recognition Code
#81FoveaBox (ResNet-101-FPN, 800x800)
22.3
APS· 2019-04-08
FoveaBox: Beyond Anchor-based Object Detector Code
#82FCOS (ResNet-50-FPN + improvements)
22.3
APS· 2019-04-02
FCOS: Fully Convolutional One-Stage Object Detection Code
#83Faster R-CNN (ResNet-50-FPN, GRoIE)
22.3
APS· 2020-04-28
A novel Region of Interest Extraction Layer for Instance Segmentation Code
#84Faster R-CNN (ResNet-101, DCNv2)
22.2
APS· 2018-11-27
Deformable ConvNets v2: More Deformable, Better Results Code
#85ExtremeNet (Hourglass-104, single-scale)
21.6
APS· 2019-01-23
Bottom-up Object Detection by Grouping Extreme and Center Points Code
#86HTC (cascade)
20.3
APS· 2019-01-22
Hybrid Task Cascade for Instance Segmentation Code
#87FSAF (ResNet-50)
19.8
APS· 2019-03-02
Feature Selective Anchor-Free Module for Single-Shot Object Detection Code
#88GHM-C + GHM-R (RetinaNet-FPN-ResNet-50, M=30)
19.6
APS· 2018-11-13
Gradient Harmonized Single-stage Detector Code
#89FoveaBox (ResNet-101-FPN, 600x600)
19.5
APS· 2019-04-08
FoveaBox: Beyond Anchor-based Object Detector Code
#90CornerNet511 (Hourglass-104)
18.6
APS· 2018-08-03
CornerNet: Detecting Objects as Paired Keypoints Code
#91FoveaBox (ResNet-50-FPN, 600x600)
18.6
APS· 2019-04-08
FoveaBox: Beyond Anchor-based Object Detector Code
#92M2Det (ResNet-1o1, 320x320)
15.9
APS· 2018-11-12
M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network Code
#93M2Det (VGG-16, 320x320)
15
APS· 2018-11-12
M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network Code
#94Faster R-CNN (Res2Net-50)
14
APS· 2019-04-02
Res2Net: A New Multi-scale Backbone Architecture Code