Multi-Label Classification on MS-COCO
Metric: mAP (higher is better)
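The page does not spell out how mAP is computed. As a minimal sketch, assuming the usual multi-label definition (average precision per class over images ranked by score, then averaged over classes and reported in percent), it could be computed as below. The function names and the toy data are hypothetical, and individual papers on the leaderboard may use slightly different protocols.

```python
from typing import Sequence


def average_precision(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AP for one class: mean of precision@k over the ranks k holding a positive."""
    # Rank images by descending score (stable sort, so ties keep input order).
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits = 0
    precision_sum = 0.0
    for rank, idx in enumerate(order, start=1):
        if labels[idx] == 1:
            hits += 1
            precision_sum += hits / rank
    # AP is undefined for a class with no positives; returning 0.0 is an assumption.
    return precision_sum / hits if hits else 0.0


def mean_average_precision(
    score_matrix: Sequence[Sequence[float]],
    label_matrix: Sequence[Sequence[int]],
) -> float:
    """mAP in percent: per-class AP averaged over all classes (columns)."""
    num_classes = len(score_matrix[0])
    aps = []
    for c in range(num_classes):
        class_scores = [row[c] for row in score_matrix]
        class_labels = [row[c] for row in label_matrix]
        aps.append(average_precision(class_scores, class_labels))
    return 100.0 * sum(aps) / num_classes


# Toy data (hypothetical): 4 images x 2 classes.
toy_scores = [[0.9, 0.2], [0.8, 0.7], [0.3, 0.6], [0.1, 0.4]]
toy_labels = [[1, 0], [0, 1], [1, 1], [0, 0]]
toy_map = mean_average_precision(toy_scores, toy_labels)
print(round(toy_map, 2))
```

On the toy data, class 0 has AP (1/1 + 2/3) / 2 and class 1 has AP 1.0, so the script prints 91.67. Higher values mean positives are ranked ahead of negatives more consistently, which is why the leaderboard sorts best-first by this metric.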
Results
| # | Model | mAP | SOTA | Augmentations | Paper | Date | Code |
|---|---|---|---|---|---|---|---|
| 1 | ADDS(ViT-L-336, resolution 1344) | 93.54 | SOTA | No | Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features | 2022-08-19 | - |
| 2 | ADDS(ViT-L-336, resolution 640) | 93.41 | - | No | Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features | 2022-08-19 | - |
| 3 | ADDS(ViT-L-336, resolution 336) | 91.76 | - | No | Open Vocabulary Multi-Label Classification with Dual-Modal Decoder on Aligned Visual-Textual Features | 2022-08-19 | - |
| 4 | ML-Decoder(TResNet-XL, resolution 640) | 91.4 | SOTA | No | ML-Decoder: Scalable and Versatile Classification Head | 2021-11-25 | Code |
| 5 | Q2L-CvT(ImageNet-21K pretraining, resolution 384) | 91.3 | SOTA | No | Query2Label: A Simple Transformer Way to Multi-Label Classification | 2021-07-22 | Code |
| 6 | MLD-TResNet-L-AAM[640x640] | 91.3 | - | No | Combining Metric Learning and Attention Heads For Accurate and Efficient Multilabel Image Classification | 2022-09-14 | Code |
| 7 | ML-Decoder(TResNet-L, resolution 640) | 91.1 | - | No | ML-Decoder: Scalable and Versatile Classification Head | 2021-11-25 | Code |
| 8 | Q2L-SwinL(ImageNet-21K pretraining, resolution 384) | 90.5 | - | No | Query2Label: A Simple Transformer Way to Multi-Label Classification | 2021-07-22 | Code |
| 9 | Q2L-TResL(ImageNet-21K pretraining, resolution 640) | 90.3 | - | No | Query2Label: A Simple Transformer Way to Multi-Label Classification | 2021-07-22 | Code |
| 10 | IDA-SwinL | 90.3 | - | No | - | - | Code |
| 11 | CCD-SwinL | 90.3 | - | No | - | - | Code |
| 12 | MlTr-XL(ImageNet-21K pretraining, resolution 384) | 90 | SOTA | No | MlTr: Multi-label Classification with Transformer | 2021-06-11 | Code |
| 13 | TResNet-L-V2, (ImageNet-21K-P pretraining, resolution 640) | 89.8 | SOTA | No | ImageNet-21K Pretraining for the Masses | 2021-04-22 | Code |
| 14 | MlTr-L(ImageNet-21K pretraining, resolution 384) | 88.5 | - | No | MlTr: Multi-label Classification with Transformer | 2021-06-11 | Code |
| 15 | TResNet-XL (resolution 640) | 88.4 | SOTA | No | Asymmetric Loss For Multi-Label Classification | 2020-09-29 | Code |
| 16 | TResNet-L-V2, (ImageNet-21K-P pretraining, resolution 448) | 88.4 | - | No | ImageNet-21K Pretraining for the Masses | 2021-04-22 | Code |
| 17 | GKGNet(resolution 576) | 87.7 | - | No | GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition | 2023-08-28 | Code |
| 18 | M3TR(ImageNet-21K-P pretraining, resolution 448) | 87.5 | - | No | - | - | Code |
| 19 | GKGNet(resolution 448) | 86.7 | - | No | GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition | 2023-08-28 | Code |
| 20 | TResNet-L (resolution 448) | 86.6 | - | No | Asymmetric Loss For Multi-Label Classification | 2020-09-29 | Code |
| 21 | IDA-R101 | 86.3 | - | No | - | - | Code |
| 22 | TDRG-R101(576×576) | 86 | - | No | Transformer-based Dual Relation Graph for Multi-label Image Recognition | 2021-10-10 | Code |
| 23 | CCD-R101 | 85.3 | - | No | - | - | Code |
| 24 | ADD-GCN | 85.2 | - | No | Attention-Driven Dynamic Graph Convolutional Network for Multi-Label Image Recognition | 2020-12-05 | Code |
| 25 | Q2L-R101(resolution 448) | 84.9 | - | No | Query2Label: A Simple Transformer Way to Multi-Label Classification | 2021-07-22 | Code |
| 26 | TDRG-R101(448×448) | 84.6 | - | No | Transformer-based Dual Relation Graph for Multi-label Image Recognition | 2021-10-10 | Code |
| 27 | MCAR (ResNet101, 576x576) | 84.5 | SOTA | No | Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition | 2020-07-03 | Code |
| 28 | MS-CMA | 83.8 | SOTA | No | Cross-Modality Attention with Semantic Graph Embedding for Multi-Label Classification | 2019-12-17 | - |
| 29 | MCAR (ResNet101, 448x448) | 83.8 | - | No | Learning to Discover Multi-Class Attentional Regions for Multi-Label Image Recognition | 2020-07-03 | Code |
| 30 | KSSNet | 83.7 | SOTA | No | Multi-Label Classification with Label Graph Superimposing | 2019-11-21 | Code |
| 31 | MSRN | 83.4 | - | No | Multi-layered Semantic Representation Network for Multi-label Image Classification | 2021-06-22 | Code |
| 32 | ML-GCN | 83 | - | No | Multi-Label Graph Convolutional Network Representation Learning | 2019-12-26 | - |
| 33 | GKGNet(resolution 224) | 82 | - | No | GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition | 2023-08-28 | Code |
| 34 | ResNet-SRN | 77.1 | SOTA | No | Learning Spatial Regularization with Image-level Supervisions for Multi-label Image Classification | 2017-02-20 | Code |