Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Computer Vision
/
Image Retrieval
/
CREPE (Compositional REPresentation Evaluation)
Image Retrieval on CREPE (Compositional REPresentation Evaluation)
Metric: Recall@1 (HN-Comp, UC) (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
#
Model
↕
Recall@1 (HN-Comp, UC)
▼
Extra Data
Paper
Date
↕
Code
1
RN-50 (MosaiCLIP, CC-12M)
92.6
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
2
Swin-T (MosaiCLIP, CC-12M)
92.1
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
3
RN-50 (NegCLIP, CC-12M)
82
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
4
Swin-T (NegCLIP, CC-12M)
80.3
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
5
MosaiCLIP (CC-FT)
72.4
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
6
ViT-L-14 (LAION400M)
60.78
No
CREPE: Can Vision-Language Foundation Models Rea...
2022-12-13
Code
7
ViT-B-16+240 (LAION400M)
60.19
No
CREPE: Can Vision-Language Foundation Models Rea...
2022-12-13
Code
8
ViT-B-16 (LAION400M)
59
No
CREPE: Can Vision-Language Foundation Models Rea...
2022-12-13
Code
9
ViT-B-32 (LAION400M)
54.8
No
CREPE: Can Vision-Language Foundation Models Rea...
2022-12-13
Code
10
NegCLIP (CC-FT)
53.1
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
11
MosaiCLIP (YFCC-FT)
48.8
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
12
CLIP-FT (CC-FT)
45.8
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
13
RN50 (CC12M)
45.27
No
CREPE: Can Vision-Language Foundation Models Rea...
2022-12-13
Code
14
CLIP (CC-FT)
45.1
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
15
Swin-T (CLIP, CC-12M)
44.1
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
16
RN-50 (CLIP, CC-12M)
42.9
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
17
RN50 (YFCC15M)
39.83
No
CREPE: Can Vision-Language Foundation Models Rea...
2022-12-13
Code
18
CLIP (YFCC-FT)
39.8
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
19
RN101 (YFCC15M)
39.56
No
CREPE: Can Vision-Language Foundation Models Rea...
2022-12-13
Code
20
NegCLIP (YFCC-FT)
38.8
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
21
CLIP-FT (YFCC-FT)
36.4
No
Coarse-to-Fine Contrastive Learning in Image-Tex...
2023-05-23
-
22
Random
14.29
No
CREPE: Can Vision-Language Foundation Models Rea...
2022-12-13
Code