TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/Image Retrieval/CREPE (Compositional REPresentation Evaluation)

Image Retrieval on CREPE (Compositional REPresentation Evaluation)

Metric: Recall@1 (HN-Comp, UC) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Recall@1 (HN-Comp, UC)▼Extra DataPaperDate↕Code
1RN-50 (MosaiCLIP, CC-12M)92.6NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
2Swin-T (MosaiCLIP, CC-12M)92.1NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
3RN-50 (NegCLIP, CC-12M)82NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
4Swin-T (NegCLIP, CC-12M)80.3NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
5MosaiCLIP (CC-FT)72.4NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
6ViT-L-14 (LAION400M)60.78NoCREPE: Can Vision-Language Foundation Models Rea...2022-12-13Code
7ViT-B-16+240 (LAION400M)60.19NoCREPE: Can Vision-Language Foundation Models Rea...2022-12-13Code
8ViT-B-16 (LAION400M)59NoCREPE: Can Vision-Language Foundation Models Rea...2022-12-13Code
9ViT-B-32 (LAION400M)54.8NoCREPE: Can Vision-Language Foundation Models Rea...2022-12-13Code
10NegCLIP (CC-FT)53.1NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
11MosaiCLIP (YFCC-FT)48.8NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
12CLIP-FT (CC-FT)45.8NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
13RN50 (CC12M)45.27NoCREPE: Can Vision-Language Foundation Models Rea...2022-12-13Code
14CLIP (CC-FT)45.1NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
15Swin-T (CLIP, CC-12M)44.1NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
16RN-50 (CLIP, CC-12M)42.9NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
17RN50 (YFCC15M)39.83NoCREPE: Can Vision-Language Foundation Models Rea...2022-12-13Code
18CLIP (YFCC-FT)39.8NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
19RN101 (YFCC15M)39.56NoCREPE: Can Vision-Language Foundation Models Rea...2022-12-13Code
20NegCLIP (YFCC-FT)38.8NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
21CLIP-FT (YFCC-FT)36.4NoCoarse-to-Fine Contrastive Learning in Image-Tex...2023-05-23-
22Random14.29NoCREPE: Can Vision-Language Foundation Models Rea...2022-12-13Code