Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

CC152K

Conceptual Captions 152K

CC152K is a subset of Conceptual Captions with 152,000 image-caption pairs in total. The training set consists of 150,000 samples randomly selected from the Conceptual Captions training split. The validation and test sets each consist of 1,000 samples taken from the Conceptual Captions validation split.
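The split sizes described above can be sketched as a small sampling routine. This is only an illustration, not the procedure used to build the released dataset: the function name `make_cc152k_style_split`, the `seed` parameter, and the use of integer IDs are hypothetical, and the sketch assumes the validation and test samples are disjoint, which the description does not state.

```python
import random


def make_cc152k_style_split(train_ids, val_ids,
                            n_train=150_000, n_val=1_000, n_test=1_000,
                            seed=0):
    """Draw a CC152K-style split from lists of sample IDs.

    Hypothetical helper: the names, the seed, and the disjointness of
    the validation and test sets are assumptions, not documented facts.
    """
    rng = random.Random(seed)
    # Training samples come from the original training split.
    train = rng.sample(train_ids, n_train)
    # Validation and test samples both come from the original validation
    # split; sampling them in one draw keeps the two sets disjoint.
    held_out = rng.sample(val_ids, n_val + n_test)
    return {
        "train": train,
        "val": held_out[:n_val],
        "test": held_out[n_val:],
    }
```

Calling it with smaller sizes, for example `make_cc152k_style_split(list(range(1000)), list(range(1000, 1100)), n_train=150, n_val=10, n_test=10)`, returns three lists with the requested lengths.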

Benchmarks

Cross-Modal Information Retrieval: R-Sum, Image-to-text R@1, Image-to-text R@5, Image-to-text R@10, Text-to-image R@1, Text-to-image R@5, Text-to-image R@10

Cross-Modal Retrieval: R-Sum, Image-to-text R@1, Image-to-text R@5, Image-to-text R@10, Text-to-image R@1, Text-to-image R@5, Text-to-image R@10

Image Retrieval with Multi-Modal Query: R-Sum, Image-to-text R@1, Image-to-text R@5, Image-to-text R@10, Text-to-image R@1, Text-to-image R@5, Text-to-image R@10
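The benchmark metrics are variants of Recall@K (R@K) in both retrieval directions, plus R-Sum, which is conventionally the sum of the six recall values. A minimal sketch of how they could be computed follows. It assumes a square similarity matrix `sim` in which `sim[i, j]` scores image `i` against caption `j` and the correct match for image `i` is caption `i` (one caption per image). The function names are hypothetical, and individual leaderboard entries may use different evaluation code.

```python
import numpy as np


def recall_at_k(sim, ks=(1, 5, 10)):
    """Recall@K in percent for row queries against column candidates.

    Assumes the ground-truth candidate for query i is candidate i.
    """
    sim = np.asarray(sim, dtype=float)
    # Score of the correct candidate for each query, as a column vector.
    gt = np.diag(sim)[:, None]
    # Zero-based rank: number of candidates scoring strictly higher than
    # the correct one. Ties are resolved in favor of the correct item.
    ranks = (sim > gt).sum(axis=1)
    return {k: 100.0 * float(np.mean(ranks < k)) for k in ks}


def r_sum(sim):
    """Sum of image-to-text and text-to-image R@1, R@5, and R@10."""
    sim = np.asarray(sim, dtype=float)
    i2t = recall_at_k(sim)      # images as queries, captions as candidates
    t2i = recall_at_k(sim.T)    # captions as queries, images as candidates
    return sum(i2t.values()) + sum(t2i.values())
```

With this definition, R-Sum has a maximum of 600, reached when every query ranks its correct match first.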

Statistics

Papers: 17
Benchmarks: 21

Tasks

Cross-Modal Information Retrieval
Cross-Modal Retrieval
Cross-modal retrieval with noisy correspondence
Image Retrieval with Multi-Modal Query