Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Datasets/CC152K

CC152K

Conceptual Captions 152K

CC152K is a subset of Conceptual Captions. It contains 150,000 randomly selected samples from the training split for training, 1,000 samples from the validation split for validation, and 1,000 samples from the validation split for testing.

Benchmarks

Cross-Modal Information Retrieval/R-Sum Cross-Modal Information Retrieval/Image-to-text R@1 Cross-Modal Information Retrieval/Image-to-text R@5 Cross-Modal Information Retrieval/Image-to-text R@10 Cross-Modal Information Retrieval/Text-to-image R@1 Cross-Modal Information Retrieval/Text-to-image R@5 Cross-Modal Information Retrieval/Text-to-image R@10 Cross-Modal Retrieval/R-Sum Cross-Modal Retrieval/Image-to-text R@1 Cross-Modal Retrieval/Image-to-text R@5 Cross-Modal Retrieval/Image-to-text R@10 Cross-Modal Retrieval/Text-to-image R@1 Cross-Modal Retrieval/Text-to-image R@5 Cross-Modal Retrieval/Text-to-image R@10 Image Retrieval with Multi-Modal Query/R-Sum Image Retrieval with Multi-Modal Query/Image-to-text R@1 Image Retrieval with Multi-Modal Query/Image-to-text R@5 Image Retrieval with Multi-Modal Query/Image-to-text R@10 Image Retrieval with Multi-Modal Query/Text-to-image R@1 Image Retrieval with Multi-Modal Query/Text-to-image R@5 Image Retrieval with Multi-Modal Query/Text-to-image R@10

Statistics

Papers: 17
Benchmarks: 21

Links

Tasks

Cross-Modal Information Retrieval Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence Image Retrieval with Multi-Modal Query