TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps-XD entire

Image Captioning on nocaps-XD entire

Metric: B2 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕B2▼Extra DataPaperDate↕Code
1GIT275.02NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2GIT74.81NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3Microsoft Cognitive Services team71.36YesScaling Up Vision-Language Pre-training for Imag...2021-11-24-
4VLAF267.96No---
5Microsoft Cognitive Services team66.04NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6icp2ssi1_coco_si_0.02_5_test61.54No---
7test_cbs260.29No---
8UpDown + ELMo + CBS56.74No---
9Human56.46No---
10UpDown55.11No---
11Neural Baby Talk52.42No---
12Neural Baby Talk + CBS52.12No---