TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps-XD entire

Image Captioning on nocaps-XD entire

Metric: CIDEr (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕CIDEr▼Extra DataPaperDate↕Code
1GIT2124.77NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2GIT123.39NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3Microsoft Cognitive Services team114.25YesScaling Up Vision-Language Pre-training for Imag...2021-11-24-
4VLAF2102.39No---
5Microsoft Cognitive Services team100.12NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6Human85.34No---
7icp2ssi1_coco_si_0.02_5_test85.3No---
8test_cbs285.02No---
9UpDown + ELMo + CBS73.09No---
10Neural Baby Talk + CBS61.48No---
11UpDown54.25No---
12Neural Baby Talk53.36No---