TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps-XD entire

Image Captioning on nocaps-XD entire

Metric: ROUGE-L (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕ROUGE-L▼Extra DataPaperDate↕Code
1GIT263.19NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2GIT63.12NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3Microsoft Cognitive Services team61.2YesScaling Up Vision-Language Pre-training for Imag...2021-11-24-
4VLAF258.99No---
5Microsoft Cognitive Services team58.26NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6icp2ssi1_coco_si_0.02_5_test54.59No---
7test_cbs253.39No---
8Human52.83No---
9UpDown + ELMo + CBS51.82No---
10UpDown50.92No---
11Neural Baby Talk48.87No---
12Neural Baby Talk + CBS48.74No---