TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps-XD entire

Image Captioning on nocaps-XD entire

Metric: SPICE (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕SPICE▼Extra DataPaperDate↕Code
1GIT216.06NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2GIT15.94NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3Microsoft Cognitive Services team14.85YesScaling Up Vision-Language Pre-training for Imag...2021-11-24-
4VLAF214.71No---
5Human14.67No---
6Microsoft Cognitive Services team14.04NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
7test_cbs212.74No---
8icp2ssi1_coco_si_0.02_5_test11.84No---
9UpDown + ELMo + CBS11.2No---
10UpDown10.14No---
11Neural Baby Talk + CBS9.69No---
12Neural Baby Talk9.15No---