TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps near-domain

Image Captioning on nocaps near-domain

Metric: METEOR (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕METEOR▼Extra DataPaperDate↕Code
1PaLI33.47NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
2GIT2, Single Model32.95NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3GIT, Single Model32.86NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain32.71No---
5Microsoft Cognitive Services team31.8NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6FudanFVL31.08No---
7Single Model30.97NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
8FudanWYZ30.79No---
9firethehole30.48No---
10IEDA-LAB29.53No---
11vll@mk51429.11No---
12MD28.84No---
13Human28.42No---
14VinVL (Microsoft Cognitive Services + MSR)28.24NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
15ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS27.89No---
16camel XE26.87No---
17evertyhing26.68No---
18icgp2ssi1_coco_si_0.02_5_test26.63No---
19RCAL26.3No---
20vinvl_yuan_cbs25.98No---
21Oscar25.91No---
22cxy_nocaps_training25.64No---
23Xinyi25.64No---
24MQ-UpDown-C25.59No---
25UpDown + ELMo + CBS24.97No---
267_10-7_40000_predict_test.json24.52No---
27nocaps_training23.6No---
28UpDown23.6No---
29None23.12No---
30Neural Baby Talk + CBS22.55No---
31area_attention22.43No---
32B222.41No---
33YX22.27No---
34Neural Baby Talk21.93No---
35coco_all_1921.48No---
36Yu-Wu20.18No---
37CS395T20.05No---