Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Captioning on nocaps out-of-domain

Metric: ROUGE-L (higher is better)
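
For readers unfamiliar with the metric, the following is a minimal sketch of a sentence-level ROUGE-L F-score based on the longest common subsequence (LCS) of tokens. It is not the scoring code used for this leaderboard: whitespace tokenization, lowercasing, a single reference caption, and the `beta` weighting parameter are all assumptions made for illustration, and the names `lcs_length` and `rouge_l` are hypothetical.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    # Row-by-row dynamic programming; prev[j] holds the LCS length of the
    # tokens of `a` processed so far against the first j tokens of `b`.
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference, beta=1.0):
    """ROUGE-L F-score for one candidate caption against one reference.

    Assumptions (not taken from the leaderboard): lowercase whitespace
    tokenization, a single reference, and `beta` as the recall weight.
    """
    cand = candidate.lower().split()
    ref = reference.lower().split()
    if not cand or not ref:
        return 0.0
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)
    recall = lcs / len(ref)
    return (1 + beta**2) * precision * recall / (recall + beta**2 * precision)


if __name__ == "__main__":
    score = rouge_l("a dog runs on the beach", "a dog is running on the beach")
    print(f"ROUGE-L: {100 * score:.2f}")
```

The scores in the table appear to be reported on a 0 to 100 scale, which is why the usage line multiplies by 100. Official evaluations typically score each caption against several references and may weight recall differently, so numbers from this sketch are not comparable to the leaderboard.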

Results

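| # | Model | ROUGE-L | Extra Data | Paper | Date | Code |
|---|-------|---------|------------|-------|------|------|
| 1 | PaLI | 61.35 | No | PaLI: A Jointly-Scaled Multilingual Language-Ima... | 2022-09-14 | Code |
| 2 | GIT, Single Model | 60.96 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 3 | GIT2, Single Model | 60.91 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 4 | CoCa - Google Brain | 60.57 | No | - | - | - |
| 5 | Microsoft Cognitive Services team | 57.57 | No | VIVO: Visual Vocabulary Pre-Training for Novel O... | 2020-09-28 | - |
| 6 | FudanFVL | 57.29 | No | - | - | - |
| 7 | Single Model | 56.69 | No | SimVLM: Simple Visual Language Model Pretraining... | 2021-08-24 | Code |
| 8 | FudanWYZ | 56.41 | No | - | - | - |
| 9 | firethehole | 55.08 | No | - | - | - |
| 10 | IEDA-LAB | 55 | No | - | - | - |
| 11 | ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS | 52.86 | No | - | - | - |
| 12 | MD | 52.54 | No | - | - | - |
| 13 | vll@mk514 | 52.51 | No | - | - | - |
| 14 | VinVL (Microsoft Cognitive Services + MSR) | 51.99 | No | VinVL: Revisiting Visual Representations in Visi... | 2021-01-02 | Code |
| 15 | icgp2ssi1_coco_si_0.02_5_test | 51.75 | No | - | - | - |
| 16 | evertyhing | 51.54 | No | - | - | - |
| 17 | Human | 51.5 | No | - | - | - |
| 18 | Oscar | 50 | No | - | - | - |
| 19 | vinvl_yuan_cbs | 49.5 | No | - | - | - |
| 20 | camel XE | 48.85 | No | - | - | - |
| 21 | RCAL | 48.81 | No | - | - | - |
| 22 | UpDown-C | 48.6 | No | - | - | - |
| 23 | cxy_nocaps_training | 47.53 | No | - | - | - |
| 24 | Xinyi | 47.23 | No | - | - | - |
| 25 | UpDown + ELMo + CBS | 47.13 | No | - | - | - |
| 26 | 7_10-7_40000_predict_test.json | 45.72 | No | - | - | - |
| 27 | nocaps_training | 44.84 | No | - | - | - |
| 28 | UpDown | 44.84 | No | - | - | - |
| 29 | Neural Baby Talk + CBS | 44.47 | No | - | - | - |
| 30 | B2 | 44.37 | No | - | - | - |
| 31 | YX | 44.23 | No | - | - | - |
| 32 | Neural Baby Talk | 44.11 | No | - | - | - |
| 33 | area_attention | 43.59 | No | - | - | - |
| 34 | CS395T | 43.02 | No | - | - | - |
| 35 | Yu-Wu | 42.46 | No | - | - | - |
| 36 | coco_all_19 | 41.58 | No | - | - | - |
| 37 | Check | 31.57 | No | - | - | - |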