TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps out-of-domain

Image Captioning on nocaps out-of-domain

Metric: B1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕B1▼Extra DataPaperDate↕Code
1PaLI86.28NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
2GIT2, Single Model86.28NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3GIT, Single Model85.99NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain84.75No---
5Microsoft Cognitive Services team81.73NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6FudanFVL81.44No---
7Single Model80.89NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
8FudanWYZ80No---
9IEDA-LAB79.52No---
10MD76.81No---
11firethehole76.65No---
12vll@mk51476.41No---
13ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS76.2No---
14VinVL (Microsoft Cognitive Services + MSR)75.78NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
15icgp2ssi1_coco_si_0.02_5_test75.71No---
16evertyhing75.5No---
17Oscar74.98No---
18Human74.84No---
19vinvl_yuan_cbs73.95No---
20cxy_nocaps_training73.07No---
21UpDown-C72.94No---
22Xinyi72.53No---
23RCAL72.47No---
24UpDown + ELMo + CBS71.57No---
25camel XE71.34No---
26nocaps_training66.54No---
27UpDown66.54No---
28YX66.44No---
29B266.32No---
307_10-7_40000_predict_test.json66.14No---
31Neural Baby Talk + CBS65.98No---
32area_attention64.58No---
33Neural Baby Talk64.45No---
34CS395T63No---
35coco_all_1961.62No---
36Yu-Wu60.95No---
37Check47.08No---