Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Captioning on nocaps out-of-domain

Metric: B3 (higher is better)
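
The page does not define the metric. Assuming B3 denotes BLEU-3 (the geometric mean of clipped 1- to 3-gram precisions, scaled by a brevity penalty), the sketch below shows roughly how such a score is computed for a single caption. The function names `ngrams` and `bleu3` are illustrative, the whitespace tokenization is a simplification, and leaderboard numbers are normally corpus-level scores from the benchmark's own evaluation server, so this will not reproduce the values in the table.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu3(candidate, references):
    """Sentence-level BLEU-3 on a 0-100 scale (assumed meaning of "B3").

    Hypothetical helper for illustration: no smoothing, whitespace tokens.
    """
    cand = candidate.split()
    refs = [r.split() for r in references]
    if not cand:
        return 0.0

    log_precisions = []
    for n in (1, 2, 3):
        cand_counts = ngrams(cand, n)
        total = sum(cand_counts.values())
        if total == 0:
            return 0.0  # candidate is shorter than n tokens

        # For each n-gram, the largest count seen in any single reference.
        max_ref = Counter()
        for ref in refs:
            for gram, count in ngrams(ref, n).items():
                max_ref[gram] = max(max_ref[gram], count)

        # Clip candidate counts so repeated n-grams are not over-rewarded.
        clipped = sum(min(count, max_ref[gram])
                      for gram, count in cand_counts.items())
        if clipped == 0:
            return 0.0  # no overlap at this order; unsmoothed BLEU is zero
        log_precisions.append(math.log(clipped / total))

    # Brevity penalty uses the reference length closest to the candidate's
    # (ties go to the shorter reference).
    ref_len = min((len(r) for r in refs),
                  key=lambda length: (abs(length - len(cand)), length))
    bp = 1.0 if len(cand) > ref_len else math.exp(1 - ref_len / len(cand))

    return 100 * bp * math.exp(sum(log_precisions) / 3)
```

Calling `bleu3("a dog runs", ["a dog runs fast"])` matches every n-gram but is penalized for being shorter than the reference, which is why a higher score indicates captions that are both accurate and complete.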

Results

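| # | Model | B3 | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | GIT, Single Model | 52.66 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 2 | PaLI | 52.63 | No | PaLI: A Jointly-Scaled Multilingual Language-Ima... | 2022-09-14 | Code |
| 3 | GIT2, Single Model | 52.36 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 4 | CoCa - Google Brain | 52.13 | No | - | - | - |
| 5 | Microsoft Cognitive Services team | 45.58 | No | VIVO: Visual Vocabulary Pre-Training for Novel O... | 2020-09-28 | - |
| 6 | FudanFVL | 45.26 | No | - | - | - |
| 7 | Single Model | 44.38 | No | SimVLM: Simple Visual Language Model Pretraining... | 2021-08-24 | Code |
| 8 | FudanWYZ | 43.58 | No | - | - | - |
| 9 | firethehole | 41.58 | No | - | - | - |
| 10 | IEDA-LAB | 40.14 | No | - | - | - |
| 11 | ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS | 36.37 | No | - | - | - |
| 12 | MD | 36.13 | No | - | - | - |
| 13 | vll@mk514 | 35.99 | No | - | - | - |
| 14 | icgp2ssi1_coco_si_0.02_5_test | 35.94 | No | - | - | - |
| 15 | evertyhing | 34.53 | No | - | - | - |
| 16 | VinVL (Microsoft Cognitive Services + MSR) | 34.02 | No | VinVL: Revisiting Visual Representations in Visi... | 2021-01-02 | Code |
| 17 | Human | 33.51 | No | - | - | - |
| 18 | camel XE | 29.44 | No | - | - | - |
| 19 | vinvl_yuan_cbs | 29.34 | No | - | - | - |
| 20 | Oscar | 28.88 | No | - | - | - |
| 21 | UpDown-C | 28.32 | No | - | - | - |
| 22 | RCAL | 28.26 | No | - | - | - |
| 23 | cxy_nocaps_training | 27.58 | No | - | - | - |
| 24 | Xinyi | 27.18 | No | - | - | - |
| 25 | UpDown + ELMo + CBS | 25.77 | No | - | - | - |
| 26 | 7_10-7_40000_predict_test.json | 24.58 | No | - | - | - |
| 27 | nocaps_training | 24.23 | No | - | - | - |
| 28 | UpDown | 24.23 | No | - | - | - |
| 29 | B2 | 23.82 | No | - | - | - |
| 30 | area_attention | 21.71 | No | - | - | - |
| 31 | Neural Baby Talk | 21.48 | No | - | - | - |
| 32 | Neural Baby Talk + CBS | 21.16 | No | - | - | - |
| 33 | YX | 21.15 | No | - | - | - |
| 34 | CS395T | 19.99 | No | - | - | - |
| 35 | coco_all_19 | 18.45 | No | - | - | - |
| 36 | Yu-Wu | 17.19 | No | - | - | - |
| 37 | Check | 7.41 | No | - | - | - |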