TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps out-of-domain

Image Captioning on nocaps out-of-domain

Metric: B2 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕B2▼Extra DataPaperDate↕Code
1GIT, Single Model71.28NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2PaLI71.19NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
3GIT2, Single Model71.15NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain70.24No---
5Microsoft Cognitive Services team65.48NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6FudanFVL64.71No---
7Single Model64.21NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
8FudanWYZ62.7No---
9IEDA-LAB61.01No---
10firethehole60.06No---
11MD57.39No---
12ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS57.25No---
13vll@mk51456.87No---
14icgp2ssi1_coco_si_0.02_5_test56.39No---
15evertyhing56.14No---
16VinVL (Microsoft Cognitive Services + MSR)56.1NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
17Human53.9No---
18Oscar53.26No---
19vinvl_yuan_cbs52.76No---
20RCAL52.01No---
21UpDown-C51.36No---
22cxy_nocaps_training50.81No---
23camel XE50.32No---
24Xinyi49.99No---
25UpDown + ELMo + CBS48.58No---
267_10-7_40000_predict_test.json44.7No---
27nocaps_training44.28No---
28UpDown44.28No---
29B244.27No---
30Neural Baby Talk + CBS43.2No---
31Neural Baby Talk42.8No---
32YX42.47No---
33area_attention41.56No---
34CS395T39.71No---
35coco_all_1938.55No---
36Yu-Wu38.3No---
37Check22.24No---