Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Captioning on nocaps out-of-domain

Metric: B4 (BLEU-4; higher is better)
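
B4 on captioning leaderboards conventionally denotes BLEU-4: the geometric mean of clipped 1- to 4-gram precisions, multiplied by a brevity penalty. The following is a minimal, unsmoothed corpus-level sketch on a 0-100 scale, assuming pre-tokenized captions. The function names `ngrams` and `bleu4` are illustrative and not part of any benchmark toolkit, and the leaderboard scores come from the benchmark's own evaluation pipeline, whose tokenization and other details this sketch does not attempt to reproduce.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Count the n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def bleu4(candidates, references_list):
    """Corpus-level BLEU-4 on a 0-100 scale, without smoothing.

    candidates: list of token lists, one per image.
    references_list: list (one entry per image) of lists of reference token lists.
    Hypothetical helper for illustration only.
    """
    matched = [0] * 4   # clipped n-gram matches for n = 1..4
    total = [0] * 4     # candidate n-gram counts for n = 1..4
    cand_len = 0
    ref_len = 0
    for cand, refs in zip(candidates, references_list):
        cand_len += len(cand)
        # Use the reference length closest to the candidate length
        # (ties go to the shorter reference).
        ref_len += min((abs(len(r) - len(cand)), len(r)) for r in refs)[1]
        for n in range(1, 5):
            cand_counts = ngrams(cand, n)
            # Maximum count of each n-gram over all references, used for clipping.
            max_ref = Counter()
            for r in refs:
                for gram, count in ngrams(r, n).items():
                    max_ref[gram] = max(max_ref[gram], count)
            matched[n - 1] += sum(min(c, max_ref[g]) for g, c in cand_counts.items())
            total[n - 1] += sum(cand_counts.values())
    # Without smoothing, any zero precision makes the geometric mean zero.
    if cand_len == 0 or min(matched) == 0:
        return 0.0
    log_precision = sum(math.log(m / t) for m, t in zip(matched, total)) / 4
    brevity_penalty = 1.0 if cand_len > ref_len else math.exp(1 - ref_len / cand_len)
    return 100 * brevity_penalty * math.exp(log_precision)


if __name__ == "__main__":
    cands = ["the cat sat on the mat".split()]
    refs = [["the cat sat on a mat".split()]]
    print(round(bleu4(cands, refs), 2))
```

Because the score is a geometric mean, a single missing 4-gram match across the whole corpus drives it to zero, which is why the metric is normally computed over the full test set rather than per caption.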

Results

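| # | Model | B4 | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | PaLI | 32 | No | PaLI: A Jointly-Scaled Multilingual Language-Ima... | 2022-09-14 | Code |
| 2 | CoCa - Google Brain | 31.89 | No | - | - | - |
| 3 | GIT2, Single Model | 30.15 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 4 | GIT, Single Model | 30.04 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 5 | Microsoft Cognitive Services team | 25.78 | No | VIVO: Visual Vocabulary Pre-Training for Novel O... | 2020-09-28 | - |
| 6 | FudanFVL | 25.31 | No | - | - | - |
| 7 | FudanWYZ | 24.57 | No | - | - | - |
| 8 | Single Model | 24.47 | No | SimVLM: Simple Visual Language Model Pretraining... | 2021-08-24 | Code |
| 9 | firethehole | 22.66 | No | - | - | - |
| 10 | IEDA-LAB | 20.64 | No | - | - | - |
| 11 | icgp2ssi1_coco_si_0.02_5_test | 17.96 | No | - | - | - |
| 12 | MD | 17.85 | No | - | - | - |
| 13 | ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS | 17.68 | No | - | - | - |
| 14 | vll@mk514 | 16.92 | No | - | - | - |
| 15 | evertyhing | 16.69 | No | - | - | - |
| 16 | Human | 16.6 | No | - | - | - |
| 17 | VinVL (Microsoft Cognitive Services + MSR) | 15.86 | No | VinVL: Revisiting Visual Representations in Visi... | 2021-01-02 | Code |
| 18 | camel XE | 12.99 | No | - | - | - |
| 19 | Oscar | 12.42 | No | - | - | - |
| 20 | UpDown-C | 11.99 | No | - | - | - |
| 21 | RCAL | 11.94 | No | - | - | - |
| 22 | vinvl_yuan_cbs | 11.69 | No | - | - | - |
| 23 | cxy_nocaps_training | 10.98 | No | - | - | - |
| 24 | Xinyi | 10.57 | No | - | - | - |
| 25 | nocaps_training | 10.17 | No | - | - | - |
| 26 | UpDown | 10.17 | No | - | - | - |
| 27 | 7_10-7_40000_predict_test.json | 10.14 | No | - | - | - |
| 28 | UpDown + ELMo + CBS | 9.68 | No | - | - | - |
| 29 | B2 | 9.46 | No | - | - | - |
| 30 | area_attention | 8.72 | No | - | - | - |
| 31 | YX | 8.54 | No | - | - | - |
| 32 | CS395T | 8.2 | No | - | - | - |
| 33 | Neural Baby Talk | 7.92 | No | - | - | - |
| 34 | coco_all_19 | 7.55 | No | - | - | - |
| 35 | Neural Baby Talk + CBS | 7.5 | No | - | - | - |
| 36 | Yu-Wu | 6.11 | No | - | - | - |
| 37 | Check | 1.83 | No | - | - | - |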