TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps near-domain

Image Captioning on nocaps near-domain

Metric: B4 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕B4▼Extra DataPaperDate↕Code
1PaLI39.98NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
2GIT2, Single Model38.95NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3CoCa - Google Brain38.92No---
4GIT, Single Model38.44NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
5Microsoft Cognitive Services team36.31NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6Single Model33.74NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
7FudanFVL33.46No---
8FudanWYZ32.72No---
9firethehole31.42No---
10IEDA-LAB30.78No---
11MD29.96No---
12vll@mk51429No---
13VinVL (Microsoft Cognitive Services + MSR)27.97NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
14ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS27.94No---
15icgp2ssi1_coco_si_0.02_5_test25.85No---
16camel XE25.06No---
17evertyhing24.8No---
18RCAL22.56No---
19Oscar22.37No---
20vinvl_yuan_cbs21.53No---
21MQ-UpDown-C21No---
22cxy_nocaps_training20.97No---
23Xinyi20.72No---
24nocaps_training20.49No---
25UpDown20.49No---
26Human19.85No---
27UpDown + ELMo + CBS19.85No---
287_10-7_40000_predict_test.json18.95No---
29B218.79No---
30None18.04No---
31area_attention17.49No---
32YX17.28No---
33coco_all_1916.14No---
34Neural Baby Talk15.99No---
35Neural Baby Talk + CBS13.85No---
36Yu-Wu12.6No---
37CS395T12.11No---