Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Image Captioning on nocaps near-domain

Metric: B2 (higher is better)


Results

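| # | Model | B2 | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | GIT2, Single Model | 75.86 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 2 | PaLI | 75.56 | No | PaLI: A Jointly-Scaled Multilingual Language-Ima... | 2022-09-14 | Code |
| 3 | GIT, Single Model | 75.48 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 4 | CoCa - Google Brain | 74.49 | No | - | - | - |
| 5 | Microsoft Cognitive Services team | 72.6 | No | VIVO: Visual Vocabulary Pre-Training for Novel O... | 2020-09-28 | - |
| 6 | Single Model | 69.83 | No | SimVLM: Simple Visual Language Model Pretraining... | 2021-08-24 | Code |
| 7 | FudanFVL | 69.66 | No | - | - | - |
| 8 | IEDA-LAB | 68.58 | No | - | - | - |
| 9 | FudanWYZ | 68.56 | No | - | - | - |
| 10 | MD | 67.99 | No | - | - | - |
| 11 | VinVL (Microsoft Cognitive Services + MSR) | 66.94 | No | VinVL: Revisiting Visual Representations in Visi... | 2021-01-02 | Code |
| 12 | firethehole | 66.65 | No | - | - | - |
| 13 | vll@mk514 | 66.55 | No | - | - | - |
| 14 | ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS | 65.88 | No | - | - | - |
| 15 | icgp2ssi1_coco_si_0.02_5_test | 63.01 | No | - | - | - |
| 16 | evertyhing | 62.73 | No | - | - | - |
| 17 | Oscar | 62.32 | No | - | - | - |
| 18 | vinvl_yuan_cbs | 62.31 | No | - | - | - |
| 19 | RCAL | 62.26 | No | - | - | - |
| 20 | camel XE | 62.06 | No | - | - | - |
| 21 | cxy_nocaps_training | 60.75 | No | - | - | - |
| 22 | Xinyi | 60.52 | No | - | - | - |
| 23 | MQ-UpDown-C | 59 | No | - | - | - |
| 24 | UpDown + ELMo + CBS | 58.31 | No | - | - | - |
| 25 | Human | 56.97 | No | - | - | - |
| 26 | nocaps_training | 56.93 | No | - | - | - |
| 27 | UpDown | 56.93 | No | - | - | - |
| 28 | B2 | 55.53 | No | - | - | - |
| 29 | 7_10-7_40000_predict_test.json | 54.26 | No | - | - | - |
| 30 | Neural Baby Talk | 54.1 | No | - | - | - |
| 31 | YX | 53.98 | No | - | - | - |
| 32 | None | 53.74 | No | - | - | - |
| 33 | Neural Baby Talk + CBS | 53.67 | No | - | - | - |
| 34 | area_attention | 53.56 | No | - | - | - |
| 35 | coco_all_19 | 50.79 | No | - | - | - |
| 36 | CS395T | 48.92 | No | - | - | - |
| 37 | Yu-Wu | 48.7 | No | - | - | - |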