TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps in-domain

Image Captioning on nocaps in-domain

Metric: B2 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕B2▼Extra DataPaperDate↕Code
1GIT, Single Model76.1NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2GIT2, Single Model75.86NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3PaLI75.21NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
4CoCa - Google Brain74.29No---
5Microsoft Cognitive Services team72.83NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6Single Model70NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
7IEDA-LAB69.8No---
8FudanFVL69.57No---
9MD69.12No---
10vll@mk51468.7No---
11ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS68.09No---
12VinVL (Microsoft Cognitive Services + MSR)68.04NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
13FudanWYZ68.02No---
14firethehole67.2No---
15RCAL64.7No---
16camel XE64.48No---
17icgp2ssi1_coco_si_0.02_5_test63.94No---
18cxy_nocaps_training63.79No---
19作者给的test文件63.79No---
20Xinyi63.74No---
21Oscar63.27No---
22evertyhing63.09No---
23MQ-UpDown-C61.63No---
24UpDown60.34No---
25nocaps_training60.34No---
26B259.97No---
27UpDown + ELMo + CBS59.58No---
28YX58.76No---
29area_attention57.98No---
30Human57.3No---
317_10-7_40000_predict_test.json56.79No---
32Neural Baby Talk56.78No---
33Neural Baby Talk + CBS56.2No---
34None55.97No---
35coco_all_1953.52No---
36Yu-Wu52.89No---
37CS395T51.88No---