TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps in-domain

Image Captioning on nocaps in-domain

Metric: B4 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕B4▼Extra DataPaperDate↕Code
1GIT, Single Model41.65NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2PaLI41.16NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
3GIT2, Single Model41.1NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain39.24No---
5Microsoft Cognitive Services team37.97NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6FudanFVL34.8No---
7Single Model34.66NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
8firethehole34.11No---
9FudanWYZ33.59No---
10MD33.15No---
11IEDA-LAB32.86No---
12vll@mk51432.76No---
13ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS31.24No---
14VinVL (Microsoft Cognitive Services + MSR)30.62NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
15camel XE29.59No---
16icgp2ssi1_coco_si_0.02_5_test27.23No---
17RCAL27.09No---
18evertyhing26.07No---
19MQ-UpDown-C25.94No---
20Oscar25.78No---
21cxy_nocaps_training25.15No---
22作者给的test文件25.15No---
23Xinyi24.82No---
24UpDown24.57No---
25nocaps_training24.57No---
26B223.8No---
27UpDown + ELMo + CBS22.83No---
28YX21.96No---
29area_attention21.92No---
307_10-7_40000_predict_test.json21.91No---
31Human21.49No---
32None20.84No---
33coco_all_1919.45No---
34Neural Baby Talk17.39No---
35Yu-Wu16.71No---
36Neural Baby Talk + CBS15.14No---
37CS395T14.54No---