TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps near-domain

Image Captioning on nocaps near-domain

Metric: B1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕B1▼Extra DataPaperDate↕Code
1GIT2, Single Model88.9NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2PaLI88.57NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
3GIT, Single Model88.56NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain87.53No---
5Microsoft Cognitive Services team86.48NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6FudanFVL84.47No---
7Single Model84.36NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
8IEDA-LAB84.04No---
9FudanWYZ83.71No---
10MD83.58No---
11VinVL (Microsoft Cognitive Services + MSR)82.77NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
12vll@mk51482.55No---
13ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS81.93No---
14firethehole81.62No---
15Oscar80.54No---
16vinvl_yuan_cbs80.24No---
17cxy_nocaps_training79.69No---
18evertyhing79.67No---
19icgp2ssi1_coco_si_0.02_5_test79.61No---
20Xinyi79.59No---
21RCAL79.21No---
22camel XE79.21No---
23MQ-UpDown-C77.76No---
24UpDown + ELMo + CBS77.68No---
25Human77.05No---
26nocaps_training75.25No---
27UpDown75.25No---
28Neural Baby Talk + CBS74.77No---
29B274.07No---
30YX73.73No---
31Neural Baby Talk73.69No---
327_10-7_40000_predict_test.json73.6No---
33area_attention73.19No---
34None72.91No---
35coco_all_1970.84No---
36CS395T70.05No---
37Yu-Wu68.86No---