TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps in-domain

Image Captioning on nocaps in-domain

Metric: B1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕B1▼Extra DataPaperDate↕Code
1GIT2, Single Model88.86NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2GIT, Single Model88.55NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3PaLI88.02NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
4CoCa - Google Brain87.27No---
5Microsoft Cognitive Services team86.33NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6Single Model84.64NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
7IEDA-LAB84.4No---
8FudanFVL84.2No---
9MD84.03No---
10vll@mk51483.77No---
11VinVL (Microsoft Cognitive Services + MSR)83.24NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
12FudanWYZ82.91No---
13ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS82.9No---
14firethehole81.86No---
15cxy_nocaps_training81.64No---
16作者给的test文件81.64No---
17Xinyi81.61No---
18Oscar80.7No---
19RCAL80.68No---
20camel XE80.5No---
21icgp2ssi1_coco_si_0.02_5_test80.26No---
22evertyhing79.58No---
23MQ-UpDown-C78.73No---
24UpDown77.68No---
25nocaps_training77.68No---
26UpDown + ELMo + CBS77.65No---
27B277.06No---
28Human76.89No---
29Neural Baby Talk + CBS76.49No---
30YX76.48No---
31area_attention76.12No---
32Neural Baby Talk75.91No---
337_10-7_40000_predict_test.json75.31No---
34None74.35No---
35coco_all_1972.76No---
36CS395T72.24No---
37Yu-Wu72.05No---