TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps in-domain

Image Captioning on nocaps in-domain

Metric: ROUGE-L (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕ROUGE-L▼Extra DataPaperDate↕Code
1PaLI64.39NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
2GIT, Single Model64.02NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3GIT2, Single Model63.82NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain63.12No---
5Microsoft Cognitive Services team62.48NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6Single Model61.01NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
7FudanFVL60.52No---
8IEDA-LAB60.07No---
9vll@mk51459.75No---
10FudanWYZ59.67No---
11MD59.67No---
12firethehole59.54No---
13ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS58.62No---
14VinVL (Microsoft Cognitive Services + MSR)58.54NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
15camel XE56.84No---
16RCAL56.76No---
17icgp2ssi1_coco_si_0.02_5_test56.4No---
18Oscar55.91No---
19evertyhing55.88No---
20MQ-UpDown-C55.25No---
21cxy_nocaps_training55.06No---
22作者给的test文件55.06No---
23Xinyi55.03No---
24UpDown54.42No---
25nocaps_training54.42No---
26UpDown + ELMo + CBS53.98No---
27B253.49No---
28Human53.47No---
29YX53.22No---
30area_attention52.53No---
317_10-7_40000_predict_test.json52.44No---
32None52.26No---
33Neural Baby Talk51.42No---
34Neural Baby Talk + CBS50.84No---
35coco_all_1950.53No---
36Yu-Wu49.64No---
37CS395T49.05No---