TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps near-domain

Image Captioning on nocaps near-domain

Metric: ROUGE-L (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕ROUGE-L▼Extra DataPaperDate↕Code
1PaLI63.99NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
2GIT2, Single Model63.66NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3GIT, Single Model63.5NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain62.91No---
5Microsoft Cognitive Services team61.9NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6Single Model60.46NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
7FudanFVL60.34No---
8FudanWYZ59.8No---
9IEDA-LAB59.23No---
10firethehole58.83No---
11MD58.47No---
12vll@mk51458.22No---
13VinVL (Microsoft Cognitive Services + MSR)57.95NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
14ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS57.34No---
15icgp2ssi1_coco_si_0.02_5_test55.63No---
16evertyhing55.37No---
17camel XE55.24No---
18Oscar54.78No---
19RCAL54.62No---
20vinvl_yuan_cbs54.52No---
21cxy_nocaps_training53.37No---
22Xinyi53.18No---
23MQ-UpDown-C53.15No---
24Human53.06No---
25UpDown + ELMo + CBS52.64No---
26nocaps_training51.84No---
27UpDown51.84No---
287_10-7_40000_predict_test.json51.23No---
29B250.77No---
30None50.53No---
31YX50No---
32area_attention49.79No---
33Neural Baby Talk49.63No---
34Neural Baby Talk + CBS49.45No---
35coco_all_1948.61No---
36Yu-Wu47.13No---
37CS395T47.04No---