TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps near-domain

Image Captioning on nocaps near-domain

Metric: SPICE (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕SPICE▼Extra DataPaperDate↕Code
1GIT2, Single Model16.11NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2GIT, Single Model15.96NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3PaLI15.75NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
4PaLI15.75NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
5CoCa - Google Brain15.54No---
6Microsoft Cognitive Services team15.06NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
7firethehole14.88No---
8FudanFVL14.79No---
9Human14.72No---
10FudanWYZ14.71No---
11Single Model14.61NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
12vll@mk51414.37No---
13IEDA-LAB14.15No---
14MD13.64No---
15VinVL (Microsoft Cognitive Services + MSR)13.36NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
16ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS12.98No---
17RCAL12.47No---
18evertyhing12.24No---
19camel XE12.14No---
20vinvl_yuan_cbs12.12No---
21icgp2ssi1_coco_si_0.02_5_test12.11No---
22Xinyi11.88No---
23MQ-UpDown-C11.87No---
24cxy_nocaps_training11.81No---
25Oscar11.53No---
26UpDown + ELMo + CBS11.45No---
27ClipCap (MLP + GPT2 tuning)11.26NoClipCap: CLIP Prefix for Image Captioning2021-11-18Code
287_10-7_40000_predict_test.json11.14No---
29ClipCap (Transformer)10.92NoClipCap: CLIP Prefix for Image Captioning2021-11-18Code
30nocaps_training10.33No---
31UpDown10.33No---
32None10.28No---
33Neural Baby Talk + CBS9.83No---
34YX9.7No---
35area_attention9.7No---
36B29.54No---
37coco_all_199.28No---
38Neural Baby Talk9.26No---
39Yu-Wu8.37No---
40CS395T8.28No---