Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Captioning on nocaps in-domain

Metric: CIDEr (higher is better)

Results

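| # | Model | CIDEr | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | PaLI | 149.1 | No | PaLI: A Jointly-Scaled Multilingual Language-Ima... | 2022-09-14 | Code |
| 2 | GIT2, Single Model | 124.18 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 3 | GIT, Single Model | 122.4 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 4 | PaLI | 121.09 | No | PaLI: A Jointly-Scaled Multilingual Language-Ima... | 2022-09-14 | Code |
| 5 | CoCa - Google Brain | 117.9 | No | - | - | - |
| 6 | Microsoft Cognitive Services team | 112.82 | No | VIVO: Visual Vocabulary Pre-Training for Novel O... | 2020-09-28 | - |
| 7 | Single Model | 108.98 | No | SimVLM: Simple Visual Language Model Pretraining... | 2021-08-24 | Code |
| 8 | GRIT (zero-shot, no VL pretraining, no CBS) | 105.9 | No | GRIT: Faster and Better Image captioning Transfo... | 2022-07-20 | Code |
| 9 | FudanFVL | 104.9 | No | - | - | - |
| 10 | FudanWYZ | 104.25 | No | - | - | - |
| 11 | IEDA-LAB | 102.64 | No | - | - | - |
| 12 | vll@mk514 | 101.69 | No | - | - | - |
| 13 | MD | 100.03 | No | - | - | - |
| 14 | firethehole | 99.9 | No | - | - | - |
| 15 | VinVL (Microsoft Cognitive Services + MSR) | 97.99 | No | VinVL: Revisiting Visual Representations in Visi... | 2021-01-02 | Code |
| 16 | ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS | 96.63 | No | - | - | - |
| 17 | camel XE | 88.08 | No | - | - | - |
| 18 | evertyhing | 87.86 | No | - | - | - |
| 19 | RCAL | 87.28 | No | - | - | - |
| 20 | icgp2ssi1_coco_si_0.02_5_test | 87.21 | No | - | - | - |
| 21 | cxy_nocaps_training | 85.81 | No | - | - | - |
| 22 | 作者给的test文件 | 85.81 | No | - | - | - |
| 23 | ClipCap (Transformer) | 84.85 | No | ClipCap: CLIP Prefix for Image Captioning | 2021-11-18 | Code |
| 24 | Oscar | 84.83 | No | - | - | - |
| 25 | Xinyi | 84.79 | No | - | - | - |
| 26 | Human | 80.61 | No | - | - | - |
| 27 | MQ-UpDown-C | 80.19 | No | - | - | - |
| 28 | ClipCap (MLP + GPT2 tuning) | 79.73 | No | ClipCap: CLIP Prefix for Image Captioning | 2021-11-18 | Code |
| 29 | UpDown + ELMo + CBS | 76.02 | No | - | - | - |
| 30 | UpDown | 74.27 | No | - | - | - |
| 31 | nocaps_training | 74.27 | No | - | - | - |
| 32 | 7_10-7_40000_predict_test.json | 73.73 | No | - | - | - |
| 33 | None | 70.33 | No | - | - | - |
| 34 | YX | 69.59 | No | - | - | - |
| 35 | B2 | 68.98 | No | - | - | - |
| 36 | area_attention | 67.91 | No | - | - | - |
| 37 | coco_all_19 | 64.37 | No | - | - | - |
| 38 | Neural Baby Talk + CBS | 62.96 | No | - | - | - |
| 39 | Neural Baby Talk | 60.89 | No | - | - | - |
| 40 | CS395T | 58.93 | No | - | - | - |
| 41 | Yu-Wu | 53.34 | No | - | - | - |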