TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps out-of-domain

Image Captioning on nocaps out-of-domain

Metric: CIDEr (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕CIDEr▼Extra DataPaperDate↕Code
1PaLI126.67NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
2GIT2, Single Model122.27NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3GIT, Single Model122.04NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain121.69No---
5Microsoft Cognitive Services team110.14NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6Single Model109.49NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
7FudanFVL106.55No---
8FudanWYZ103.75No---
9Human91.62No---
10firethehole88.54No---
11IEDA-LAB87.51No---
12icgp2ssi1_coco_si_0.02_5_test87.15No---
13evertyhing85.18No---
14vll@mk51478.91No---
15VinVL (Microsoft Cognitive Services + MSR)78.01NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
16MD77.39No---
17RCAL75.39No---
18Oscar73.75No---
19GRIT (zero-shot, no CBS, no VL pretraining, single model)72.6NoGRIT: Faster and Better Image captioning Transfo...2022-07-20Code
20ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS72.13No---
21vinvl_yuan_cbs71.43No---
22UpDown-C70.21No---
23Xinyi68.92No---
24cxy_nocaps_training68.5No---
25UpDown + ELMo + CBS66.67No---
26Neural Baby Talk + CBS58.48No---
27camel XE54.56No---
28ClipCap (MLP + GPT2 tuning)49.35NoClipCap: CLIP Prefix for Image Captioning2021-11-18Code
29ClipCap (Transformer)49.14NoClipCap: CLIP Prefix for Image Captioning2021-11-18Code
30Neural Baby Talk48.73No---
317_10-7_40000_predict_test.json43.2No---
32Yu-Wu39.39No---
33Check36.12No---
34nocaps_training30.09No---
35UpDown30.09No---
36area_attention26.55No---
37YX26.25No---
38B225.91No---
39coco_all_1923.07No---
40CS395T21.3No---