TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps entire

Image Captioning on nocaps entire

Metric: METEOR (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕METEOR▼Extra DataPaperDate↕Code
1GIT, Single Model32.5NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
2CoCa - Google Brain32.29No---
3Microsoft Cognitive Services team31.27NoScaling Up Vision-Language Pre-training for Imag...2021-11-24-
4Prismer31.13NoPrismer: A Vision-Language Model with Multi-Task...2023-03-04Code
5FudanFVL30.64No---
6Single Model30.55NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
7FudanWYZ30.32No---
8firethehole30.07No---
9IEDA-LAB28.92No---
10vll@mk51428.46No---
11Human28.15No---
12MD28.09No---
13VinVL (Microsoft Cognitive Services + MSR)27.57NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
14ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS27.36No---
15evertyhing26.31No---
16icgp2ssi1_coco_si_0.02_5_test26.29No---
17camel XE26.15No---
18RCAL25.72No---
19vinvl_yuan_cbs25.44No---
20Oscar25.33No---
21MQ-UpDown-C25.18No---
22cxy_nocaps_training25.13No---
23Xinyi25.12No---
24UpDown + ELMo + CBS24.42No---
257_10-7_40000_predict_test.json23.89No---
26nocaps_training22.96No---
27UpDown22.96No---
28None22.53No---
29Neural Baby Talk + CBS22.06No---
30area_attention21.87No---
31B221.85No---
32YX21.72No---
33Neural Baby Talk21.52No---
34coco_all_1920.77No---
35Yu-Wu19.84No---
36CS395T19.61No---