TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps in-domain

Image Captioning on nocaps in-domain

Metric: METEOR (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕METEOR▼Extra DataPaperDate↕Code
1PaLI34.22NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
2GIT2, Single Model33.83NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3GIT, Single Model33.41NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain33.01No---
5Microsoft Cognitive Services team32.7NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6Single Model31.97NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
7FudanFVL31.77No---
8firethehole31.61No---
9FudanWYZ31.33No---
10vll@mk51430.51No---
11IEDA-LAB30.43No---
12MD30.06No---
13VinVL (Microsoft Cognitive Services + MSR)29.51NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
14ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS29.37No---
15camel XE28.7No---
16Human28.53No---
17evertyhing27.97No---
18RCAL27.7No---
19icgp2ssi1_coco_si_0.02_5_test27.7No---
20Xinyi27.27No---
21cxy_nocaps_training27.25No---
22作者给的test文件27.25No---
23MQ-UpDown-C27.25No---
24Oscar27.23No---
25UpDown + ELMo + CBS26.35No---
26UpDown26.04No---
27nocaps_training26.04No---
287_10-7_40000_predict_test.json26.02No---
29None25.1No---
30YX25.08No---
31area_attention25.07No---
32B225.06No---
33Neural Baby Talk23.8No---
34Neural Baby Talk + CBS23.68No---
35coco_all_1923.47No---
36CS395T22.04No---
37Yu-Wu22.04No---