Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Captioning on nocaps in-domain

Metric: B3 (higher is better)

Results

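| # | Model | B3 | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | GIT, Single Model | 60.53 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 2 | GIT2, Single Model | 59.94 | No | GIT: A Generative Image-to-text Transformer for ... | 2022-05-27 | Code |
| 3 | PaLI | 59.38 | No | PaLI: A Jointly-Scaled Multilingual Language-Ima... | 2022-09-14 | Code |
| 4 | CoCa - Google Brain | 58.01 | No | - | - | - |
| 5 | Microsoft Cognitive Services team | 55.94 | No | VIVO: Visual Vocabulary Pre-Training for Novel O... | 2020-09-28 | - |
| 6 | Single Model | 52.96 | No | SimVLM: Simple Visual Language Model Pretraining... | 2021-08-24 | Code |
| 7 | FudanFVL | 52.56 | No | - | - | - |
| 8 | IEDA-LAB | 51.89 | No | - | - | - |
| 9 | vll@mk514 | 51.26 | No | - | - | - |
| 10 | MD | 51.16 | No | - | - | - |
| 11 | FudanWYZ | 50.75 | No | - | - | - |
| 12 | firethehole | 50.5 | No | - | - | - |
| 13 | ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS | 49.73 | No | - | - | - |
| 14 | VinVL (Microsoft Cognitive Services + MSR) | 49.68 | No | VinVL: Revisiting Visual Representations in Visi... | 2021-01-02 | Code |
| 15 | camel XE | 46.46 | No | - | - | - |
| 16 | RCAL | 45.33 | No | - | - | - |
| 17 | icgp2ssi1_coco_si_0.02_5_test | 44.65 | No | - | - | - |
| 18 | evertyhing | 43.92 | No | - | - | - |
| 19 | cxy_nocaps_training | 43.43 | No | - | - | - |
| 20 | 作者给的test文件 (test file provided by the author) | 43.43 | No | - | - | - |
| 21 | Xinyi | 43.22 | No | - | - | - |
| 22 | Oscar | 42.86 | No | - | - | - |
| 23 | MQ-UpDown-C | 42.35 | No | - | - | - |
| 24 | UpDown | 41.5 | No | - | - | - |
| 25 | nocaps_training | 41.5 | No | - | - | - |
| 26 | B2 | 40.54 | No | - | - | - |
| 27 | UpDown + ELMo + CBS | 39.86 | No | - | - | - |
| 28 | YX | 39.28 | No | - | - | - |
| 29 | area_attention | 38.44 | No | - | - | - |
| 30 | 7_10-7_40000_predict_test.json | 37.85 | No | - | - | - |
| 31 | Human | 37.78 | No | - | - | - |
| 32 | None | 36.12 | No | - | - | - |
| 33 | Neural Baby Talk | 35.58 | No | - | - | - |
| 34 | coco_all_19 | 34.13 | No | - | - | - |
| 35 | Neural Baby Talk + CBS | 33.73 | No | - | - | - |
| 36 | Yu-Wu | 31.92 | No | - | - | - |
| 37 | CS395T | 29.57 | No | - | - | - |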