TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/nocaps near-domain

Image Captioning on nocaps near-domain

Metric: B3 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕B3▼Extra DataPaperDate↕Code
1PaLI58.99NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
2GIT2, Single Model58.9NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
3GIT, Single Model58.46NoGIT: A Generative Image-to-text Transformer for ...2022-05-27Code
4CoCa - Google Brain57.89No---
5Microsoft Cognitive Services team55.26NoVIVO: Visual Vocabulary Pre-Training for Novel O...2020-09-28-
6Single Model52.42NoSimVLM: Simple Visual Language Model Pretraining...2021-08-24Code
7FudanFVL51.95No---
8FudanWYZ50.9No---
9IEDA-LAB49.98No---
10firethehole49.39No---
11MD49.29No---
12vll@mk51447.8No---
13VinVL (Microsoft Cognitive Services + MSR)47.02NoVinVL: Revisiting Visual Representations in Visi...2021-01-02Code
14ViTCAP-CIDEr-136.7-ENC-DEC-ViTbfocal10-test-CBS46.72No---
15icgp2ssi1_coco_si_0.02_5_test43.59No---
16evertyhing42.87No---
17camel XE42.51No---
18vinvl_yuan_cbs41.07No---
19RCAL40.77No---
20Oscar40.65No---
21cxy_nocaps_training39.06No---
22Xinyi38.95No---
23MQ-UpDown-C38.29No---
24UpDown + ELMo + CBS37.04No---
25nocaps_training36.91No---
26UpDown36.91No---
27Human36.84No---
28B235.22No---
297_10-7_40000_predict_test.json34.59No---
30None33.49No---
31YX33.1No---
32area_attention32.94No---
33Neural Baby Talk32.37No---
34Neural Baby Talk + CBS30.66No---
35coco_all_1930.26No---
36Yu-Wu26.85No---
37CS395T26.19No---