TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Image Captioning/COCO Captions

Image Captioning on COCO Captions

Metric: ROUGE-L (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕ROUGE-L▼Extra DataPaperDate↕Code
1ExpansionNet v2 (No VL pretraining)61.1NoExploiting Multiple Sequence Lengths in Fast End...2022-08-13Code
2GRIT (No VL pretraining - base)60.7NoGRIT: Faster and Better Image captioning Transfo...2022-07-20Code
3Xmodal-Ctx60.4NoBeyond a Pre-Trained Object Detector: Cross-Moda...2022-05-09Code
4L-Verse60.4NoL-Verse: Bidirectional Generation Between Image ...2021-11-22Code
5Xmodal-Ctx59.5NoBeyond a Pre-Trained Object Detector: Cross-Moda...2022-05-09Code
6AoANet + VC59.3NoVisual Commonsense R-CNN2020-02-27Code
7X-Transformer59.1NoX-Linear Attention Networks for Image Captioning2020-03-31Code
8Transformer_NSC58.7NoA Better Variant of Self-Critical Sequence Train...2020-03-22Code
9LaDiC58.7NoLaDiC: Are Diffusion Models Really Inferior to A...2024-04-16Code
10Meshed-Memory Transformer58.6NoMeshed-Memory Transformer for Image Captioning2019-12-17Code
11CLIP Text Encoder (RL w/ CIDEr-reward)58.5NoFine-grained Image Captioning with CLIP Reward2022-05-26Code
12RefineCap (w/ REINFORCE)58NoRefineCap: Concept-Aware Refinement for Image Ca...2021-09-08-
13RDN57.4NoReflective Decoding Network for Image Captioning2019-08-30-