Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Image Captioning

177 benchmarks, 1878 papers

Image Captioning is the task of describing the content of an image in words. The task lies at the intersection of computer vision and natural language processing. Most image captioning systems use an encoder-decoder framework: the input image is encoded into an intermediate representation, which is then decoded into a descriptive text sequence. The most popular benchmarks are nocaps and COCO, and models are typically evaluated with the BLEU or CIDEr metric.

(Image credit: Reflective Decoding Network for Image Captioning, ICCV'19)
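As a rough illustration of the BLEU metric mentioned above, here is a minimal sentence-level sketch using only the Python standard library. It assumes whitespace-tokenized captions, uniform n-gram weights, and no smoothing; the function names (`ngrams`, `clipped_precision`, `sentence_bleu`) are hypothetical and chosen for this example. Benchmark toolkits typically aggregate counts over a whole corpus and apply their own tokenization, so scores from this sketch are not expected to match leaderboard numbers.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def clipped_precision(candidate, references, n):
    """Modified n-gram precision: each candidate n-gram count is clipped
    to the maximum count of that n-gram in any single reference."""
    cand_counts = Counter(ngrams(candidate, n))
    if not cand_counts:
        return 0.0
    max_ref = Counter()
    for ref in references:
        for gram, count in Counter(ngrams(ref, n)).items():
            max_ref[gram] = max(max_ref[gram], count)
    clipped = sum(min(count, max_ref[gram]) for gram, count in cand_counts.items())
    return clipped / sum(cand_counts.values())


def sentence_bleu(candidate, references, max_n=4):
    """Unsmoothed sentence-level BLEU with uniform weights (illustrative only)."""
    precisions = [clipped_precision(candidate, references, n)
                  for n in range(1, max_n + 1)]
    if min(precisions) == 0.0:
        return 0.0  # without smoothing, any zero precision gives a zero score
    c = len(candidate)
    # Reference length closest to the candidate length (ties go to the shorter one).
    r = min((len(ref) for ref in references),
            key=lambda length: (abs(length - c), length))
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(sum(math.log(p) for p in precisions) / max_n)


if __name__ == "__main__":
    refs = ["a dog runs on the grass".split(),
            "a dog is running across a lawn".split()]
    hyp = "a dog runs on the lawn".split()
    print(sentence_bleu(hyp, refs))
```

Clipping is what keeps a degenerate caption such as "the the the the" from scoring well: each repeated word is credited at most as often as it appears in a single reference.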

Benchmarks

Image Captioning on VizWiz 2020 test-dev

CIDEr, B1, B2, B3, B4, ROUGE-L, METEOR, SPICE

Image Captioning on nocaps in-domain

CIDEr, SPICE, B1, B2, B3, B4, ROUGE-L, METEOR

Image Captioning on nocaps near-domain

SPICE, CIDEr, B1, B2, B3, B4, ROUGE-L, METEOR

Image Captioning on nocaps out-of-domain

CIDEr, SPICE, B1, B2, B3, B4, ROUGE-L, METEOR

Image Captioning on nocaps entire

CIDEr, SPICE, B1, B2, B3, B4, ROUGE-L, METEOR

Image Captioning on COCO Captions

CIDEr, BLEU-4, METEOR, SPICE, ROUGE-L, BLEU-1, BLEU-2, BLEU-3, CLIPScore

Image Captioning on COCO (Common Objects in Context)

CIDEr, BLEU-4, BLEU-1, BLEU-2, BLEU-3, METEOR, ROUGE-L, ROUGE

Image Captioning on VizWiz 2020 test

CIDEr, B1, B2, B3, B4, ROUGE-L, METEOR, SPICE

Image Captioning on ScanRefer Dataset

CIDEr, BLEU-4, METEOR, ROUGE-L

Image Captioning on nocaps-XD entire

CIDEr, B1, B2, B3, B4, ROUGE-L, METEOR, SPICE

Image Captioning on TextCaps 2020

CIDEr, B4, ROUGE-L, METEOR, SPICE

Image Captioning on nocaps-XD in-domain

CIDEr, B1, B2, B3, B4, ROUGE-L, METEOR, SPICE

Image Captioning on nocaps-XD near-domain

CIDEr, B1, B2, B3, B4, ROUGE-L, METEOR, SPICE

Image Captioning on nocaps-XD out-of-domain

CIDEr, B1, B2, B3, B4, ROUGE-L, METEOR, SPICE

Image Captioning on nocaps-val-in-domain

CIDEr, SPICE, Pre-train (#images)

Image Captioning on nocaps-val-overall

CIDEr, SPICE, Pretrain (#images)

Image Captioning on Nr3D

CIDEr, BLEU-4, METEOR, ROUGE-L

Image Captioning on nocaps-val-near-domain

CIDEr, SPICE, Pre-train (#images)

Image Captioning on nocaps-val-out-domain

CIDEr, SPICE, Pretrain (#images)

Image Captioning on SCICAP

BLEU-4

Image Captioning on Flickr30k Captions test

CIDEr, SPICE, BLEU-4, METEOR

Image Captioning on WHOOPS!

BLEU-4, CIDEr

Image Captioning on Object HalBench

chair_i, chair_s

Image Captioning on nocaps val

CIDEr, SPICE

Image Captioning on COCO Captions test

BLEU-4, CIDEr, METEOR, SPICE

Image Captioning on Conceptual Captions

CIDEr, ROUGE-L, SPICE

Image Captioning on FlickrStyle10K

BLEU-1 (Romantic), CIDEr

Image Captioning on Localized Narratives

CIDEr

Image Captioning on relational captioning dataset

Image-Level Recall

Image Captioning on AIC-ICC

BLEU, CIDEr, METEOR, ROUGE-L

Image Captioning on BanglaLekhaImageCaptions

BLEU-1, BLEU-2, BLEU-3, BLEU-4, CIDEr, METEOR, ROUGE-L, SPICE

Image Captioning on ChEBI-20

BLEU, Exact, Levenshtein, MACCS FTS, Morgan FTS, RDK FTS, Validity

Image Captioning on Flickr30k

CIDEr

Image Captioning on Hindi Visual Genome (Challenge Set)

BLEU

Image Captioning on Hindi Visual Genome (Test Set)

BLEU

Image Captioning on IU X-Ray

CIDEr

Image Captioning on MS-COCO

BLEU-1, BLEU-4, CIDEr, METEOR, SPICE, Test ROUGE-L

Image Captioning on MSCOCO

BLEU-4

Image Captioning on Peir Gross

CIDEr, METEOR, ROUGE-L