Image Captioning on nocaps val

Metric: SPICE (higher is better)

LeaderboardDataset
Loading chart...
#ModelSPICEExtra DataPaperDateCode
1Prismer14.8NoPrismer: A Vision-Language Model with Multi-Task...2023-03-04Code
2MetaLM8.6NoLanguage Models are General-Purpose Interfaces2022-06-13Code
3VL-T55.3NoUnifying Vision-and-Language Tasks via Text Gene...2021-02-04Code