Metric: BLEU-2 (higher is better)
| # | Model↕ | BLEU-2▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | AOG + ARS | 44 | No | - | - | - |
| 2 | CoVS | 42.7 | No | - | - | - |
| 3 | IRW | 41.6 | No | - | - | - |
| 4 | SentiStory | 40.7 | No | - | - | - |
| 5 | StoryAnchor: w/ Predicted Nouns | 40 | No | Visual Storytelling via Predicting Anchor Word E... | 2020-01-13 | - |
| 6 | TAVST (RL) | 39.6 | No | Keep it Consistent: Topic-Aware Storytelling fro... | 2019-11-11 | - |
| 7 | AREL-t-100 | 39.1 | No | No Metrics Are Perfect: Adversarial Reward Learn... | 2018-04-24 | Code |
| 8 | GAN | 38.8 | No | No Metrics Are Perfect: Adversarial Reward Learn... | 2018-04-24 | Code |
| 9 | SGEmb | 38.7 | No | - | - | - |
| 10 | XE-ss | 38.2 | No | No Metrics Are Perfect: Adversarial Reward Learn... | 2018-04-24 | Code |
| 11 | ViT-model | 37.5 | No | Vision Transformer Based Model for Describing a ... | 2022-10-06 | - |
| 12 | CST | 36.5 | No | Contextualize, Show and Tell: A Neural Visual St... | 2018-06-03 | Code |
| 13 | INet | 0.401 | No | Hide-and-Tell: Learning to Bridge Photo Streams ... | 2020-02-03 | - |