Visual Storytelling on VIST
Metric: CIDEr (higher is better)
Results
| # | Model | CIDEr | Extra Data | Paper | Date | Code |
|---|-------|-------|------------|-------|------|------|
| 1 | HEGR | 14.1 | Yes | - | - | - |
| 2 | TAPM | 13.8 | Yes | - | - | - |
| 3 | K-Storyteller | 12.1 | No | - | - | Code |
| 4 | AOG + ARS | 12 | No | - | - | - |
| 5 | CoVS | 11.5 | No | - | - | - |
| 6 | IRW | 11 | No | - | - | - |
| 7 | MCSM+RNN | 11 | No | Commonsense Knowledge Aware Concept Selection Fo... | 2021-02-05 | - |
| 8 | HSRL w/ Joint Training | 10.71 | No | Hierarchically Structured Reinforcement Learning... | 2018-05-21 | - |
| 9 | SentiStory | 10.1 | No | - | - | - |
| 10 | INet | 10 | No | Hide-and-Tell: Learning to Bridge Photo Streams ... | 2020-02-03 | - |
| 11 | StoryAnchor: w/ Predicted Nouns | 9.9 | No | Visual Storytelling via Predicting Anchor Word E... | 2020-01-13 | - |
| 12 | SGVST | 9.8 | No | - | - | - |
| 13 | AREL-t-100 | 9.4 | No | No Metrics Are Perfect: Adversarial Reward Learn... | 2018-04-24 | Code |
| 14 | TAVST (RL) | 9.2 | No | Keep it Consistent: Topic-Aware Storytelling fro... | 2019-11-11 | - |
| 15 | AREL | 9.1 | No | What Makes A Good Story? Designing Composite Rew... | 2019-09-11 | Code |
| 16 | VSCMR | 9 | No | Informative Visual Storytelling with Cross-modal... | 2019-07-07 | Code |
| 17 | GAN | 9 | No | No Metrics Are Perfect: Adversarial Reward Learn... | 2018-04-24 | Code |
| 18 | XE-ss | 8.7 | No | No Metrics Are Perfect: Adversarial Reward Learn... | 2018-04-24 | Code |
| 19 | SGEmb | 8.6 | No | - | - | - |
| 20 | ReCo-RL | 8.6 | No | What Makes A Good Story? Designing Composite Rew... | 2019-09-11 | Code |
| 21 | BERT-hLSTMs | 8.37 | No | BERT-hLSTMs: BERT and Hierarchical LSTMs for Vis... | 2020-12-03 | - |
| 22 | TAPM (no V&L) | 8.3 | No | - | - | - |
| 23 | hLSTMs | 7.98 | No | BERT-hLSTMs: BERT and Hierarchical LSTMs for Vis... | 2020-12-03 | - |
| 24 | h-attn-rank | 7.38 | No | Hierarchically-Attentive RNN for Album Summariza... | 2017-08-09 | - |
| 25 | MLE | 7.2 | No | What Makes A Good Story? Designing Composite Rew... | 2019-09-11 | Code |
| 26 | BLEU-RL | 6.7 | No | What Makes A Good Story? Designing Composite Rew... | 2019-09-11 | Code |
| 27 | HSRL | 5.9 | No | What Makes A Good Story? Designing Composite Rew... | 2019-09-11 | Code |
| 28 | CST | 5.1 | No | Contextualize, Show and Tell: A Neural Visual St... | 2018-06-03 | Code |
| 29 | ViT-model | 4.4 | No | Vision Transformer Based Model for Describing a ... | 2022-10-06 | - |