Junjie Hu, Yu Cheng, Zhe Gan, Jingjing Liu, Jianfeng Gao, Graham Neubig
Previous storytelling approaches mostly focused on optimizing traditional metrics such as BLEU, ROUGE and CIDEr. In this paper, we re-examine this problem from a different angle, by looking deep into what defines a realistically-natural and topically-coherent story. To this end, we propose three assessment criteria: relevance, coherence and expressiveness, which we observe through empirical analysis could constitute a "high-quality" story to the human eye. Following this quality guideline, we propose a reinforcement learning framework, ReCo-RL, with reward functions designed to capture the essence of these quality criteria. Experiments on the Visual Storytelling Dataset (VIST) with both automatic and human evaluations demonstrate that our ReCo-RL model achieves better performance than state-of-the-art baselines on both traditional metrics and the proposed new criteria.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Text Generation | VIST | BLEU-4 | 14.4 | BLEU-RL |
| Text Generation | VIST | CIDEr | 6.7 | BLEU-RL |
| Text Generation | VIST | METEOR | 35.2 | BLEU-RL |
| Text Generation | VIST | ROUGE-L | 30.1 | BLEU-RL |
| Text Generation | VIST | SPICE | 8.3 | BLEU-RL |
| Text Generation | VIST | BLEU-4 | 14.3 | MLE |
| Text Generation | VIST | CIDEr | 7.2 | MLE |
| Text Generation | VIST | METEOR | 34.8 | MLE |
| Text Generation | VIST | ROUGE-L | 30 | MLE |
| Text Generation | VIST | SPICE | 8.5 | MLE |
| Text Generation | VIST | BLEU-4 | 13.6 | AREL |
| Text Generation | VIST | CIDEr | 9.1 | AREL |
| Text Generation | VIST | METEOR | 35.2 | AREL |
| Text Generation | VIST | ROUGE-L | 29.3 | AREL |
| Text Generation | VIST | SPICE | 8.9 | AREL |
| Text Generation | VIST | BLEU-4 | 12.4 | ReCo-RL |
| Text Generation | VIST | CIDEr | 8.6 | ReCo-RL |
| Text Generation | VIST | METEOR | 33.9 | ReCo-RL |
| Text Generation | VIST | ROUGE-L | 29.9 | ReCo-RL |
| Text Generation | VIST | SPICE | 8.3 | ReCo-RL |
| Text Generation | VIST | BLEU-4 | 9.8 | HSRL |
| Text Generation | VIST | CIDEr | 5.9 | HSRL |
| Text Generation | VIST | METEOR | 30.1 | HSRL |
| Text Generation | VIST | ROUGE-L | 25.1 | HSRL |
| Text Generation | VIST | SPICE | 7.5 | HSRL |
| Data-to-Text Generation | VIST | BLEU-4 | 14.4 | BLEU-RL |
| Data-to-Text Generation | VIST | CIDEr | 6.7 | BLEU-RL |
| Data-to-Text Generation | VIST | METEOR | 35.2 | BLEU-RL |
| Data-to-Text Generation | VIST | ROUGE-L | 30.1 | BLEU-RL |
| Data-to-Text Generation | VIST | SPICE | 8.3 | BLEU-RL |
| Data-to-Text Generation | VIST | BLEU-4 | 14.3 | MLE |
| Data-to-Text Generation | VIST | CIDEr | 7.2 | MLE |
| Data-to-Text Generation | VIST | METEOR | 34.8 | MLE |
| Data-to-Text Generation | VIST | ROUGE-L | 30 | MLE |
| Data-to-Text Generation | VIST | SPICE | 8.5 | MLE |
| Data-to-Text Generation | VIST | BLEU-4 | 13.6 | AREL |
| Data-to-Text Generation | VIST | CIDEr | 9.1 | AREL |
| Data-to-Text Generation | VIST | METEOR | 35.2 | AREL |
| Data-to-Text Generation | VIST | ROUGE-L | 29.3 | AREL |
| Data-to-Text Generation | VIST | SPICE | 8.9 | AREL |
| Data-to-Text Generation | VIST | BLEU-4 | 12.4 | ReCo-RL |
| Data-to-Text Generation | VIST | CIDEr | 8.6 | ReCo-RL |
| Data-to-Text Generation | VIST | METEOR | 33.9 | ReCo-RL |
| Data-to-Text Generation | VIST | ROUGE-L | 29.9 | ReCo-RL |
| Data-to-Text Generation | VIST | SPICE | 8.3 | ReCo-RL |
| Data-to-Text Generation | VIST | BLEU-4 | 9.8 | HSRL |
| Data-to-Text Generation | VIST | CIDEr | 5.9 | HSRL |
| Data-to-Text Generation | VIST | METEOR | 30.1 | HSRL |
| Data-to-Text Generation | VIST | ROUGE-L | 25.1 | HSRL |
| Data-to-Text Generation | VIST | SPICE | 7.5 | HSRL |
| Visual Storytelling | VIST | BLEU-4 | 14.4 | BLEU-RL |
| Visual Storytelling | VIST | CIDEr | 6.7 | BLEU-RL |
| Visual Storytelling | VIST | METEOR | 35.2 | BLEU-RL |
| Visual Storytelling | VIST | ROUGE-L | 30.1 | BLEU-RL |
| Visual Storytelling | VIST | SPICE | 8.3 | BLEU-RL |
| Visual Storytelling | VIST | BLEU-4 | 14.3 | MLE |
| Visual Storytelling | VIST | CIDEr | 7.2 | MLE |
| Visual Storytelling | VIST | METEOR | 34.8 | MLE |
| Visual Storytelling | VIST | ROUGE-L | 30 | MLE |
| Visual Storytelling | VIST | SPICE | 8.5 | MLE |
| Visual Storytelling | VIST | BLEU-4 | 13.6 | AREL |
| Visual Storytelling | VIST | CIDEr | 9.1 | AREL |
| Visual Storytelling | VIST | METEOR | 35.2 | AREL |
| Visual Storytelling | VIST | ROUGE-L | 29.3 | AREL |
| Visual Storytelling | VIST | SPICE | 8.9 | AREL |
| Visual Storytelling | VIST | BLEU-4 | 12.4 | ReCo-RL |
| Visual Storytelling | VIST | CIDEr | 8.6 | ReCo-RL |
| Visual Storytelling | VIST | METEOR | 33.9 | ReCo-RL |
| Visual Storytelling | VIST | ROUGE-L | 29.9 | ReCo-RL |
| Visual Storytelling | VIST | SPICE | 8.3 | ReCo-RL |
| Visual Storytelling | VIST | BLEU-4 | 9.8 | HSRL |
| Visual Storytelling | VIST | CIDEr | 5.9 | HSRL |
| Visual Storytelling | VIST | METEOR | 30.1 | HSRL |
| Visual Storytelling | VIST | ROUGE-L | 25.1 | HSRL |
| Visual Storytelling | VIST | SPICE | 7.5 | HSRL |
| Story Generation | VIST | BLEU-4 | 14.4 | BLEU-RL |
| Story Generation | VIST | CIDEr | 6.7 | BLEU-RL |
| Story Generation | VIST | METEOR | 35.2 | BLEU-RL |
| Story Generation | VIST | ROUGE-L | 30.1 | BLEU-RL |
| Story Generation | VIST | SPICE | 8.3 | BLEU-RL |
| Story Generation | VIST | BLEU-4 | 14.3 | MLE |
| Story Generation | VIST | CIDEr | 7.2 | MLE |
| Story Generation | VIST | METEOR | 34.8 | MLE |
| Story Generation | VIST | ROUGE-L | 30 | MLE |
| Story Generation | VIST | SPICE | 8.5 | MLE |
| Story Generation | VIST | BLEU-4 | 13.6 | AREL |
| Story Generation | VIST | CIDEr | 9.1 | AREL |
| Story Generation | VIST | METEOR | 35.2 | AREL |
| Story Generation | VIST | ROUGE-L | 29.3 | AREL |
| Story Generation | VIST | SPICE | 8.9 | AREL |
| Story Generation | VIST | BLEU-4 | 12.4 | ReCo-RL |
| Story Generation | VIST | CIDEr | 8.6 | ReCo-RL |
| Story Generation | VIST | METEOR | 33.9 | ReCo-RL |
| Story Generation | VIST | ROUGE-L | 29.9 | ReCo-RL |
| Story Generation | VIST | SPICE | 8.3 | ReCo-RL |
| Story Generation | VIST | BLEU-4 | 9.8 | HSRL |
| Story Generation | VIST | CIDEr | 5.9 | HSRL |
| Story Generation | VIST | METEOR | 30.1 | HSRL |
| Story Generation | VIST | ROUGE-L | 25.1 | HSRL |
| Story Generation | VIST | SPICE | 7.5 | HSRL |