TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/What Makes A Good Story? Designing Composite Rewards for V...

What Makes A Good Story? Designing Composite Rewards for Visual Storytelling

Junjie Hu, Yu Cheng, Zhe Gan, Jingjing Liu, Jianfeng Gao, Graham Neubig

2019-09-11Reinforcement LearningVisual Storytelling
PaperPDFCode(official)

Abstract

Previous storytelling approaches mostly focused on optimizing traditional metrics such as BLEU, ROUGE and CIDEr. In this paper, we re-examine this problem from a different angle, by looking deep into what defines a realistically-natural and topically-coherent story. To this end, we propose three assessment criteria: relevance, coherence and expressiveness, which we observe through empirical analysis could constitute a "high-quality" story to the human eye. Following this quality guideline, we propose a reinforcement learning framework, ReCo-RL, with reward functions designed to capture the essence of these quality criteria. Experiments on the Visual Storytelling Dataset (VIST) with both automatic and human evaluations demonstrate that our ReCo-RL model achieves better performance than state-of-the-art baselines on both traditional metrics and the proposed new criteria.

Results

TaskDatasetMetricValueModel
Text GenerationVISTBLEU-414.4BLEU-RL
Text GenerationVISTCIDEr6.7BLEU-RL
Text GenerationVISTMETEOR35.2BLEU-RL
Text GenerationVISTROUGE-L30.1BLEU-RL
Text GenerationVISTSPICE8.3BLEU-RL
Text GenerationVISTBLEU-414.3MLE
Text GenerationVISTCIDEr7.2MLE
Text GenerationVISTMETEOR34.8MLE
Text GenerationVISTROUGE-L30MLE
Text GenerationVISTSPICE8.5MLE
Text GenerationVISTBLEU-413.6AREL
Text GenerationVISTCIDEr9.1AREL
Text GenerationVISTMETEOR35.2AREL
Text GenerationVISTROUGE-L29.3AREL
Text GenerationVISTSPICE8.9AREL
Text GenerationVISTBLEU-412.4ReCo-RL
Text GenerationVISTCIDEr8.6ReCo-RL
Text GenerationVISTMETEOR33.9ReCo-RL
Text GenerationVISTROUGE-L29.9ReCo-RL
Text GenerationVISTSPICE8.3ReCo-RL
Text GenerationVISTBLEU-49.8HSRL
Text GenerationVISTCIDEr5.9HSRL
Text GenerationVISTMETEOR30.1HSRL
Text GenerationVISTROUGE-L25.1HSRL
Text GenerationVISTSPICE7.5HSRL
Data-to-Text GenerationVISTBLEU-414.4BLEU-RL
Data-to-Text GenerationVISTCIDEr6.7BLEU-RL
Data-to-Text GenerationVISTMETEOR35.2BLEU-RL
Data-to-Text GenerationVISTROUGE-L30.1BLEU-RL
Data-to-Text GenerationVISTSPICE8.3BLEU-RL
Data-to-Text GenerationVISTBLEU-414.3MLE
Data-to-Text GenerationVISTCIDEr7.2MLE
Data-to-Text GenerationVISTMETEOR34.8MLE
Data-to-Text GenerationVISTROUGE-L30MLE
Data-to-Text GenerationVISTSPICE8.5MLE
Data-to-Text GenerationVISTBLEU-413.6AREL
Data-to-Text GenerationVISTCIDEr9.1AREL
Data-to-Text GenerationVISTMETEOR35.2AREL
Data-to-Text GenerationVISTROUGE-L29.3AREL
Data-to-Text GenerationVISTSPICE8.9AREL
Data-to-Text GenerationVISTBLEU-412.4ReCo-RL
Data-to-Text GenerationVISTCIDEr8.6ReCo-RL
Data-to-Text GenerationVISTMETEOR33.9ReCo-RL
Data-to-Text GenerationVISTROUGE-L29.9ReCo-RL
Data-to-Text GenerationVISTSPICE8.3ReCo-RL
Data-to-Text GenerationVISTBLEU-49.8HSRL
Data-to-Text GenerationVISTCIDEr5.9HSRL
Data-to-Text GenerationVISTMETEOR30.1HSRL
Data-to-Text GenerationVISTROUGE-L25.1HSRL
Data-to-Text GenerationVISTSPICE7.5HSRL
Visual StorytellingVISTBLEU-414.4BLEU-RL
Visual StorytellingVISTCIDEr6.7BLEU-RL
Visual StorytellingVISTMETEOR35.2BLEU-RL
Visual StorytellingVISTROUGE-L30.1BLEU-RL
Visual StorytellingVISTSPICE8.3BLEU-RL
Visual StorytellingVISTBLEU-414.3MLE
Visual StorytellingVISTCIDEr7.2MLE
Visual StorytellingVISTMETEOR34.8MLE
Visual StorytellingVISTROUGE-L30MLE
Visual StorytellingVISTSPICE8.5MLE
Visual StorytellingVISTBLEU-413.6AREL
Visual StorytellingVISTCIDEr9.1AREL
Visual StorytellingVISTMETEOR35.2AREL
Visual StorytellingVISTROUGE-L29.3AREL
Visual StorytellingVISTSPICE8.9AREL
Visual StorytellingVISTBLEU-412.4ReCo-RL
Visual StorytellingVISTCIDEr8.6ReCo-RL
Visual StorytellingVISTMETEOR33.9ReCo-RL
Visual StorytellingVISTROUGE-L29.9ReCo-RL
Visual StorytellingVISTSPICE8.3ReCo-RL
Visual StorytellingVISTBLEU-49.8HSRL
Visual StorytellingVISTCIDEr5.9HSRL
Visual StorytellingVISTMETEOR30.1HSRL
Visual StorytellingVISTROUGE-L25.1HSRL
Visual StorytellingVISTSPICE7.5HSRL
Story GenerationVISTBLEU-414.4BLEU-RL
Story GenerationVISTCIDEr6.7BLEU-RL
Story GenerationVISTMETEOR35.2BLEU-RL
Story GenerationVISTROUGE-L30.1BLEU-RL
Story GenerationVISTSPICE8.3BLEU-RL
Story GenerationVISTBLEU-414.3MLE
Story GenerationVISTCIDEr7.2MLE
Story GenerationVISTMETEOR34.8MLE
Story GenerationVISTROUGE-L30MLE
Story GenerationVISTSPICE8.5MLE
Story GenerationVISTBLEU-413.6AREL
Story GenerationVISTCIDEr9.1AREL
Story GenerationVISTMETEOR35.2AREL
Story GenerationVISTROUGE-L29.3AREL
Story GenerationVISTSPICE8.9AREL
Story GenerationVISTBLEU-412.4ReCo-RL
Story GenerationVISTCIDEr8.6ReCo-RL
Story GenerationVISTMETEOR33.9ReCo-RL
Story GenerationVISTROUGE-L29.9ReCo-RL
Story GenerationVISTSPICE8.3ReCo-RL
Story GenerationVISTBLEU-49.8HSRL
Story GenerationVISTCIDEr5.9HSRL
Story GenerationVISTMETEOR30.1HSRL
Story GenerationVISTROUGE-L25.1HSRL
Story GenerationVISTSPICE7.5HSRL

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17