TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/No Metrics Are Perfect: Adversarial Reward Learning for Vi...

No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling

Xin Wang, Wenhu Chen, Yuan-Fang Wang, William Yang Wang

2018-04-24ACL 2018 7Reinforcement LearningImage CaptioningVisual Storytelling
PaperPDFCodeCode(official)

Abstract

Though impressive results have been achieved in visual captioning, the task of generating abstract stories from photo streams is still a little-tapped problem. Different from captions, stories have more expressive language styles and contain many imaginary concepts that do not appear in the images. Thus it poses challenges to behavioral cloning algorithms. Furthermore, due to the limitations of automatic metrics on evaluating story quality, reinforcement learning methods with hand-crafted rewards also face difficulties in gaining an overall performance boost. Therefore, we propose an Adversarial REward Learning (AREL) framework to learn an implicit reward function from human demonstrations, and then optimize policy search with the learned reward function. Though automatic eval- uation indicates slight performance boost over state-of-the-art (SOTA) methods in cloning expert behaviors, human evaluation shows that our approach achieves significant improvement in generating more human-like stories than SOTA systems.

Results

TaskDatasetMetricValueModel
Text GenerationVISTBLEU-163.8AREL-t-100
Text GenerationVISTBLEU-239.1AREL-t-100
Text GenerationVISTBLEU-323.2AREL-t-100
Text GenerationVISTBLEU-414.1AREL-t-100
Text GenerationVISTCIDEr9.4AREL-t-100
Text GenerationVISTMETEOR35AREL-t-100
Text GenerationVISTROUGE-L29.5AREL-t-100
Text GenerationVISTBLEU-162.8GAN
Text GenerationVISTBLEU-238.8GAN
Text GenerationVISTBLEU-323GAN
Text GenerationVISTBLEU-414GAN
Text GenerationVISTCIDEr9GAN
Text GenerationVISTMETEOR35GAN
Text GenerationVISTROUGE-L29.5GAN
Text GenerationVISTBLEU-162.3XE-ss
Text GenerationVISTBLEU-238.2XE-ss
Text GenerationVISTBLEU-322.5XE-ss
Text GenerationVISTBLEU-413.7XE-ss
Text GenerationVISTCIDEr8.7XE-ss
Text GenerationVISTMETEOR34.8XE-ss
Text GenerationVISTROUGE-L29.7XE-ss
Data-to-Text GenerationVISTBLEU-163.8AREL-t-100
Data-to-Text GenerationVISTBLEU-239.1AREL-t-100
Data-to-Text GenerationVISTBLEU-323.2AREL-t-100
Data-to-Text GenerationVISTBLEU-414.1AREL-t-100
Data-to-Text GenerationVISTCIDEr9.4AREL-t-100
Data-to-Text GenerationVISTMETEOR35AREL-t-100
Data-to-Text GenerationVISTROUGE-L29.5AREL-t-100
Data-to-Text GenerationVISTBLEU-162.8GAN
Data-to-Text GenerationVISTBLEU-238.8GAN
Data-to-Text GenerationVISTBLEU-323GAN
Data-to-Text GenerationVISTBLEU-414GAN
Data-to-Text GenerationVISTCIDEr9GAN
Data-to-Text GenerationVISTMETEOR35GAN
Data-to-Text GenerationVISTROUGE-L29.5GAN
Data-to-Text GenerationVISTBLEU-162.3XE-ss
Data-to-Text GenerationVISTBLEU-238.2XE-ss
Data-to-Text GenerationVISTBLEU-322.5XE-ss
Data-to-Text GenerationVISTBLEU-413.7XE-ss
Data-to-Text GenerationVISTCIDEr8.7XE-ss
Data-to-Text GenerationVISTMETEOR34.8XE-ss
Data-to-Text GenerationVISTROUGE-L29.7XE-ss
Visual StorytellingVISTBLEU-163.8AREL-t-100
Visual StorytellingVISTBLEU-239.1AREL-t-100
Visual StorytellingVISTBLEU-323.2AREL-t-100
Visual StorytellingVISTBLEU-414.1AREL-t-100
Visual StorytellingVISTCIDEr9.4AREL-t-100
Visual StorytellingVISTMETEOR35AREL-t-100
Visual StorytellingVISTROUGE-L29.5AREL-t-100
Visual StorytellingVISTBLEU-162.8GAN
Visual StorytellingVISTBLEU-238.8GAN
Visual StorytellingVISTBLEU-323GAN
Visual StorytellingVISTBLEU-414GAN
Visual StorytellingVISTCIDEr9GAN
Visual StorytellingVISTMETEOR35GAN
Visual StorytellingVISTROUGE-L29.5GAN
Visual StorytellingVISTBLEU-162.3XE-ss
Visual StorytellingVISTBLEU-238.2XE-ss
Visual StorytellingVISTBLEU-322.5XE-ss
Visual StorytellingVISTBLEU-413.7XE-ss
Visual StorytellingVISTCIDEr8.7XE-ss
Visual StorytellingVISTMETEOR34.8XE-ss
Visual StorytellingVISTROUGE-L29.7XE-ss
Story GenerationVISTBLEU-163.8AREL-t-100
Story GenerationVISTBLEU-239.1AREL-t-100
Story GenerationVISTBLEU-323.2AREL-t-100
Story GenerationVISTBLEU-414.1AREL-t-100
Story GenerationVISTCIDEr9.4AREL-t-100
Story GenerationVISTMETEOR35AREL-t-100
Story GenerationVISTROUGE-L29.5AREL-t-100
Story GenerationVISTBLEU-162.8GAN
Story GenerationVISTBLEU-238.8GAN
Story GenerationVISTBLEU-323GAN
Story GenerationVISTBLEU-414GAN
Story GenerationVISTCIDEr9GAN
Story GenerationVISTMETEOR35GAN
Story GenerationVISTROUGE-L29.5GAN
Story GenerationVISTBLEU-162.3XE-ss
Story GenerationVISTBLEU-238.2XE-ss
Story GenerationVISTBLEU-322.5XE-ss
Story GenerationVISTBLEU-413.7XE-ss
Story GenerationVISTCIDEr8.7XE-ss
Story GenerationVISTMETEOR34.8XE-ss
Story GenerationVISTROUGE-L29.7XE-ss

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17