TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Plot and Rework: Modeling Storylines for Visual Storytelling

Plot and Rework: Modeling Storylines for Visual Storytelling

Chi-Yang Hsu, Yun-Wei Chu, Ting-Hao 'Kenneth' Huang, Lun-Wei Ku

2021-05-14Findings (ACL) 2021 8FormVisual Storytelling
PaperPDFCode

Abstract

Writing a coherent and engaging story is not easy. Creative writers use their knowledge and worldview to put disjointed elements together to form a coherent storyline, and work and rework iteratively toward perfection. Automated visual storytelling (VIST) models, however, make poor use of external knowledge and iterative generation when attempting to create stories. This paper introduces PR-VIST, a framework that represents the input image sequence as a story graph in which it finds the best path to form a storyline. PR-VIST then takes this path and learns to generate the final story via an iterative training process. This framework produces stories that are superior in terms of diversity, coherence, and humanness, per both automatic and human evaluations. An ablation study shows that both plotting and reworking contribute to the model's superiority.

Results

TaskDatasetMetricValueModel
Text GenerationVISTBLEU-47.65PR-VIST
Text GenerationVISTBLEURT1.37PR-VIST
Text GenerationVISTMETEOR31.6PR-VIST
Text GenerationVISTMLTD45.79PR-VIST
Data-to-Text GenerationVISTBLEU-47.65PR-VIST
Data-to-Text GenerationVISTBLEURT1.37PR-VIST
Data-to-Text GenerationVISTMETEOR31.6PR-VIST
Data-to-Text GenerationVISTMLTD45.79PR-VIST
Visual StorytellingVISTBLEU-47.65PR-VIST
Visual StorytellingVISTBLEURT1.37PR-VIST
Visual StorytellingVISTMETEOR31.6PR-VIST
Visual StorytellingVISTMLTD45.79PR-VIST
Story GenerationVISTBLEU-47.65PR-VIST
Story GenerationVISTBLEURT1.37PR-VIST
Story GenerationVISTMETEOR31.6PR-VIST
Story GenerationVISTMLTD45.79PR-VIST

Related Papers

FreeAudio: Training-Free Timing Planning for Controllable Long-Form Text-to-Audio Generation2025-07-11Shape2Animal: Creative Animal Generation from Natural Silhouettes2025-06-25Controlled Retrieval-augmented Context Evaluation for Long-form RAG2025-06-24JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent2025-06-21FormGym: Doing Paperwork with Agents2025-06-17FreeQ-Graph: Free-form Querying with Semantic Consistent Scene Graph for 3D Scene Understanding2025-06-16Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks2025-06-16Consistent Story Generation with Asymmetry Zigzag Sampling2025-06-11