Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication

Ruize Wang, Zhongyu Wei, Ying Cheng, Piji Li, Haijun Shan, Ji Zhang, Qi Zhang, Xuanjing Huang

2019-11-11COLING 2020 8Image Captioning Question Generation Visual Storytelling

Abstract

Visual storytelling aims to generate a narrative paragraph from a sequence of images automatically. Existing approaches construct text description independently for each image and roughly concatenate them as a story, which leads to the problem of generating semantically incoherent content. In this paper, we propose a new way for visual storytelling by introducing a topic description task to detect the global semantic context of an image stream. A story is then constructed with the guidance of the topic description. In order to combine the two generation tasks, we propose a multi-agent communication framework that regards the topic description generator and the story generator as two agents and learn them simultaneously via iterative updating mechanism. We validate our approach on VIST dataset, where quantitative results, ablations, and human evaluation demonstrate our method's good ability in generating stories with higher quality compared to state-of-the-art methods.

Results

Task	Dataset	Metric	Value	Model
Text Generation	VIST	BLEU-1	64.2	TAVST (RL)
Text Generation	VIST	BLEU-2	39.6	TAVST (RL)
Text Generation	VIST	BLEU-3	23.7	TAVST (RL)
Text Generation	VIST	BLEU-4	14.6	TAVST (RL)
Text Generation	VIST	CIDEr	9.2	TAVST (RL)
Text Generation	VIST	METEOR	35.7	TAVST (RL)
Text Generation	VIST	ROUGE-L	31	TAVST (RL)
Data-to-Text Generation	VIST	BLEU-1	64.2	TAVST (RL)
Data-to-Text Generation	VIST	BLEU-2	39.6	TAVST (RL)
Data-to-Text Generation	VIST	BLEU-3	23.7	TAVST (RL)
Data-to-Text Generation	VIST	BLEU-4	14.6	TAVST (RL)
Data-to-Text Generation	VIST	CIDEr	9.2	TAVST (RL)
Data-to-Text Generation	VIST	METEOR	35.7	TAVST (RL)
Data-to-Text Generation	VIST	ROUGE-L	31	TAVST (RL)
Visual Storytelling	VIST	BLEU-1	64.2	TAVST (RL)
Visual Storytelling	VIST	BLEU-2	39.6	TAVST (RL)
Visual Storytelling	VIST	BLEU-3	23.7	TAVST (RL)
Visual Storytelling	VIST	BLEU-4	14.6	TAVST (RL)
Visual Storytelling	VIST	CIDEr	9.2	TAVST (RL)
Visual Storytelling	VIST	METEOR	35.7	TAVST (RL)
Visual Storytelling	VIST	ROUGE-L	31	TAVST (RL)
Story Generation	VIST	BLEU-1	64.2	TAVST (RL)
Story Generation	VIST	BLEU-2	39.6	TAVST (RL)
Story Generation	VIST	BLEU-3	23.7	TAVST (RL)
Story Generation	VIST	BLEU-4	14.6	TAVST (RL)
Story Generation	VIST	CIDEr	9.2	TAVST (RL)
Story Generation	VIST	METEOR	35.7	TAVST (RL)
Story Generation	VIST	ROUGE-L	31	TAVST (RL)

Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication

Abstract

Results

Related Papers

Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication

Abstract

Results

Related Papers