Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling

Hong Chen, Yifei HUANG, Hiroya Takamura, Hideki Nakayama

2021-02-05

Informativeness, Visual Storytelling

Abstract

Visual storytelling is the task of generating relevant and interesting stories for given image sequences. In this work, we aim to increase the diversity of the generated stories while preserving the informative content of the images. We propose to foster the diversity and informativeness of a generated story with a concept selection module that suggests a set of concept candidates. We then use a large-scale pre-trained model to convert the concepts and images into full stories. To enrich the candidate concepts, we build a commonsense knowledge graph for each image sequence, from which the concept candidates are proposed. To obtain appropriate concepts from the graph, we propose two novel modules that consider the correlation among candidate concepts and the image-concept correlation. Extensive automatic and human evaluation results demonstrate that our model can produce reasonable concepts. This enables our model to outperform previous models by a large margin on the diversity and informativeness of the story, while retaining the relevance of the story to the image sequence.
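As a rough illustration of the pipeline the abstract outlines (expand candidate concepts through a commonsense graph, then select concepts using both image-concept correlation and correlation among concepts), here is a minimal toy sketch. It is not the authors' model: the function names, the one-hop expansion, the greedy additive scoring, and all graph entries and scores are assumptions made up for this example, whereas the paper's modules are learned.

```python
from typing import Dict, FrozenSet, Iterable, List, Set


def expand_candidates(seeds: Iterable[str], graph: Dict[str, List[str]]) -> Set[str]:
    """Return the seed concepts plus their one-hop neighbours in a toy graph."""
    candidates = set(seeds)
    for concept in list(candidates):
        candidates.update(graph.get(concept, []))
    return candidates


def select_concepts(
    candidates: Set[str],
    image_score: Dict[str, float],
    pair_score: Dict[FrozenSet[str], float],
    k: int,
) -> List[str]:
    """Greedily pick up to k concepts.

    Each step scores a candidate by its (assumed) image-concept score plus
    its mean (assumed) correlation with the concepts already selected.
    """
    selected: List[str] = []
    remaining = sorted(candidates)  # sorted for deterministic iteration

    def total(concept: str) -> float:
        relevance = image_score.get(concept, 0.0)
        if not selected:
            return relevance
        coherence = sum(
            pair_score.get(frozenset((concept, s)), 0.0) for s in selected
        ) / len(selected)
        return relevance + coherence

    while remaining and len(selected) < k:
        best = max(remaining, key=total)
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical toy data: concepts detected in the images and a tiny graph.
graph = {"beach": ["sand", "ocean"], "dog": ["fetch"]}
image_score = {"beach": 0.9, "dog": 0.8, "sand": 0.3, "ocean": 0.5, "fetch": 0.4}
pair_score = {
    frozenset(("dog", "fetch")): 0.9,
    frozenset(("beach", "ocean")): 0.6,
}

candidates = expand_candidates(["beach", "dog"], graph)
chosen = select_concepts(candidates, image_score, pair_score, k=3)
print(chosen)
```

In this toy setup, a graph-derived concept such as "ocean" can be selected ahead of a directly detected one because it correlates with a concept already chosen, which is the kind of interaction between the two correlation signals the abstract describes.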

Results

Task	Dataset	Metric	Value	Model
Text Generation	VIST	BLEU-3	23.1	MCSM+RNN
Text Generation	VIST	BLEU-4	13	MCSM+RNN
Text Generation	VIST	CIDEr	11	MCSM+RNN
Text Generation	VIST	METEOR	36.1	MCSM+RNN
Text Generation	VIST	ROUGE-L	30.7	MCSM+RNN
Data-to-Text Generation	VIST	BLEU-3	23.1	MCSM+RNN
Data-to-Text Generation	VIST	BLEU-4	13	MCSM+RNN
Data-to-Text Generation	VIST	CIDEr	11	MCSM+RNN
Data-to-Text Generation	VIST	METEOR	36.1	MCSM+RNN
Data-to-Text Generation	VIST	ROUGE-L	30.7	MCSM+RNN
Visual Storytelling	VIST	BLEU-3	23.1	MCSM+RNN
Visual Storytelling	VIST	BLEU-4	13	MCSM+RNN
Visual Storytelling	VIST	CIDEr	11	MCSM+RNN
Visual Storytelling	VIST	METEOR	36.1	MCSM+RNN
Visual Storytelling	VIST	ROUGE-L	30.7	MCSM+RNN
Story Generation	VIST	BLEU-3	23.1	MCSM+RNN
Story Generation	VIST	BLEU-4	13	MCSM+RNN
Story Generation	VIST	CIDEr	11	MCSM+RNN
Story Generation	VIST	METEOR	36.1	MCSM+RNN
Story Generation	VIST	ROUGE-L	30.7	MCSM+RNN
