Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions

Qing Li, Qingyi Tao, Shafiq Joty, Jianfei Cai, Jiebo Luo

Published: 2018-03-20 · ECCV 2018
Tasks: Question Answering · Explanatory Visual Question Answering · Multi-Task Learning · Visual Question Answering (VQA) · Visual Question Answering
Paper · PDF

Abstract

Most existing work in visual question answering (VQA) is dedicated to improving the accuracy of predicted answers, while disregarding the explanations. We argue that the explanation for an answer is as important as, or even more important than, the answer itself, since it makes the question-answering process more understandable and traceable. To this end, we propose a new task of VQA-E (VQA with Explanation), where computational models are required to generate an explanation along with the predicted answer. We first construct a new dataset, and then frame the VQA-E problem in a multi-task learning architecture. Our VQA-E dataset is automatically derived from the VQA v2 dataset by intelligently exploiting the available captions. We have conducted a user study to validate the quality of the explanations synthesized by our method. We quantitatively show that the additional supervision from explanations not only produces insightful textual sentences to justify the answers, but also improves the performance of answer prediction. Our model outperforms the state-of-the-art methods by a clear margin on the VQA v2 dataset.

Results

The leaderboard lists identical values for the VQAE model under three task entries (Visual Question Answering (VQA), Visual Question Answering, and Explanatory Visual Question Answering), so they are shown once below.

Dataset | Metric    | Value | Model
GQA-REX | BLEU-4    | 42.56 | VQAE
GQA-REX | CIDEr     | 358.2 | VQAE
GQA-REX | GQA-test  | 57.24 | VQAE
GQA-REX | GQA-val   | 65.19 | VQAE
GQA-REX | Grounding | 31.29 | VQAE
GQA-REX | METEOR    | 34.51 | VQAE
GQA-REX | ROUGE-L   | 73.59 | VQAE
GQA-REX | SPICE     | 40.39 | VQAE

Related Papers

- From Roots to Rewards: Dynamic Tree Reasoning with RL (2025-07-17)
- Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering (2025-07-17)
- Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It (2025-07-17)
- City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning (2025-07-17)
- SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation (2025-07-17)
- VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning (2025-07-17)
- Describe Anything Model for Visual Question Answering on Text-rich Images (2025-07-16)
- Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility (2025-07-16)