Explanatory Visual Question Answering

16 benchmarks, 5 papers

Explanatory Visual Question Answering (EVQA) requires a model to answer a question about an image and to generate a multimodal explanation of the reasoning behind that answer.

Benchmarks