TVQA+

TextsVideosUnknown

TVQA+ contains 310.8K bounding boxes, linking depicted objects to visual concepts in questions and answers.