TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Reasoning/Video Question Answering/NExT-GQA

Video Question Answering on NExT-GQA

Metric: Acc@GQA (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Acc@GQA▼Extra DataPaperDate↕Code
1DeVi (Gemini 2.0)28.9NoQuestion-Answering Dense Video Events2024-09-06Code
2VideoMind(7B)28.2NoVideoMind: A Chain-of-LoRA Agent for Long Video ...2025-03-17Code
3DeVi (GPT-4)28NoQuestion-Answering Dense Video Events2024-09-06Code
4LLoVi (GPT-4)26.8NoA Simple LLM Framework for Long-Range Video Ques...2023-12-28Code
5VideoMind (2B)25.2NoVideoMind: A Chain-of-LoRA Agent for Long Video ...2025-03-17Code
6VideoStreaming17.8NoStreaming Long Video Understanding with Large La...2024-05-25-
7LangRepo (12B)17.1NoLanguage Repository for Long Video Understanding2024-03-21Code
8LLoVi (7B)11.2NoA Simple LLM Framework for Long-Range Video Ques...2023-12-28Code
9Mistral (7B)9.2NoMistral 7B2023-10-10Code