Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Natural Language Processing
/
Question Answering
/
TGIF-QA
Question Answering on TGIF-QA
Metric: Accuracy (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
#
Model
↕
Accuracy
▼
Extra Data
Paper
Date
↕
Code
1
Tarsier (34B)
82.5
No
Tarsier: Recipes for Training and Evaluating Lar...
2024-06-30
Code
2
LinVT-Qwen2-VL (7B)
81.3
No
LinVT: Empower Your Image-level Large Language M...
2024-12-06
Code
3
TS-LLaVA-34B
81
No
TS-LLaVA: Constructing Visual Tokens through Thu...
2024-11-17
Code
4
PLLaVA
80.6
No
PLLaVA : Parameter-free LLaVA Extension from Ima...
2024-04-25
Code
5
SlowFast-LLaVA-34B
80.6
No
SlowFast-LLaVA: A Strong Training-Free Baseline ...
2024-07-22
Code
6
IG-VLM
79.1
No
An Image Grid Can Be Worth a Video: Zero-shot Vi...
2024-03-27
Code
7
VideoGPT+
74.6
No
VideoGPT+: Integrating Image and Video Encoders ...
2024-06-13
Code
8
MiniGPT4-video-7B
72.22
No
MiniGPT4-Video: Advancing Multimodal LLMs for Vi...
2024-04-04
Code
9
Video-LLaVA-7B
70
No
Video-LLaVA: Learning United Visual Representati...
2023-11-16
Code
10
Chat-UniVi-7B
69
No
Chat-UniVi: Unified Visual Representation Empowe...
2023-11-14
Code
11
Elysium
66.6
No
Elysium: Exploring Object-level Perception in Vi...
2024-03-25
Code
12
Video-ChatGPT-7B
51.4
No
Video-ChatGPT: Towards Detailed Video Understand...
2023-06-08
Code
13
FrozenBiLM
41.9
No
Zero-Shot Video Question Answering via Frozen Bi...
2022-06-16
Code
14
Video Chat-7B
34.4
No
VideoChat: Chat-Centric Video Understanding
2023-05-10
Code