TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/TVQA

Question Answering on TVQA

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy▼Extra DataPaperDate↕Code
1FrozenBiLM (with speech)59.7NoZero-Shot Video Question Answering via Frozen Bi...2022-06-16Code
2IG-VLM (no speech, GPT-4V)57.8NoAn Image Grid Can Be Worth a Video: Zero-shot Vi...2024-03-27Code
3MiniGPT4-video-7B54.21NoMiniGPT4-Video: Advancing Multimodal LLMs for Vi...2024-04-04Code
4VideoChat_HD_mistral (no speech)50.6NoMVBench: A Comprehensive Multi-modal Video Under...2023-11-28Code
5VideoChat_mistral (no speech)46.4NoMVBench: A Comprehensive Multi-modal Video Under...2023-11-28Code
6VideoChat2 (no speech)40.6NoMVBench: A Comprehensive Multi-modal Video Under...2023-11-28Code
7SEVILA (no speech)38.2NoSelf-Chained Image-Language Model for Video Loca...2023-05-11Code
8InternVideo (no speech)35.9NoInternVideo: General Video Foundation Models via...2022-12-06Code
9FrozenBILM (no speech)29.7NoZero-Shot Video Question Answering via Frozen Bi...2022-06-16Code