Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Video Question Answering on ActivityNet-QA

Metric: Confidence score (lower is better)

Results

| # | Model | Confidence score | Extra Data | Paper | Date | Code |
|---|-------|------------------|------------|-------|------|------|
| 1 | Video Chat | 2.2 | No | VideoChat: Chat-Centric Video Understanding | 2023-05-10 | Code |
| 2 | Video-ChatGPT | 2.7 | No | Video-ChatGPT: Towards Detailed Video Understand... | 2023-06-08 | Code |
| 3 | LLaMA Adapter V2 | 2.7 | No | LLaMA-Adapter V2: Parameter-Efficient Visual Ins... | 2023-04-28 | Code |
| 4 | MovieChat | 3.1 | No | MovieChat: From Dense Token to Sparse Memory for... | 2023-07-31 | Code |
| 5 | VideoChat2 | 3.3 | No | MVBench: A Comprehensive Multi-modal Video Under... | 2023-11-28 | Code |
| 6 | LLaMA-VID-13B (2 Token) | 3.3 | No | LLaMA-VID: An Image is Worth 2 Tokens in Large L... | 2023-11-28 | Code |
| 7 | LLaMA-VID-7B (2 Token) | 3.3 | No | LLaMA-VID: An Image is Worth 2 Tokens in Large L... | 2023-11-28 | Code |
| 8 | Chat-UniVi-13B | 3.3 | No | Chat-UniVi: Unified Visual Representation Empowe... | 2023-11-14 | Code |
| 9 | Video-LLaVA | 3.3 | No | Video-LLaVA: Learning United Visual Representati... | 2023-11-16 | Code |
| 10 | BT-Adapter (zero-shot) | 3.6 | No | BT-Adapter: Video Conversation is Feasible Witho... | 2023-09-27 | Code |