TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/NExT-QA

Question Answering on NExT-QA

Metric: WUPS (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕WUPS▼Extra DataPaperDate↕Code
1PaLI-X38.3YesPaLI-X: On Scaling up a Multilingual Vision and ...2023-05-29Code
2PaLI-337.7YesPaLI-3 Vision Language Models: Smaller, Faster, ...2023-10-13Code
3R2A34.7YesRetrieving-to-Answer: Zero-Shot Video Question A...2023-06-15-
4Flamingo(32-shot)33.5YesFlamingo: a Visual Language Model for Few-Shot L...2022-04-29Code
5Gemini Ultra (zero-shot)29.9NoGemini: A Family of Highly Capable Multimodal Mo...2023-12-19Code
6Gemini Pro (zero-shot)28NoGemini: A Family of Highly Capable Multimodal Mo...2023-12-19Code
7Flamingo(0-shot)26.7YesFlamingo: a Visual Language Model for Few-Shot L...2022-04-29Code
8Emu(0-shot)23.4YesEmu: Generative Pretraining in Multimodality2023-07-11Code