Metric: Accuracy (higher is better)
| # | Model | Accuracy | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | VideoChat2 | 59 | No | MVBench: A Comprehensive Multi-modal Video Under... | 2023-11-28 | Code |
| 2 | VidCtx (7B) | 51.1 | No | VidCtx: Context-aware Video Question Answering w... | 2024-12-23 | Code |
| 3 | Flamingo-9B | 41.8 | No | Flamingo: a Visual Language Model for Few-Shot L... | 2022-04-29 | Code |
| 4 | InternVideo | 41.6 | No | InternVideo: General Video Foundation Models via... | 2022-12-06 | Code |