Metric: Reasoning (higher is better)
| # | Model↕ | Reasoning▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | VideoGPT+ | 3.63 | No | VideoGPT+: Integrating Image and Video Encoders ... | 2024-06-13 | Code |
| 2 | BT-Adapter | 3.62 | No | BT-Adapter: Video Conversation is Feasible Witho... | 2023-09-27 | Code |
| 3 | Video-ChatGPT | 3.6 | No | Video-ChatGPT: Towards Detailed Video Understand... | 2023-06-08 | Code |
| 4 | Chat-UniVi | 3.59 | No | Chat-UniVi: Unified Visual Representation Empowe... | 2023-11-14 | Code |
| 5 | VTimeLLM | 3.45 | No | VTimeLLM: Empower LLM to Grasp Video Moments | 2023-11-30 | Code |
| 6 | VideoChat2 | 3.13 | No | MVBench: A Comprehensive Multi-modal Video Under... | 2023-11-28 | Code |