Metric: Temporal Understanding (higher is better)
| # | Model↕ | Temporal Understanding▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | VideoGPT+ | 1.78 | No | VideoGPT+: Integrating Image and Video Encoders ... | 2024-06-13 | Code |
| 2 | VideoChat2 | 1.66 | No | MVBench: A Comprehensive Multi-modal Video Under... | 2023-11-28 | Code |
| 3 | Chat-UniVi | 1.56 | No | Chat-UniVi: Unified Visual Representation Empowe... | 2023-11-14 | Code |
| 4 | VTimeLLM | 1.46 | No | VTimeLLM: Empower LLM to Grasp Video Moments | 2023-11-30 | Code |
| 5 | Video-ChatGPT | 1.39 | No | Video-ChatGPT: Towards Detailed Video Understand... | 2023-06-08 | Code |
| 6 | BT-Adapter | 1.29 | No | BT-Adapter: Video Conversation is Feasible Witho... | 2023-09-27 | Code |