Video-RAG (Based on LLaVA-Video)
Reported on 4 benchmarks across 2 tasks · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing2 results
- Accuracy (%)· 2024-11-2077.4best: 81.3 (Gemini 1.5 Pro)
- Accuracy· 2024-11-2066.7best: 71.14 (BIMBA-LLaVA-Qwen2-7B)
Reasoning2 results
- Accuracy (%)· 2024-11-2077.4best: 81.3 (Gemini 1.5 Pro)
- Accuracy· 2024-11-2066.7best: 71.14 (BIMBA-LLaVA-Qwen2-7B)