SUM-shot+Vicuna

Reported on 2 benchmarks across 2 tasks · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing1 result

Question AnsweringonMSRVTT-QA
Accuracy· 2023-12-16
56.8
best: 72.4 (Flash-VStream)
Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos arXiv:2312.10300

Reasoning1 result

Video Question AnsweringonMSRVTT-QA
Accuracy· 2023-12-16
56.8
best: 72.4 (Flash-VStream)
Shot2Story20K: A New Benchmark for Comprehensive Understanding of Multi-shot Videos arXiv:2312.10300