TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/VCGBench-Diverse/VideoInstruct

VCGBench-Diverse on VideoInstruct

Metric: Temporal Understanding (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Temporal Understanding▼Extra DataPaperDate↕Code
1VideoGPT+1.78NoVideoGPT+: Integrating Image and Video Encoders ...2024-06-13Code
2VideoChat21.66NoMVBench: A Comprehensive Multi-modal Video Under...2023-11-28Code
3Chat-UniVi1.56NoChat-UniVi: Unified Visual Representation Empowe...2023-11-14Code
4VTimeLLM1.46NoVTimeLLM: Empower LLM to Grasp Video Moments2023-11-30Code
5Video-ChatGPT1.39NoVideo-ChatGPT: Towards Detailed Video Understand...2023-06-08Code
6BT-Adapter1.29NoBT-Adapter: Video Conversation is Feasible Witho...2023-09-27Code