TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/VCGBench-Diverse/VideoInstruct

VCGBench-Diverse on VideoInstruct

Metric: Spatial Understanding (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Spatial Understanding▼Extra DataPaperDate↕Code
1VideoGPT+2.8NoVideoGPT+: Integrating Image and Video Encoders ...2024-06-13Code
2VideoChat22.43NoMVBench: A Comprehensive Multi-modal Video Under...2023-11-28Code
3Chat-UniVi2.36NoChat-UniVi: Unified Visual Representation Empowe...2023-11-14Code
4BT-Adapter2.35NoBT-Adapter: Video Conversation is Feasible Witho...2023-09-27Code
5VTimeLLM2.29NoVTimeLLM: Empower LLM to Grasp Video Moments2023-11-30Code
6Video-ChatGPT2.25NoVideo-ChatGPT: Towards Detailed Video Understand...2023-06-08Code