TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Computer Vision/VCGBench-Diverse/VideoInstruct

VCGBench-Diverse on VideoInstruct

Metric: Contextual Understanding (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Contextual Understanding▼Extra DataPaperDate↕Code
1VideoGPT+2.81NoVideoGPT+: Integrating Image and Video Encoders ...2024-06-13Code
2Chat-UniVi2.66NoChat-UniVi: Unified Visual Representation Empowe...2023-11-14Code
3BT-Adapter2.59NoBT-Adapter: Video Conversation is Feasible Witho...2023-09-27Code
4VideoChat22.51NoMVBench: A Comprehensive Multi-modal Video Under...2023-11-28Code
5VTimeLLM2.48NoVTimeLLM: Empower LLM to Grasp Video Moments2023-11-30Code
6Video-ChatGPT2.46NoVideo-ChatGPT: Towards Detailed Video Understand...2023-06-08Code