TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Temporal Relation Extraction/Vinoground

Temporal Relation Extraction on Vinoground

Metric: Group Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Group Score▼Extra DataPaperDate↕Code
1GPT-4o (CoT)35No---
2GPT-4o24.6No---
3LLaVA-OneVision-Qwen2-72B21.8NoLLaVA-OneVision: Easy Visual Task Transfer2024-08-06Code
4Qwen2-VL-72B17.4NoQwen2-VL: Enhancing Vision-Language Model's Perc...2024-09-18Code
5Qwen2-VL-7B15.2NoQwen2-VL: Enhancing Vision-Language Model's Perc...2024-09-18Code
6LLaVA-OneVision-Qwen2-7B14.6NoLLaVA-OneVision: Easy Visual Task Transfer2024-08-06Code
7Gemini-1.5-Pro (CoT)12.4NoGemini 1.5: Unlocking multimodal understanding a...2024-03-08Code
8MiniCPM-2.611.2NoMiniCPM-V: A GPT-4V Level MLLM on Your Phone2024-08-03Code
9Claude 3.5 Sonnet10.6No---
10Gemini-1.5-Pro10.2NoGemini 1.5: Unlocking multimodal understanding a...2024-03-08Code
11InternLM-XC-2.59.6NoInternLM-XComposer-2.5: A Versatile Large Vision...2024-07-03Code
12InternLM-XC-2.5 (CoT)9NoInternLM-XComposer-2.5: A Versatile Large Vision...2024-07-03Code
13VideoLLaMA2-72B8.4NoVideoLLaMA 2: Advancing Spatial-Temporal Modelin...2024-06-11Code
14MA-LMM-Vicuna-7B6.8NoMA-LMM: Memory-Augmented Large Multimodal Model ...2024-04-08Code
15LLaVA-NeXT-Video-7B (CoT)6.8No---
16Video-LLaVA-7B6.6NoVideo-LLaVA: Learning United Visual Representati...2023-11-16Code
17Phi-3.5-Vision6.2No---
18LLaVA-NeXT-Video-7B6.2No---
19LLaVA-NeXT-Video-34B (CoT)5.2No---
20VTimeLLM5.2NoVTimeLLM: Empower LLM to Grasp Video Moments2023-11-30Code
21LLaVA-NeXT-Video-34B3.8No---
22VideoCLIP1.2NoVideoCLIP: Contrastive Pre-training for Zero-sho...2021-09-28Code
23LanguageBind1.2NoLanguageBind: Extending Video-Language Pretraini...2023-10-03Code
24ImageBind0.6NoImageBind: One Embedding Space To Bind Them All2023-05-09Code