TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Temporal Relation Extraction/Vinoground

Temporal Relation Extraction on Vinoground

Metric: Text Score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Text Score▼Extra DataPaperDate↕Code
1GPT-4o (CoT)59.2No---
2GPT-4o54No---
3Qwen2-VL-72B50.4NoQwen2-VL: Enhancing Vision-Language Model's Perc...2024-09-18Code
4LLaVA-OneVision-Qwen2-72B48.4NoLLaVA-OneVision: Easy Visual Task Transfer2024-08-06Code
5LLaVA-OneVision-Qwen2-7B41.6NoLLaVA-OneVision: Easy Visual Task Transfer2024-08-06Code
6Qwen2-VL-7B40.2NoQwen2-VL: Enhancing Vision-Language Model's Perc...2024-09-18Code
7Gemini-1.5-Pro (CoT)37NoGemini 1.5: Unlocking multimodal understanding a...2024-03-08Code
8VideoLLaMA2-72B36.2NoVideoLLaMA 2: Advancing Spatial-Temporal Modelin...2024-06-11Code
9Gemini-1.5-Pro35.8NoGemini 1.5: Unlocking multimodal understanding a...2024-03-08Code
10Claude 3.5 Sonnet32.8No---
11MiniCPM-2.632.6NoMiniCPM-V: A GPT-4V Level MLLM on Your Phone2024-08-03Code
12InternLM-XC-2.5 (CoT)30.8NoInternLM-XComposer-2.5: A Versatile Large Vision...2024-07-03Code
13InternLM-XC-2.528.8NoInternLM-XComposer-2.5: A Versatile Large Vision...2024-07-03Code
14LLaVA-NeXT-Video-34B (CoT)25.8No---
15Video-LLaVA-7B24.8NoVideo-LLaVA: Learning United Visual Representati...2023-11-16Code
16Phi-3.5-Vision24No---
17MA-LMM-Vicuna-7B23.8NoMA-LMM: Memory-Augmented Large Multimodal Model ...2024-04-08Code
18LLaVA-NeXT-Video-34B23No---
19LLaVA-NeXT-Video-7B (CoT)21.8No---
20LLaVA-NeXT-Video-7B21.8No---
21VTimeLLM19.4NoVTimeLLM: Empower LLM to Grasp Video Moments2023-11-30Code
22VideoCLIP17NoVideoCLIP: Contrastive Pre-training for Zero-sho...2021-09-28Code
23LanguageBind10.6NoLanguageBind: Extending Video-Language Pretraini...2023-10-03Code
24ImageBind9.4NoImageBind: One Embedding Space To Bind Them All2023-05-09Code