TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Long-Context Understanding/MMNeedle

Long-Context Understanding on MMNeedle

Metric: 1 Image, 8*8 Stitching, Exact Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕1 Image, 8*8 Stitching, Exact Accuracy▼Extra DataPaperDate↕Code
1Gemini Pro 1.529.81NoGemini 1.5: Unlocking multimodal understanding a...2024-03-08Code
2GPT-4o19NoGPT-4 Technical Report2023-03-15Code
3GPT-4V7.3NoGPT-4 Technical Report2023-03-15Code
4LLaVA-Llama-33.3NoLLaVA-UHD: an LMM Perceiving Any Aspect Ratio an...2024-03-18Code
5InstructBLIP-Flan-T5-XXL2.2NoInstructBLIP: Towards General-purpose Vision-Lan...2023-05-11Code
6Gemini Pro 1.02.11NoGemini: A Family of Highly Capable Multimodal Mo...2023-12-19Code
7Claude 3 Opus1.6No---
8IDEFICS2-8B0.9NoWhat matters when building vision-language models?2024-05-03-
9mPLUG-Owl-v20.7NomPLUG-Owl2: Revolutionizing Multi-modal Large La...2023-11-07Code
10CogVLM-17B0.3NoCogVLM: Visual Expert for Pretrained Language Mo...2023-11-06Code
11CogVLM2-Llama-30.1NoCogVLM: Visual Expert for Pretrained Language Mo...2023-11-06Code
12InstructBLIP-Vicuna-13B0No--Code