Long-Context Understanding on MMNeedle

Metric: Exact Accuracy with 10 images and 2×2 stitching (higher is better)
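
As a rough illustration of how an exact-accuracy score of this kind could be computed, here is a minimal sketch. It assumes each answer is an (image index, row, column) triple locating the needle sub-image, and that a prediction counts as correct only when all three values match the target. The function name `exact_accuracy` and the tuple layout are hypothetical and are not taken from the benchmark's own evaluation code.

```python
from typing import Sequence, Tuple

# Hypothetical layout: (image_index, row, column) of the needle sub-image.
Location = Tuple[int, int, int]


def exact_accuracy(predictions: Sequence[Location],
                   targets: Sequence[Location]) -> float:
    """Fraction of samples whose predicted location matches the target exactly."""
    if len(predictions) != len(targets):
        raise ValueError("predictions and targets must have the same length")
    if not targets:
        return 0.0
    # A sample is correct only if image index, row, and column all agree.
    correct = sum(1 for p, t in zip(predictions, targets) if p == t)
    return correct / len(targets)
```

With 10 images stitched 2×2, the image index would range over 10 values and the row and column over 2 values each, so a partially correct answer (right image, wrong cell) scores zero under this assumed definition.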
