Long-Context Understanding on MMNeedle

Metric: 10 Images, 8*8 Stitching, Exact Accuracy (higher is better)

LeaderboardDataset
Loading chart...