Long-Context Understanding on MMNeedle

Metric: 1 Image, 2×2 Stitching, Exact Accuracy (higher is better)
