Long-Context Understanding on LongBench

Metric: Average Score (higher is better)

LeaderboardDataset
Loading chart...
#ModelAverage ScoreExtra DataPaperDateCode
1GALI(Llama3-8b-ins-4k-to-16k)46.22NoA Training-Free Length Extrapolation Approach fo...2025-02-04Code
2GALI(Llama3-8b-ins-8k-to-32k)45.38NoA Training-Free Length Extrapolation Approach fo...2025-02-04Code
3GALI(Llama3-8b-ins-8k-to-16k)45.17NoA Training-Free Length Extrapolation Approach fo...2025-02-04Code