Long-Context Understanding on L-Eval

Metric: Average Score (higher is better)

Results

#	Model	Average Score	Extra Data	Paper	Date	Code
1	GALI(Llama3-8b-ins-4k-to-16k)	59.21	No	A Training-Free Length Extrapolation Approach fo...	2025-02-04	Code
2	GALI(Llama3-8b-ins-4k-to-32k)	59.1	No	A Training-Free Length Extrapolation Approach fo...	2025-02-04	Code
3	GALI(Llama3-8b-ins-8k-to-32k)	42.79	No	A Training-Free Length Extrapolation Approach fo...	2025-02-04	Code
4	GALI(Llama3-8b-ins-8k-to-16k)	42.32	No	A Training-Free Length Extrapolation Approach fo...	2025-02-04	Code

All four entries come from the paper "A Training-Free Length Extrapolation Approach for LLMs: Greedy Attention Logit Interpolation (GALI)", dated 2025-02-04. The current state of the art (SOTA) is GALI(Llama3-8b-ins-4k-to-16k), with an Average Score of 59.21.