TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Long-Context Understanding/Ada-LEval (BestAnswer)

Long-Context Understanding on Ada-LEval (BestAnswer)

Metric: 8k (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕8k▼Extra DataPaperDate↕Code
1GPT-4-Turbo-012556.5NoGPT-4 Technical Report2023-03-15Code
2GPT-4-Turbo-110653.5NoGPT-4 Technical Report2023-03-15Code
3Claude-217No---
4GPT-3.5-Turbo-110617No---
5InternLM2-7b13.4NoInternLM2 Technical Report2024-03-26Code
6ChatGLM3-6b-32k3.4NoGLM-130B: An Open Bilingual Pre-trained Model2022-10-05Code
7Vicuna-13b-v1.5-16k2.2NoJudging LLM-as-a-Judge with MT-Bench and Chatbot...2023-06-09Code
8LongChat-7b-v1.5-32k1.9NoJudging LLM-as-a-Judge with MT-Bench and Chatbot...2023-06-09Code
9Vicuna-7b-v1.5-16k1.8NoJudging LLM-as-a-Judge with MT-Bench and Chatbot...2023-06-09Code
10ChatGLM2-6b-32k1.6NoGLM-130B: An Open Bilingual Pre-trained Model2022-10-05Code