TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Long-Context Understanding/Ada-LEval (BestAnswer)

Long-Context Understanding on Ada-LEval (BestAnswer)

Metric: 1k (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕1k▼Extra DataPaperDate↕Code
1GPT-4-Turbo-110674NoGPT-4 Technical Report2023-03-15Code
2GPT-4-Turbo-012573.5NoGPT-4 Technical Report2023-03-15Code
3Claude-265No---
4GPT-3.5-Turbo-110661.5No---
5InternLM2-7b58.6NoInternLM2 Technical Report2024-03-26Code
6Vicuna-13b-v1.5-16k53.4NoJudging LLM-as-a-Judge with MT-Bench and Chatbot...2023-06-09Code
7ChatGLM3-6b-32k39.8NoGLM-130B: An Open Bilingual Pre-trained Model2022-10-05Code
8Vicuna-7b-v1.5-16k37NoJudging LLM-as-a-Judge with MT-Bench and Chatbot...2023-06-09Code
9LongChat-7b-v1.5-32k32.4NoJudging LLM-as-a-Judge with MT-Bench and Chatbot...2023-06-09Code
10ChatGLM2-6b-32k31.2NoGLM-130B: An Open Bilingual Pre-trained Model2022-10-05Code