Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Long-Context Understanding on Ada-LEval (BestAnswer)

Metric: 2k (higher is better)

Results

| # | Model | 2k | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | GPT-4-Turbo-1106 | 73.5 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 2 | GPT-4-Turbo-0125 | 73.5 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 3 | InternLM2-7b | 49.5 | No | InternLM2 Technical Report | 2024-03-26 | Code |
| 4 | GPT-3.5-Turbo-1106 | 48.5 | No | - | - | - |
| 5 | Claude-2 | 43.5 | No | - | - | - |
| 6 | Vicuna-13b-v1.5-16k | 29.2 | No | Judging LLM-as-a-Judge with MT-Bench and Chatbot... | 2023-06-09 | Code |
| 7 | ChatGLM3-6b-32k | 18.8 | No | GLM-130B: An Open Bilingual Pre-trained Model | 2022-10-05 | Code |
| 8 | Vicuna-7b-v1.5-16k | 11.1 | No | Judging LLM-as-a-Judge with MT-Bench and Chatbot... | 2023-06-09 | Code |
| 9 | ChatGLM2-6b-32k | 10.9 | No | GLM-130B: An Open Bilingual Pre-trained Model | 2022-10-05 | Code |
| 10 | LongChat-7b-v1.5-32k | 10.7 | No | Judging LLM-as-a-Judge with MT-Bench and Chatbot... | 2023-06-09 | Code |