TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Long-Context Understanding/Ada-LEval (TSort)

Long-Context Understanding on Ada-LEval (TSort)

Metric: 4k (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕4k▼Extra DataPaperDate↕Code
1GPT-4-Turbo-012516.5NoGPT-4 Technical Report2023-03-15Code
2GPT-4-Turbo-110615.5NoGPT-4 Technical Report2023-03-15Code
3Vicuna-13b-v1.5-16k5NoJudging LLM-as-a-Judge with MT-Bench and Chatbot...2023-06-09Code
4LongChat-7b-v1.5-32k5NoJudging LLM-as-a-Judge with MT-Bench and Chatbot...2023-06-09Code
5Claude-25No---
6GPT-3.5-Turbo-11064.5No---
7InternLM2-7b3.9NoInternLM2 Technical Report2024-03-26Code
8ChatGLM3-6b-32k2.4NoGLM-130B: An Open Bilingual Pre-trained Model2022-10-05Code
9Vicuna-7b-v1.5-16k2.2NoJudging LLM-as-a-Judge with MT-Bench and Chatbot...2023-06-09Code
10ChatGLM2-6b-32k0.2NoGLM-130B: An Open Bilingual Pre-trained Model2022-10-05Code