Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Long-Context Understanding on Ada-LEval (TSort)

Metric: 2k (higher is better)

Results

| # | Model | 2k | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | GPT-4-Turbo-1106 | 18.5 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 2 | GPT-4-Turbo-0125 | 15.5 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 3 | Vicuna-13b-v1.5-16k | 5.4 | No | Judging LLM-as-a-Judge with MT-Bench and Chatbot... | 2023-06-09 | Code |
| 4 | LongChat-7b-v1.5-32k | 5.3 | No | Judging LLM-as-a-Judge with MT-Bench and Chatbot... | 2023-06-09 | Code |
| 5 | Vicuna-7b-v1.5-16k | 5.3 | No | Judging LLM-as-a-Judge with MT-Bench and Chatbot... | 2023-06-09 | Code |
| 6 | InternLM2-7b | 5.1 | No | InternLM2 Technical Report | 2024-03-26 | Code |
| 7 | Claude-2 | 5 | No | - | - | - |
| 8 | GPT-3.5-Turbo-1106 | 4 | No | - | - | - |
| 9 | ChatGLM3-6b-32k | 2.3 | No | GLM-130B: An Open Bilingual Pre-trained Model | 2022-10-05 | Code |
| 10 | ChatGLM2-6b-32k | 0.9 | No | GLM-130B: An Open Bilingual Pre-trained Model | 2022-10-05 | Code |