Long-Context Understanding on Ada-LEval (TSort)

Metric: 128k (higher is better)

LeaderboardDataset
Loading chart...
#Model128kExtra DataPaperDateCode
1GPT-4-Turbo-11066NoGPT-4 Technical Report2023-03-15Code
2GPT-4-Turbo-01252NoGPT-4 Technical Report2023-03-15Code