Metric: TDEX (higher is better)
| # | Model↕ | TDEX▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | 67 | No | Evaluating and Enhancing LLMs for Multi-turn Tex... | 2024-12-21 | Code |
| 2 | Gemini-1.5 Flash | 65.8 | No | Evaluating and Enhancing LLMs for Multi-turn Tex... | 2024-12-21 | Code |
| 3 | GPT-3.5 Turbo | 64.1 | No | Evaluating and Enhancing LLMs for Multi-turn Tex... | 2024-12-21 | Code |
| 4 | Llama3-8B | 64 | No | Evaluating and Enhancing LLMs for Multi-turn Tex... | 2024-12-21 | Code |
| 5 | Llama3-70B | 62.8 | No | Evaluating and Enhancing LLMs for Multi-turn Tex... | 2024-12-21 | Code |
| 6 | SQLCoder-8B | 30.7 | No | Evaluating and Enhancing LLMs for Multi-turn Tex... | 2024-12-21 | Code |