Metric: AlignScore (higher is better)
| # | Model | AlignScore | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | GPT-3.5-Turbo-0613-16k | 0.1378 | No | Language Models are Few-Shot Learners | 2020-05-28 | Code |
| 2 | Command-R-v01-34B | 0.1362 | No | - | - | - |
| 3 | GPT-4o-2024-08-06-128k | 0.1224 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 4 | Llama-3-IT-8B-8k | 0.1098 | No | The Llama 3 Herd of Models | 2024-07-31 | Code |
| 5 | Llama-3-IT-8B-32k | 0.1016 | No | The Llama 3 Herd of Models | 2024-07-31 | Code |
| 6 | Mistral-v02-7B-32k | 0.0827 | No | Mistral 7B | 2023-10-10 | Code |