Metric: Macro F1 (higher is better)
| # | Model | Macro F1 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Mistral-IT-v02-7B-32k | 0.4703 | No | Mistral 7B | 2023-10-10 | Code |
| 2 | Command-R-v01-34B-128k | 0.4197 | No | - | - | - |
| 3 | GPT-3.5-Turbo-0613-16k | 0.3304 | No | Language Models are Few-Shot Learners | 2020-05-28 | Code |
| 4 | Llama-3-IT-8B-8k | 0.3112 | No | The Llama 3 Herd of Models | 2024-07-31 | Code |
| 5 | GPT-4o-2024-08-06 | 0.3087 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 6 | Llama-3-IT-8B-32k | 0.2881 | No | The Llama 3 Herd of Models | 2024-07-31 | Code |
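
For reference, macro F1 is the unweighted mean of per-class F1 scores, so rare classes count as much as frequent ones. The following is a minimal sketch of that calculation, not the benchmark's own evaluation script. The function name `macro_f1`, the example labels, and the choice to average over every label seen in either the gold or predicted lists are assumptions made for illustration.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (illustrative helper, not the benchmark's code)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    # Assumption: average over every label that appears in gold or predictions.
    labels = sorted(set(y_true) | set(y_pred))
    if not labels:
        return 0.0

    tp, fp, fn = Counter(), Counter(), Counter()
    for gold, pred in zip(y_true, y_pred):
        if gold == pred:
            tp[gold] += 1
        else:
            fp[pred] += 1  # predicted class gains a false positive
            fn[gold] += 1  # gold class gains a false negative

    per_class = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class has no support.
        per_class.append(2 * tp[label] / denom if denom else 0.0)

    return sum(per_class) / len(per_class)


if __name__ == "__main__":
    gold = ["a", "a", "b", "c"]
    pred = ["a", "b", "b", "c"]
    print(f"{macro_f1(gold, pred):.4f}")
```

In this toy example, classes `a` and `b` each score 2/3 and class `c` scores 1, so the macro average is 7/9.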