Mistral-v02-7B-32k
Reported on 3 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing3 results
- AlignScore· 2023-10-100.0827best: 0.1378 (GPT-3.5-Turbo-0613-16k)
- Prometheus-2 Answer Correctness· 2023-10-103.4245best: 3.0408 (GPT-3.5-Turbo-0613-16k)
- Rouge-L· 2023-10-100.1922best: 0.2414 (GPT-3.5-Turbo-0613-16k)