OpenAI/o3-mini
Reported on 3 benchmarks across 1 task
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Knowledge Base3 results
- ROUGE-160.12
- ROUGE-254.22
- ROUGE-L57.21best: 60.29 (Riple/Saanvi-v0.1)