OpenAI/o3-2025-01-31-high
Reported on 2 benchmarks across 1 task · 1 paper · 1 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing · 2 results
- EM · uses extra data · 2025-01-30 · SOTA · 92.52
- F1 · uses extra data · 2025-01-30 · 93.13 · best: 94.01 (Riple/Saanvi-v0.5-DeepAnalysis)