Codex 5-shot CoT
Reported on 4 benchmarks across 1 task · 1 paper · 4 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing4 results
- Accuracy· 2022-07-17SOTA78.2best: 81.6 (Meditron-70B (CoT + SC))
- Accuracy· 2022-07-17SOTA60.2best: 91.1 (Med-Gemini)
- Dev Set (Acc-%)· 2022-07-17SOTA0.597best: 66 (Meditron-70B (CoT + SC))
- Test Set (Acc-%)· 2022-07-17SOTA0.627best: 0.723 (Med-PaLM 2 (ER))