BERT (Devlin et al., 2019)-Base
Reported on 2 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing2 results
- Dev Set (Acc-%)· 2022-03-270.35best: 66 (Meditron-70B (CoT + SC))
- Test Set (Acc-%)· 2022-03-270.33best: 0.723 (Med-PaLM 2 (ER))