Chain-of-Skills
Reported on 6 benchmarks across 1 task · 1 paper · 2 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing · 6 results
- JOINT-EM · 2023-05-04 · SOTA · 0.457 · best: 0.505 (Beam Retrieval)
- SUP-EM · 2023-05-04 · SOTA · 0.613 · best: 0.663 (Beam Retrieval)
- ANS-EM · 2023-05-04 · 0.674 · best: 0.727 (Beam Retrieval)
- ANS-F1 · 2023-05-04 · 0.801 · best: 0.85 (Beam Retrieval)
- JOINT-F1 · 2023-05-04 · 0.717 · best: 0.775 (Beam Retrieval)
- SUP-F1 · 2023-05-04 · 0.853 · best: 0.901 (Beam Retrieval)