Colin
Reported on 5 benchmarks across 1 task
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing5 results
- 20.24best: 28.81 (B-Ultra)
- 23.7best: 39 (LXR955, No Ensemble)
- 45.53best: 55.4 (LXR955, No Ensemble)
- unanswerable82.02best: 80.48 (DVW)
- 59.85best: 74 (LXR955, No Ensemble)