Transformer + Pre-train with Pseudo Data
Reported on 2 benchmarks across 1 task · 1 paper · 1 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing2 results
- F0.5· 2019-09-02SOTA65best: 72.8 (Ensembles of best 7 models + GRECO + GTP-rerank)
- F0.5· 2019-09-0270.2best: 81.4 (Majority-voting ensemble on best 7 models)