Metric: Exact Match (EM), higher is better
| # | Model | EM | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Cluster-Former (#C=512) | 54 | No | Cluster-Former: Clustering-based Sparse Transfor... | 2020-09-13 | - |
| 2 | Locality-Sensitive Hashing | 53.2 | No | Reformer: The Efficient Transformer | 2020-01-13 | Code |
| 3 | Sparse Attention | 52.1 | No | Generating Long Sequences with Sparse Transformers | 2019-04-23 | Code |
| 4 | Multi-passage BERT | 51.1 | No | Multi-passage BERT: A Globally Normalized BERT M... | 2019-08-22 | - |
| 5 | Denoising QA | 42.2 | No | - | - | Code |
| 6 | DECAPROP | 38.6 | No | Densely Connected Attention Propagation for Read... | 2018-11-10 | Code |
| 7 | DrQA | 37.7 | No | Reading Wikipedia to Answer Open-Domain Questions | 2017-03-31 | Code |