Metric: F1 (higher is better)
| # | Model | F1 | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | PaCE | 63.8 | No | PaCE: Unified Multi-modal Dialogue Pre-training ... | 2023-05-24 | Code |
| 2 | T5-3B | 58.9 | No | Exploring the Limits of Transfer Learning with a... | 2019-10-23 | Code |
| 3 | T5-base | 58.1 | No | Exploring the Limits of Transfer Learning with a... | 2019-10-23 | Code |
| 4 | BERT | 53.2 | No | BERT: Pre-training of Deep Bidirectional Transfo... | 2018-10-11 | Code |
| 5 | ViLT | 52.4 | No | ViLT: Vision-and-Language Transformer Without Co... | 2021-02-05 | Code |
| 6 | ALBERT-base | 52.2 | No | ALBERT: A Lite BERT for Self-supervised Learning... | 2019-09-26 | Code |