Metric: pairwise accuracy (higher is better)
| # | Model | Pairwise accuracy (%) | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | ViLBERT 12-in-1 | 72.4 | No | VALSE: A Task-Independent Benchmark for Vision a... | 2021-12-14 | Code |
| 2 | LXMERT | 64.4 | No | VALSE: A Task-Independent Benchmark for Vision a... | 2021-12-14 | Code |
| 3 | ViLBERT | 61.2 | No | VALSE: A Task-Independent Benchmark for Vision a... | 2021-12-14 | Code |
| 4 | CLIP | 56.2 | No | VALSE: A Task-Independent Benchmark for Vision a... | 2021-12-14 | Code |
| 5 | GPT1 | 53.1 | No | VALSE: A Task-Independent Benchmark for Vision a... | 2021-12-14 | Code |
| 6 | GPT2 | 51.9 | No | VALSE: A Task-Independent Benchmark for Vision a... | 2021-12-14 | Code |
| 7 | VisualBERT | 45.7 | No | VALSE: A Task-Independent Benchmark for Vision a... | 2021-12-14 | Code |
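
For readers unfamiliar with the metric, the following is a minimal sketch of how a pairwise accuracy figure like those in the table could be computed. It assumes each test item gives the model a correct caption and a foiled caption for the same image, and that the model returns one alignment score per caption. The function name `pairwise_accuracy`, the score lists, and the choice to count ties as failures are assumptions made for illustration; they are not taken from the benchmark's own code.

```python
from typing import Sequence


def pairwise_accuracy(caption_scores: Sequence[float],
                      foil_scores: Sequence[float]) -> float:
    """Return the percentage of pairs in which the correct caption
    receives a strictly higher score than its foil.

    Assumption: ties count as failures. An evaluation script may
    handle ties differently.
    """
    if len(caption_scores) != len(foil_scores):
        raise ValueError("score lists must have the same length")
    if not caption_scores:
        raise ValueError("at least one pair is required")
    # Count the pairs where the correct caption outranks the foil.
    wins = sum(c > f for c, f in zip(caption_scores, foil_scores))
    return 100.0 * wins / len(caption_scores)


# Hypothetical scores for four caption/foil pairs (not benchmark data).
example = pairwise_accuracy([0.9, 0.4, 0.7, 0.2], [0.1, 0.6, 0.3, 0.2])
print(example)
```

Under this definition a model that scores captions and foils at random would land near 50, which is a useful reference point when reading the lower rows of the table.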