Text + Text (no Multimodal Pretext Training)
Reported on 3 benchmarks across 1 task · 1 paper · 3 SOTA
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Reasoning3 results
- Accuracy· 2022-06-05SOTA41.4best: 61.6 (Tarsier (34B))
- Accuracy· 2022-06-05SOTA40.2
- Accuracy· 2022-06-05SOTA93.2