FrozenBiLM (0-shot)
Reported on 4 benchmarks across 1 task · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Reasoning4 results
- Accuracy· 2022-06-1625.9best: 61.6 (Tarsier (34B))
- Accuracy· 2022-06-1616.7best: 72.4 (Flash-VStream)
- Accuracy· 2022-06-1626.8best: 40.2 (Text + Text (no Multimodal Pretext Training))
- Accuracy· 2022-06-1658.4best: 93.2 (Text + Text (no Multimodal Pretext Training))