Phi-3.5-Vision
Reported on 6 benchmarks across 2 tasks
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing: 6 results
- Group Score: 6.2 (best: 35, GPT-4o (CoT))
- Text Score: 24 (best: 59.2, GPT-4o (CoT))
- Video Score: 22.4 (best: 51, GPT-4o (CoT))
- Group Score: 6.2 (best: 35, GPT-4o (CoT))
- Text Score: 24 (best: 59.2, GPT-4o (CoT))
- Video Score: 22.4 (best: 51, GPT-4o (CoT))