Qwen-vl-plus
Reported on 3 benchmarks across 2 tasks · 2 papers
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Reasoning (2 results)
- Recall · 2025-04-10 · 20.37 · best: 39.27 (ChatGPT-4o)
- Recall · 2025-04-10 · 31 · best: 63.24 (Claude-3-haiku)
Computer Vision (1 result)
- Total Column Score · uses extra data · 2023-08-24 · 310 · best: 463 (Claude 3.5 Sonnet)