Claude 3.5 Sonnet

Reported on 8 benchmarks across 4 tasks

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing6 results

Knowledge Base1 result

Computer Vision1 result