TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Visual Question Answering (VQA)/SME

Visual Question Answering (VQA) on SME

Metric: METEOR (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕METEOR▼Extra DataPaperDate↕Code
1MEAgent50.55No--Code
2GPT-4-1106-Vision-Preview35.17NoGPT-4 Technical Report2023-03-15Code
3Gemini-1.5 Pro34.61NoGemini 1.5: Unlocking multimodal understanding a...2024-03-08Code
4Qwen-VL-Max23.4NoQwen-VL: A Versatile Vision-Language Model for U...2023-08-24Code
5VCIN19.82No--Code
6GLM-4V17.53NoCogVLM: Visual Expert for Pretrained Language Mo...2023-11-06Code
7REX4.37NoREX: Reasoning-aware and Grounded Explanation2022-03-11Code