Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Visual Question Answering on ViP-Bench

Metric: GPT-4 score (human) (higher is better)


Results

| # | Model | GPT-4 score (human) | Extra Data | Paper | Date | Code |
|---|-------|---------------------|------------|-------|------|------|
| 1 | GPT-4V-turbo-detail:high (Visual Prompt) | 59.9 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 2 | GPT-4V-turbo-detail:low (Visual Prompt) | 51.4 | No | GPT-4 Technical Report | 2023-03-15 | Code |
| 3 | LLaVA-NeXT-Inst-IT-Qwen2-7B (Visual Prompt) | 49 | Yes | Inst-IT: Boosting Multimodal Instance Understand... | 2024-12-04 | Code |
| 4 | ViP-LLaVA-13B (Visual Prompt) | 48.2 | No | Making Large Language Models Better Data Creators | 2023-10-31 | Code |
| 5 | LLaVA-NeXT-Inst-IT-Vicuna-7B (Visual Prompt) | 48.2 | Yes | Inst-IT: Boosting Multimodal Instance Understand... | 2024-12-04 | Code |
| 6 | LLaVA-1.5-13B (Visual Prompt) | 42.9 | No | Improved Baselines with Visual Instruction Tuning | 2023-10-05 | Code |
| 7 | Qwen-VL-Chat (Visual Prompt) | 41.7 | No | Qwen-VL: A Versatile Vision-Language Model for U... | 2023-08-24 | Code |
| 8 | InstructBLIP-13B (Visual Prompt) | 35.2 | No | InstructBLIP: Towards General-purpose Vision-Lan... | 2023-05-11 | Code |