TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Reasoning/Natural Language Visual Grounding/ScreenSpot

Natural Language Visual Grounding on ScreenSpot

Metric: Accuracy (%) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy (%)▼Extra DataPaperDate↕Code
1UGround-V1-7B86.34NoNavigating the Digital World as Humans Do: Unive...2024-10-07Code
2Aguvis-7B83NoAguvis: Unified Pure Vision Agents for Autonomou...2024-12-05Code
3OS-Atlas-Base-7B82.47NoOS-ATLAS: A Foundation Action Model for Generali...2024-10-30Code
4Aria-UI81.1NoAria-UI: Visual Grounding for GUI Instructions2024-12-20Code
5Aguvis-G-7B81NoAguvis: Unified Pure Vision Agents for Autonomou...2024-12-05Code
6UGround-V1-2B77.67NoNavigating the Digital World as Humans Do: Unive...2024-10-07Code
7ShowUI75.1NoShowUI: One Vision-Language-Action Model for GUI...2024-11-26Code
8ShowUI-G75NoShowUI: One Vision-Language-Action Model for GUI...2024-11-26Code
9UGround73.3NoNavigating the Digital World as Humans Do: Unive...2024-10-07Code
10OmniParser73NoOmniParser for Pure Vision Based GUI Agent2024-08-01Code
11OS-Atlas-Base-4B68NoOS-ATLAS: A Foundation Action Model for Generali...2024-10-30Code
12SeeClick53.4NoSeeClick: Harnessing GUI Grounding for Advanced ...2024-01-17Code
13CogAgent47.4NoCogAgent: A Visual Language Model for GUI Agents2023-12-14Code
14Qwen2-VL-7B42.1NoQwen2-VL: Enhancing Vision-Language Model's Perc...2024-09-18Code
15Qwen-GUI28.6NoGUICourse: From General Vision Language Models t...2024-06-17Code
16MiniGPT-v25.7NoMiniGPT-v2: large language model as a unified in...2023-10-14Code
17Groma5.2NoGroma: Localized Visual Tokenization for Groundi...2024-04-19Code
18Qwen-VL5.2NoQwen-VL: A Versatile Vision-Language Model for U...2023-08-24Code