TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Visual Question Answering (VQA)/TextVQA test-standard

Visual Question Answering (VQA) on TextVQA test-standard

Metric: overall (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕overall▼Extra DataPaperDate↕Code
1PaLI73.1NoPaLI: A Jointly-Scaled Multilingual Language-Ima...2022-09-14Code
2TAP53.97No---
3TAG53.69NoTAG: Boosting Text-VQA via Text-aware Visual Que...2022-08-03Code
4PromptCap51.8NoPromptCap: Prompt-Guided Task-Aware Image Captio...2022-11-15Code
5ssbaseline45.66No---
6SMA single model45.51No---
7SAM (Single Model)44.8No---
8colab_buaa44.73No---
9CRN (Single Model)40.96No---
10CIG40.77No---
11M4C40.46Yes---
12Shuai39.95No---
13mmgnn32.46No---