Metric: TextScenesHQ OCR (Accuracy) (higher is better)
| # | Model | TextScenesHQ OCR (Accuracy) | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | Dalle3 | 69.26 | No | - | - | - |
| 2 | Grok3 | 35.07 | No | - | - | - |
| 3 | SD3.5 Large | 19.03 | No | - | - | - |
| 4 | Infinity-2B | 1.06 | No | Infinity-MM: Scaling Multimodal Performance with... | 2024-10-24 | Code |
| 5 | TextDiffuser2 | 0.66 | No | TextDiffuser-2: Unleashing the Power of Language... | 2023-11-28 | - |
| 6 | Anytext | 0.42 | No | AnyText: Multilingual Visual Text Generation And... | 2023-11-06 | Code |
| 7 | PixArt-Sigma | 0.34 | No | PixArt-Σ: Weak-to-Strong Training of Diffusion T... | 2024-03-07 | Code |