TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/PixArt-a

PixArt-a

Reported on 24 benchmarks across 4 tasks · 1 paper · 24 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Audio12 results

  • 10-shot image generationonT2I-CompBench
    Color· 2023-09-30
    0.6886
    best: 0.7913 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 10-shot image generationonT2I-CompBench
    Complex· 2023-09-30
    0.4117
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 10-shot image generationonT2I-CompBench
    Non-Spatial· 2023-09-30
    0.3179
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 10-shot image generationonT2I-CompBench
    Shape· 2023-09-30
    0.5582
    best: 0.5846 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 10-shot image generationonT2I-CompBench
    Spatial· 2023-09-30
    0.2082
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 10-shot image generationonT2I-CompBench
    Texture· 2023-09-30
    0.7044
    best: 0.7422 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 1 Image, 2*2 StitchionT2I-CompBench
    Color· 2023-09-30
    0.6886
    best: 0.7913 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 1 Image, 2*2 StitchionT2I-CompBench
    Complex· 2023-09-30
    0.4117
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 1 Image, 2*2 StitchionT2I-CompBench
    Non-Spatial· 2023-09-30
    0.3179
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 1 Image, 2*2 StitchionT2I-CompBench
    Shape· 2023-09-30
    0.5582
    best: 0.5846 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 1 Image, 2*2 StitchionT2I-CompBench
    Spatial· 2023-09-30
    0.2082
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • 1 Image, 2*2 StitchionT2I-CompBench
    Texture· 2023-09-30
    0.7044
    best: 0.7422 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426

Medical6 results

  • Image GenerationonT2I-CompBench
    Color· 2023-09-30
    0.6886
    best: 0.7913 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Image GenerationonT2I-CompBench
    Complex· 2023-09-30
    0.4117
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Image GenerationonT2I-CompBench
    Non-Spatial· 2023-09-30
    0.3179
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Image GenerationonT2I-CompBench
    Shape· 2023-09-30
    0.5582
    best: 0.5846 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Image GenerationonT2I-CompBench
    Spatial· 2023-09-30
    0.2082
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Image GenerationonT2I-CompBench
    Texture· 2023-09-30
    0.7044
    best: 0.7422 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426

Natural Language Processing6 results

  • Text-to-Image GenerationonT2I-CompBench
    Color· 2023-09-30
    0.6886
    best: 0.7913 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Text-to-Image GenerationonT2I-CompBench
    Complex· 2023-09-30
    0.4117
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Text-to-Image GenerationonT2I-CompBench
    Non-Spatial· 2023-09-30
    0.3179
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Text-to-Image GenerationonT2I-CompBench
    Shape· 2023-09-30
    0.5582
    best: 0.5846 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Text-to-Image GenerationonT2I-CompBench
    Spatial· 2023-09-30
    0.2082
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426
  • Text-to-Image GenerationonT2I-CompBench
    Texture· 2023-09-30
    0.7044
    best: 0.7422 (Emu3)
    SOTA
    PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image SynthesisarXiv:2310.00426