TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/SiD2

SiD2

Reported on 9 benchmarks across 3 tasks · 1 paper · 2 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Medical3 results

  • Image GenerationonImageNet 128x128
    FID· 2024-10-25
    1.26
    SOTA
    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusionarXiv:2410.19324
  • Image GenerationonImageNet 256x256
    FID· 2024-10-25
    1.38
    best: 1.06 (SiT-XL/2 + UCGM-S (E2E-VAE + 40 sampling steps + CFG))
    SOTA
    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusionarXiv:2410.19324
  • Image GenerationonImageNet 512x512
    FID· 2024-10-25
    1.48
    best: 1.21 (EDM2-L + DDO (SD-VAE, 25 steps, DPM-Solver-v3))
    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusionarXiv:2410.19324

Computer Vision3 results

  • VideoonKinetics-600 12 frames, 64x64
    Cond· 2024-10-25
    5
    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusionarXiv:2410.19324
  • VideoonKinetics-600 12 frames, 64x64
    FVD· 2024-10-25
    2.3
    best: 224.73 (LVT)
    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusionarXiv:2410.19324
  • VideoonKinetics-600 12 frames, 64x64
    Pred· 2024-10-25
    11
    best: 12 (Video VQ-VAE FVD)
    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusionarXiv:2410.19324

Time Series3 results

  • Video PredictiononKinetics-600 12 frames, 64x64
    Cond· 2024-10-25
    5
    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusionarXiv:2410.19324
  • Video PredictiononKinetics-600 12 frames, 64x64
    FVD· 2024-10-25
    2.3
    best: 224.73 (LVT)
    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusionarXiv:2410.19324
  • Video PredictiononKinetics-600 12 frames, 64x64
    Pred· 2024-10-25
    11
    best: 12 (Video VQ-VAE FVD)
    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusionarXiv:2410.19324