Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Vidu 2.0

Reported on 28 benchmarks across 4 tasks · 1 paper

Note: results are matched by exact model name. Different papers may use the same name for different model variants.
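
As a rough illustration of what exact-name matching implies, the sketch below groups result rows by the literal model-name string. The record layout, the `group_by_exact_name` helper, and the `Vidu-2.0` spelling are all hypothetical and are not taken from the site's code or data; only the two `Vidu 2.0` scores come from this page.

```python
from collections import defaultdict

# Hypothetical rows: (model name as written in a paper, metric, value).
# The first two values appear on this page; the third row is invented
# purely to show how a different spelling ends up under a separate key.
records = [
    ("Vidu 2.0", "Total Score", 0.4759),
    ("Vidu 2.0", "FaceSim", 0.3511),
    ("Vidu-2.0", "Total Score", 0.5),  # invented variant spelling
]


def group_by_exact_name(rows):
    """Group rows by the exact model-name string, with no normalization."""
    grouped = defaultdict(list)
    for name, metric, value in rows:
        grouped[name].append((metric, value))
    return dict(grouped)


by_model = group_by_exact_name(records)
```

Under this assumption, two papers that spell the name identically are merged even if they evaluated different variants, while a differently spelled name is listed as a separate model.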

Computer Vision · 14 results

  • Video on OpenS2V-Eval
    Aesthetics · 2024-11-26
    0.4147
    best: 0.4824 (Wan2.1-VACE-1.3B)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video on OpenS2V-Eval
    FaceSim · 2024-11-26
    0.3511
    best: 0.5509 (Wan2.1-VACE-14B)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video on OpenS2V-Eval
    GmeScore · 2024-11-26
    0.6757
    best: 0.7138 (Wan2.1-VACE-1.3B-Preview)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video on OpenS2V-Eval
    Motion · 2024-11-26
    0.1352
    best: 0.416 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video on OpenS2V-Eval
    NaturalScore · 2024-11-26
    0.7144
    best: 0.7906 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video on OpenS2V-Eval
    NexusScore · 2024-11-26
    0.4355
    best: 0.4592 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video on OpenS2V-Eval
    Total Score · 2024-11-26
    0.4759
    best: 0.5446 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Image to Video Generation on OpenS2V-Eval
    Aesthetics · 2024-11-26
    0.4147
    best: 0.4824 (Wan2.1-VACE-1.3B)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Image to Video Generation on OpenS2V-Eval
    FaceSim · 2024-11-26
    0.3511
    best: 0.5509 (Wan2.1-VACE-14B)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Image to Video Generation on OpenS2V-Eval
    GmeScore · 2024-11-26
    0.6757
    best: 0.7138 (Wan2.1-VACE-1.3B-Preview)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Image to Video Generation on OpenS2V-Eval
    Motion · 2024-11-26
    0.1352
    best: 0.416 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Image to Video Generation on OpenS2V-Eval
    NaturalScore · 2024-11-26
    0.7144
    best: 0.7906 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Image to Video Generation on OpenS2V-Eval
    NexusScore · 2024-11-26
    0.4355
    best: 0.4592 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Image to Video Generation on OpenS2V-Eval
    Total Score · 2024-11-26
    0.4759
    best: 0.5446 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)

Natural Language Processing · 7 results

  • Video Generation on OpenS2V-Eval
    Aesthetics · 2024-11-26
    0.4147
    best: 0.4824 (Wan2.1-VACE-1.3B)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video Generation on OpenS2V-Eval
    FaceSim · 2024-11-26
    0.3511
    best: 0.5509 (Wan2.1-VACE-14B)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video Generation on OpenS2V-Eval
    GmeScore · 2024-11-26
    0.6757
    best: 0.7138 (Wan2.1-VACE-1.3B-Preview)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video Generation on OpenS2V-Eval
    Motion · 2024-11-26
    0.1352
    best: 0.416 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video Generation on OpenS2V-Eval
    NaturalScore · 2024-11-26
    0.7144
    best: 0.7906 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video Generation on OpenS2V-Eval
    NexusScore · 2024-11-26
    0.4355
    best: 0.4592 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • Video Generation on OpenS2V-Eval
    Total Score · 2024-11-26
    0.4759
    best: 0.5446 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)

Audio · 7 results

  • 1 Image, 2*2 Stitchi on OpenS2V-Eval
    Aesthetics · 2024-11-26
    0.4147
    best: 0.4824 (Wan2.1-VACE-1.3B)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • 1 Image, 2*2 Stitchi on OpenS2V-Eval
    FaceSim · 2024-11-26
    0.3511
    best: 0.5509 (Wan2.1-VACE-14B)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • 1 Image, 2*2 Stitchi on OpenS2V-Eval
    GmeScore · 2024-11-26
    0.6757
    best: 0.7138 (Wan2.1-VACE-1.3B-Preview)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • 1 Image, 2*2 Stitchi on OpenS2V-Eval
    Motion · 2024-11-26
    0.1352
    best: 0.416 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • 1 Image, 2*2 Stitchi on OpenS2V-Eval
    NaturalScore · 2024-11-26
    0.7144
    best: 0.7906 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • 1 Image, 2*2 Stitchi on OpenS2V-Eval
    NexusScore · 2024-11-26
    0.4355
    best: 0.4592 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)
  • 1 Image, 2*2 Stitchi on OpenS2V-Eval
    Total Score · 2024-11-26
    0.4759
    best: 0.5446 (Kling 1.6)
    Identity-Preserving Text-to-Video Generation by Frequency Decomposition (arXiv:2411.17440)