TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Kling 1.6

Kling 1.6

Reported on 28 benchmarks across 4 tasks · 1 paper · 20 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Computer Vision14 results

  • VideoonOpenS2V-Eval
    FaceSim· 2024-11-26
    0.401
    best: 0.5509 (Wan2.1-VACE-14B)
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • VideoonOpenS2V-Eval
    Motion· 2024-11-26
    0.416
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • VideoonOpenS2V-Eval
    NaturalScore· 2024-11-26
    0.7906
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • VideoonOpenS2V-Eval
    NexusScore· 2024-11-26
    0.4592
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • VideoonOpenS2V-Eval
    Total Score· 2024-11-26
    0.5446
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Image to Video GenerationonOpenS2V-Eval
    FaceSim· 2024-11-26
    0.401
    best: 0.5509 (Wan2.1-VACE-14B)
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Image to Video GenerationonOpenS2V-Eval
    Motion· 2024-11-26
    0.416
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Image to Video GenerationonOpenS2V-Eval
    NaturalScore· 2024-11-26
    0.7906
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Image to Video GenerationonOpenS2V-Eval
    NexusScore· 2024-11-26
    0.4592
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Image to Video GenerationonOpenS2V-Eval
    Total Score· 2024-11-26
    0.5446
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • VideoonOpenS2V-Eval
    Aesthetics· 2024-11-26
    0.446
    best: 0.4824 (Wan2.1-VACE-1.3B)
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • VideoonOpenS2V-Eval
    GmeScore· 2024-11-26
    0.662
    best: 0.7138 (Wan2.1-VACE-1.3B-Preview)
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Image to Video GenerationonOpenS2V-Eval
    Aesthetics· 2024-11-26
    0.446
    best: 0.4824 (Wan2.1-VACE-1.3B)
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Image to Video GenerationonOpenS2V-Eval
    GmeScore· 2024-11-26
    0.662
    best: 0.7138 (Wan2.1-VACE-1.3B-Preview)
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440

Natural Language Processing7 results

  • Video GenerationonOpenS2V-Eval
    FaceSim· 2024-11-26
    0.401
    best: 0.5509 (Wan2.1-VACE-14B)
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Video GenerationonOpenS2V-Eval
    Motion· 2024-11-26
    0.416
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Video GenerationonOpenS2V-Eval
    NaturalScore· 2024-11-26
    0.7906
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Video GenerationonOpenS2V-Eval
    NexusScore· 2024-11-26
    0.4592
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Video GenerationonOpenS2V-Eval
    Total Score· 2024-11-26
    0.5446
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Video GenerationonOpenS2V-Eval
    Aesthetics· 2024-11-26
    0.446
    best: 0.4824 (Wan2.1-VACE-1.3B)
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • Video GenerationonOpenS2V-Eval
    GmeScore· 2024-11-26
    0.662
    best: 0.7138 (Wan2.1-VACE-1.3B-Preview)
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440

Audio7 results

  • 1 Image, 2*2 StitchionOpenS2V-Eval
    FaceSim· 2024-11-26
    0.401
    best: 0.5509 (Wan2.1-VACE-14B)
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • 1 Image, 2*2 StitchionOpenS2V-Eval
    Motion· 2024-11-26
    0.416
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • 1 Image, 2*2 StitchionOpenS2V-Eval
    NaturalScore· 2024-11-26
    0.7906
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • 1 Image, 2*2 StitchionOpenS2V-Eval
    NexusScore· 2024-11-26
    0.4592
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • 1 Image, 2*2 StitchionOpenS2V-Eval
    Total Score· 2024-11-26
    0.5446
    SOTA
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • 1 Image, 2*2 StitchionOpenS2V-Eval
    Aesthetics· 2024-11-26
    0.446
    best: 0.4824 (Wan2.1-VACE-1.3B)
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440
  • 1 Image, 2*2 StitchionOpenS2V-Eval
    GmeScore· 2024-11-26
    0.662
    best: 0.7138 (Wan2.1-VACE-1.3B-Preview)
    Identity-Preserving Text-to-Video Generation by Frequency DecompositionarXiv:2411.17440