DropletVideo-10M

VideosCC BY-SA 4.0Introduced 2025-03-08

DropletVideo is a project exploring high-order spatio-temporal consistency in image-to-video generation. It is trained on DropletVideo-10M. The model supports multi-resolution inputs, dynamic FPS control for motion intensity, and demonstrates potential for 3D consistency. The model supports multi-resolution inputs, dynamic FPS control for motion intensity, and demonstrates potential for 3D consistency. For further details, you can check our project page as well as the technical report.

Features:

  • Multi-resolution inputs, accommodating pixel values from 512x512x85(default 672x384x85) to 896x896x85(default 1120x640x85, and videos with different aspect ratios.

  • Dynamic FPS control for motion intensity.