Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
stepvideo_text_to_video.py		stepvideo_text_to_video.py
stepvideo_text_to_video_low_vram.py		stepvideo_text_to_video_low_vram.py
stepvideo_text_to_video_quantized.py		stepvideo_text_to_video_quantized.py

README.md

Stepvideo

StepVideo is a state-of-the-art (SoTA) text-to-video pre-trained model with 30 billion parameters and the capability to generate videos up to 204 frames.

Model: https://modelscope.cn/models/stepfun-ai/stepvideo-t2v/summary
GitHub: https://github.com/stepfun-ai/Step-Video-T2V
Technical report: https://arxiv.org/abs/2502.10248

Examples

For original BF16 version, please see ./stepvideo_text_to_video.py. 80G VRAM required.

We also support auto-offload, which can reduce the VRAM requirement to 24GB; however, it requires 2x time for inference. Please see ./stepvideo_text_to_video_low_vram.py.

video.mp4

For FP8 quantized version, please see ./stepvideo_text_to_video_quantized.py. 40G VRAM required.

video.mp4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stepvideo

stepvideo

README.md

Stepvideo

Examples

Files

stepvideo

Directory actions

More options

Directory actions

More options

Latest commit

History

stepvideo

Folders and files

parent directory

README.md

Stepvideo

Examples