DiffSynth-Studio/examples/wanvideo at 8badd63a2d752a03e0690af372e905b7af672cbe - DiffSynth-Studio - pplokijuhyg个人git站点

theluyuan/DiffSynth-Studio

mirror of https://github.com/modelscope/DiffSynth-Studio.git synced 2026-04-08 17:18:21 +00:00

Files

History

CD22104 b1afff1728 camera

2025-06-11 17:24:09 +08:00

..

model_inference

camera

2025-06-11 17:24:09 +08:00

camera

2025-06-11 17:24:09 +08:00

README_zh.md

refine wan doc

2025-06-06 15:19:09 +08:00

README.md

new wan trainer

2025-06-06 14:58:41 +08:00

wan_1.3b_text_to_video_accelerate.py

support teacache in wan

2025-03-14 17:45:52 +08:00

wan_14b_text_to_video_usp.py

fix usp dependency

2025-03-25 19:26:24 +08:00

README.md

dataset
- --dataset_base_path: Base path of the Dataset.
- --dataset_metadata_path: Metadata path of the Dataset.
- --height: Image or video height. Leave height and width None to enable dynamic resolution.
- --width: Image or video width. Leave height and width None to enable dynamic resolution.
- --num_frames: Number of frames in each video. The frames are sampled from the prefix.
- --data_file_keys: Data file keys in metadata. Separated by commas.
- --dataset_repeat: Number of times the dataset is repeated in each epoch.
Model
- --model_paths: Model paths to be loaded. JSON format.
- --model_id_with_origin_paths: Model ID with original path, e.g., Wan-AI/Wan2.1-T2V-1.3B:diffusion_pytorch_model*.safetensors. Separated by commas.
Training
- --learning_rate: Learning rate.
- --num_epochs: Number of epochs.
- --output_path: Save path.
- --remove_prefix_in_ckpt: Remove prefix in ckpt.
Trainable module
- --trainable_models: Trainable models, e.g., dit, vae, text_encoder.
- --lora_base_model: Add LoRA on which model.
- --lora_target_modules: Add LoRA on which layer.
- --lora_rank: LoRA rank.
Extra model input
- --input_contains_input_image: Model input contains input_image
- --input_contains_end_image: Model input contains end_image.
- --input_contains_control_video: Model input contains control_video.
- --input_contains_reference_image: Model input contains reference_image.
- --input_contains_vace_video: Model input contains vace_video.
- --input_contains_vace_reference_image: Model input contains vace_reference_image.
- --input_contains_motion_bucket_id: Model input contains motion_bucket_id.