Files
DiffSynth-Studio/examples/image_synthesis
2024-08-17 17:51:31 +08:00
..
2024-08-17 17:51:31 +08:00
2024-07-10 10:07:02 +08:00
2024-08-02 14:46:29 +08:00
2024-08-17 17:51:31 +08:00
2024-08-17 17:51:31 +08:00
2024-07-10 13:41:11 +08:00
2024-07-26 12:15:40 +08:00
2024-06-27 19:43:50 +08:00

Image Synthesis

Image synthesis is the base feature of DiffSynth Studio. We can generate images with very high resolution.

Example: Stable Diffusion

Example script: sd_text_to_image.py

512*512 1024*1024 2048*2048 4096*4096
512 1024 2048 4096

Example: Stable Diffusion XL

Example script: sdxl_text_to_image.py

1024*1024 2048*2048
1024 2048

Example: Stable Diffusion 3

Example script: sd3_text_to_image.py

LoRA Training: ../train/stable_diffusion_3/

1024*1024 2048*2048
image_1024 image_2048

Example: Kolors

Example script: kolors_text_to_image.py

LoRA Training: ../train/kolors/

1024*1024 2048*2048
image_1024 image_2048

Kolors also support the models trained for SD-XL. For example, ControlNets and LoRAs. See kolors_with_sdxl_models.py

LoRA: https://civitai.com/models/73305/zyd232s-ink-style

Base model with LoRA (alpha=0.5) with LoRA (alpha=1.0) with LoRA (alpha=1.5)
image_0 0 image_0 5 image_1 0 image_1 5

ControlNet: https://huggingface.co/xinsir/controlnet-union-sdxl-1.0

Reference image Depth image with ControlNet with ControlNet
image_0 0 controlnet_input image_depth_1 image_depth_2

Example: Hunyuan-DiT

Example script: hunyuan_dit_text_to_image.py

LoRA Training: ../train/hunyuan_dit/

1024*1024 2048*2048
image_1024 image_2048

Example: Stable Diffusion XL Turbo

Example script: sdxl_turbo.py

We highly recommend you to use this model in the WebUI.

"black car" "red car"
black_car black_car_to_red_car

Example: Prompt Processing

If you are not native English user, we provide translation service for you. Our prompter can translate other language to English and refine it using "BeautifulPrompt" models. Please see sd_prompt_refining.py for more details.

Prompt: "一个漂亮的女孩". The translation model will translate it to English.

seed=0 seed=1 seed=2 seed=3
0_ 1_ 2_ 3_

Prompt: "一个漂亮的女孩". The translation model will translate it to English. Then the refining model will refine the translated prompt for better visual quality.

seed=0 seed=1 seed=2 seed=3
0 1 2 3