Image Synthesis
Image synthesis is the core feature of DiffSynth Studio. It can generate images at very high resolutions, up to 4096*4096 in the examples below.
Example: FLUX
Example script: flux_text_to_image.py
The original version of FLUX doesn't support classifier-free guidance; however, we believe that this guidance mechanism is important for synthesizing beautiful images. You can enable it with the parameter cfg_scale. The extra guidance scale introduced by FLUX is controlled by the parameter embedded_guidance.
| 1024*1024 (original) | 1024*1024 (classifier-free guidance) | 2048*2048 (highres-fix) |
|---|---|---|
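To make the role of cfg_scale concrete, here is a minimal sketch of the standard classifier-free guidance blend. It assumes the usual formulation, in which the model is evaluated once with the prompt and once with a negative (or empty) prompt, and the two noise predictions are combined. The function name and the plain-list inputs are illustrative only; an actual pipeline would operate on tensors, and the exact implementation in flux_text_to_image.py may differ.

```python
def classifier_free_guidance(noise_cond, noise_uncond, cfg_scale):
    """Blend conditional and unconditional noise predictions.

    Standard formulation (assumed here, not copied from DiffSynth Studio):
        guided = uncond + cfg_scale * (cond - uncond)
    """
    return [u + cfg_scale * (c - u) for c, u in zip(noise_cond, noise_uncond)]


# Toy noise predictions standing in for model outputs.
cond = [0.5, -1.0, 2.0]
uncond = [0.0, -0.5, 1.0]

# cfg_scale = 1.0 reduces to the conditional prediction (no extra guidance).
no_guidance = classifier_free_guidance(cond, uncond, 1.0)

# cfg_scale > 1.0 pushes the result further away from the unconditional one.
guided = classifier_free_guidance(cond, uncond, 2.0)
print(guided)  # [1.0, -1.5, 3.0]
```

Under this formulation, a cfg_scale of 1.0 is equivalent to running without classifier-free guidance, which is why values above 1.0 are the ones that change the output.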
Example: Stable Diffusion
Example script: sd_text_to_image.py
LoRA Training: ../train/stable_diffusion/
| 512*512 | 1024*1024 | 2048*2048 | 4096*4096 |
|---|---|---|---|
Example: Stable Diffusion XL
Example script: sdxl_text_to_image.py
LoRA Training: ../train/stable_diffusion_xl/
| 1024*1024 | 2048*2048 |
|---|---|
Example: Stable Diffusion 3
Example script: sd3_text_to_image.py
LoRA Training: ../train/stable_diffusion_3/
| 1024*1024 | 2048*2048 |
|---|---|
Example: Kolors
Example script: kolors_text_to_image.py
LoRA Training: ../train/kolors/
| 1024*1024 | 2048*2048 |
|---|---|
Kolors also supports models trained for SD-XL, such as ControlNets and LoRAs. See kolors_with_sdxl_models.py.
LoRA: https://civitai.com/models/73305/zyd232s-ink-style
| Base model | with LoRA (alpha=0.5) | with LoRA (alpha=1.0) | with LoRA (alpha=1.5) |
|---|---|---|---|
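The alpha values in the table above scale how strongly the LoRA modifies the base model. As a rough illustration, the sketch below uses the common LoRA merge rule, where a low-rank update (the product of an "up" and a "down" matrix) is multiplied by alpha and added to the base weight. All names here are hypothetical and the matrices are toy values; this is not the loading code used by kolors_with_sdxl_models.py.

```python
def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def merge_lora(weight, lora_up, lora_down, alpha):
    """Return weight + alpha * (lora_up @ lora_down) (assumed merge rule)."""
    delta = matmul(lora_up, lora_down)
    return [
        [w + alpha * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Toy 2x2 base weight with a rank-1 LoRA update.
weight = [[1.0, 0.0], [0.0, 1.0]]
lora_up = [[1.0], [2.0]]      # shape 2x1
lora_down = [[0.5, -0.5]]     # shape 1x2

for alpha in (0.5, 1.0, 1.5):
    print(alpha, merge_lora(weight, lora_up, lora_down, alpha))
```

With alpha=0 the base weights are unchanged, and larger values move the weights further along the direction learned by the LoRA, which matches the progressively stronger style in the comparison.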
ControlNet: https://huggingface.co/xinsir/controlnet-union-sdxl-1.0
| Reference image | Depth image | with ControlNet | with ControlNet |
|---|---|---|---|
Example: Hunyuan-DiT
Example script: hunyuan_dit_text_to_image.py
LoRA Training: ../train/hunyuan_dit/
| 1024*1024 | 2048*2048 |
|---|---|
Example: Stable Diffusion XL Turbo
Example script: sdxl_turbo.py
We highly recommend using this model in the WebUI.
| "black car" | "red car" |
|---|---|
Example: Prompt Processing
If you are not a native English speaker, we provide a translation service. Our prompter can translate prompts from other languages into English and refine them using "BeautifulPrompt" models. See sd_prompt_refining.py for more details.
Prompt: "一个漂亮的女孩" ("a beautiful girl"). The translation model translates it into English.
| seed=0 | seed=1 | seed=2 | seed=3 |
|---|---|---|---|
Prompt: "一个漂亮的女孩" ("a beautiful girl"). The translation model translates it into English, and the refining model then rewrites the translated prompt for better visual quality.
| seed=0 | seed=1 | seed=2 | seed=3 |
|---|---|---|---|
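The two-stage flow described in this section (translate, then refine) can be pictured as a chain of prompt processors applied in order. The sketch below is purely illustrative: translate and refine are hypothetical stand-ins that use a lookup table and a fixed suffix instead of real models, and none of these names come from sd_prompt_refining.py.

```python
def translate(prompt):
    """Hypothetical stand-in for a translation model (lookup table only)."""
    table = {"一个漂亮的女孩": "a beautiful girl"}
    return table.get(prompt, prompt)


def refine(prompt):
    """Hypothetical stand-in for a prompt-refining model such as BeautifulPrompt."""
    return prompt + ", highly detailed, best quality"


def process_prompt(prompt, processors):
    """Apply each processor in order, feeding the output of one into the next."""
    for processor in processors:
        prompt = processor(prompt)
    return prompt


# Translation only, as in the first comparison.
print(process_prompt("一个漂亮的女孩", [translate]))

# Translation followed by refinement, as in the second comparison.
print(process_prompt("一个漂亮的女孩", [translate, refine]))
```

Keeping each stage as a separate callable makes it easy to enable translation alone or translation plus refinement, mirroring the two comparisons shown here.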