# Image Synthesis

Image synthesis is the core feature of DiffSynth Studio. It can generate very high-resolution images, up to 4096*4096 in the Stable Diffusion example below.

### Example: FLUX

Example script: [`flux_text_to_image.py`](./flux_text_to_image.py)

|1024*1024 (original)|1024*1024 (classifier-free guidance)|2048*2048 (highres-fix)|
|-|-|-|
||||

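The middle column of the table above is labelled "classifier-free guidance". As a rough illustration of that general technique, the sketch below combines a conditional and an unconditional prediction with a guidance scale. It assumes the standard formulation `uncond + scale * (cond - uncond)`; the function name `cfg_combine` and the toy values are hypothetical and are not taken from `flux_text_to_image.py`.

```python
def cfg_combine(cond, uncond, scale):
    """Blend conditional and unconditional predictions element-wise.

    scale = 0 returns the unconditional prediction, scale = 1 returns the
    conditional one, and scale > 1 pushes further toward the prompt.
    """
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


# Toy 3-element "noise predictions" (hypothetical values, not model outputs).
cond = [1.0, 2.0, 3.0]
uncond = [0.0, 1.0, 1.0]

print(cfg_combine(cond, uncond, 1.0))  # same as the conditional prediction
print(cfg_combine(cond, uncond, 2.0))  # extrapolated past the conditional one
```

In a real pipeline the two predictions would be tensors produced by the denoising model for the prompt and for an empty or negative prompt.
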
### Example: Stable Diffusion

Example script: [`sd_text_to_image.py`](./sd_text_to_image.py)

LoRA Training: [`../train/stable_diffusion/`](../train/stable_diffusion/)

|512*512|1024*1024|2048*2048|4096*4096|
|-|-|-|-|
|||||

### Example: Stable Diffusion XL

Example script: [`sdxl_text_to_image.py`](./sdxl_text_to_image.py)

LoRA Training: [`../train/stable_diffusion_xl/`](../train/stable_diffusion_xl/)

|1024*1024|2048*2048|
|-|-|
|||

### Example: Stable Diffusion 3

Example script: [`sd3_text_to_image.py`](./sd3_text_to_image.py)

LoRA Training: [`../train/stable_diffusion_3/`](../train/stable_diffusion_3/)

|1024*1024|2048*2048|
|-|-|
|||

### Example: Kolors

Example script: [`kolors_text_to_image.py`](./kolors_text_to_image.py)

LoRA Training: [`../train/kolors/`](../train/kolors/)

|1024*1024|2048*2048|
|-|-|
|||

Kolors also supports models trained for SD-XL, such as ControlNets and LoRAs. See [`kolors_with_sdxl_models.py`](./kolors_with_sdxl_models.py).

LoRA: https://civitai.com/models/73305/zyd232s-ink-style

|Base model|with LoRA (alpha=0.5)|with LoRA (alpha=1.0)|with LoRA (alpha=1.5)|
|-|-|-|-|
|||||

ControlNet: https://huggingface.co/xinsir/controlnet-union-sdxl-1.0

|Reference image|Depth image|with ControlNet|with ControlNet|
|-|-|-|-|
|||||

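The `alpha` values in the LoRA table above control how strongly the LoRA is applied to the base model. A minimal sketch of that idea follows, assuming the common formulation in which the low-rank update is scaled by `alpha` and added to the base weight, `W' = W + alpha * (up @ down)`. The helper names `matmul` and `merge_lora` and the toy matrices are hypothetical; this is an illustration, not the code in `kolors_with_sdxl_models.py`.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(weight, lora_up, lora_down, alpha=1.0):
    """Return weight + alpha * (lora_up @ lora_down) without mutating the inputs."""
    delta = matmul(lora_up, lora_down)
    return [
        [w + alpha * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


# Toy 2x2 base weight and a rank-1 LoRA update (hypothetical values).
base = [[1.0, 0.0], [0.0, 1.0]]
up = [[1.0], [2.0]]    # shape 2x1
down = [[0.5, 0.5]]    # shape 1x2

for alpha in (0.0, 0.5, 1.0, 1.5):
    print(alpha, merge_lora(base, up, down, alpha))
```

With `alpha=0.0` the base weight is returned unchanged, which corresponds to the "Base model" column; larger values move the weights further along the LoRA direction.
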
### Example: Hunyuan-DiT

Example script: [`hunyuan_dit_text_to_image.py`](./hunyuan_dit_text_to_image.py)

LoRA Training: [`../train/hunyuan_dit/`](../train/hunyuan_dit/)

|1024*1024|2048*2048|
|-|-|
|||

### Example: Stable Diffusion XL Turbo

Example script: [`sdxl_turbo.py`](./sdxl_turbo.py)

We highly recommend using this model in the WebUI.

|"black car"|"red car"|
|-|-|
|||

### Example: Prompt Processing

If you are not a native English speaker, we provide a translation service. Our prompter can translate prompts from other languages into English and refine them using "BeautifulPrompt" models. See [`sd_prompt_refining.py`](./sd_prompt_refining.py) for more details.

Prompt: "一个漂亮的女孩" ("a beautiful girl"). The [translation model](https://huggingface.co/Helsinki-NLP/opus-mt-en-zh) translates it into English.

|seed=0|seed=1|seed=2|seed=3|
|-|-|-|-|
|||||

Prompt: "一个漂亮的女孩" ("a beautiful girl"). The [translation model](https://huggingface.co/Helsinki-NLP/opus-mt-en-zh) translates it into English, and then the [refining model](https://huggingface.co/alibaba-pai/pai-bloom-1b1-text2prompt-sd) refines the translated prompt for better visual quality.

|seed=0|seed=1|seed=2|seed=3|
|-|-|-|-|
|||||

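The two cases above differ only in which processing steps run before the prompt reaches the image model: translation alone, or translation followed by refinement. The sketch below shows that chaining pattern in plain Python. The names `process_prompt`, `toy_translate`, and `toy_refine` are hypothetical stand-ins, not the API used in `sd_prompt_refining.py`, and the toy functions do not call the linked models.

```python
from typing import Callable, Iterable

# A prompt step maps one prompt string to another.
PromptStep = Callable[[str], str]


def process_prompt(prompt: str, steps: Iterable[PromptStep]) -> str:
    """Apply each step in order, feeding every output into the next step."""
    for step in steps:
        prompt = step(prompt)
    return prompt


def toy_translate(prompt: str) -> str:
    """Hypothetical stand-in for a translation model: a one-entry lookup table."""
    return {"一个漂亮的女孩": "a beautiful girl"}.get(prompt, prompt)


def toy_refine(prompt: str) -> str:
    """Hypothetical stand-in for a refining model: append fixed quality tags."""
    return prompt + ", highly detailed, best quality"


translated_only = process_prompt("一个漂亮的女孩", [toy_translate])
translated_and_refined = process_prompt("一个漂亮的女孩", [toy_translate, toy_refine])
print(translated_only)
print(translated_and_refined)
```

Keeping each step as a plain function from string to string makes it easy to enable or skip refinement without changing the rest of the pipeline.
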