mirror of
https://github.com/modelscope/DiffSynth-Studio.git
synced 2026-03-19 06:39:43 +00:00
96 lines
5.4 KiB
Markdown
96 lines
5.4 KiB
Markdown
# Image Synthesis
|
|
|
|
Image synthesis is the base feature of DiffSynth Studio. We can generate images with very high resolution.
|
|
|
|
### OmniGen
|
|
|
|
OmniGen is a text-image-to-image model, you can synthesize an image according to several given reference images.
|
|
|
|
|Reference image 1|Reference image 2|Synthesized image|
|
|
|-|-|-|
|
|
||||
|
|
|
|
### Example: FLUX
|
|
|
|
Example script: [`flux_text_to_image.py`](./flux_text_to_image.py) and [`flux_text_to_image_low_vram.py`](./flux_text_to_image_low_vram.py)(low VRAM).
|
|
|
|
The original version of FLUX doesn't support classifier-free guidance; however, we believe that this guidance mechanism is an important feature for synthesizing beautiful images. You can enable it using the parameter `cfg_scale`, and the extra guidance scale introduced by FLUX is `embedded_guidance`.
|
|
|
|
|1024*1024 (original)|1024*1024 (classifier-free guidance)|2048*2048 (highres-fix)|
|
|
|-|-|-|
|
|
||||
|
|
|
|
### Example: Stable Diffusion
|
|
|
|
Example script: [`sd_text_to_image.py`](./sd_text_to_image.py)
|
|
|
|
LoRA Training: [`../train/stable_diffusion/`](../train/stable_diffusion/)
|
|
|
|
|512*512|1024*1024|2048*2048|4096*4096|
|
|
|-|-|-|-|
|
|
|||||
|
|
|
|
### Example: Stable Diffusion XL
|
|
|
|
Example script: [`sdxl_text_to_image.py`](./sdxl_text_to_image.py)
|
|
|
|
LoRA Training: [`../train/stable_diffusion_xl/`](../train/stable_diffusion_xl/)
|
|
|
|
|1024*1024|2048*2048|
|
|
|-|-|
|
|
|||
|
|
|
|
### Example: Stable Diffusion 3
|
|
|
|
Example script: [`sd3_text_to_image.py`](./sd3_text_to_image.py)
|
|
|
|
LoRA Training: [`../train/stable_diffusion_3/`](../train/stable_diffusion_3/)
|
|
|
|
|1024*1024|2048*2048|
|
|
|-|-|
|
|
|||
|
|
|
|
### Example: Kolors
|
|
|
|
Example script: [`kolors_text_to_image.py`](./kolors_text_to_image.py)
|
|
|
|
LoRA Training: [`../train/kolors/`](../train/kolors/)
|
|
|
|
|1024*1024|2048*2048|
|
|
|-|-|
|
|
|||
|
|
|
|
Kolors also support the models trained for SD-XL. For example, ControlNets and LoRAs. See [`kolors_with_sdxl_models.py`](./kolors_with_sdxl_models.py)
|
|
|
|
LoRA: https://civitai.com/models/73305/zyd232s-ink-style
|
|
|
|
|Base model|with LoRA (alpha=0.5)|with LoRA (alpha=1.0)|with LoRA (alpha=1.5)|
|
|
|-|-|-|-|
|
|
|||||
|
|
|
|
ControlNet: https://huggingface.co/xinsir/controlnet-union-sdxl-1.0
|
|
|
|
|Reference image|Depth image|with ControlNet|with ControlNet|
|
|
|-|-|-|-|
|
|
|||||
|
|
|
|
### Example: Hunyuan-DiT
|
|
|
|
Example script: [`hunyuan_dit_text_to_image.py`](./hunyuan_dit_text_to_image.py)
|
|
|
|
LoRA Training: [`../train/hunyuan_dit/`](../train/hunyuan_dit/)
|
|
|
|
|1024*1024|2048*2048|
|
|
|-|-|
|
|
|||
|
|
|
|
### Example: Stable Diffusion XL Turbo
|
|
|
|
Example script: [`sdxl_turbo.py`](./sdxl_turbo.py)
|
|
|
|
We highly recommend you to use this model in the WebUI.
|
|
|
|
|"black car"|"red car"|
|
|
|-|-|
|
|
|||
|