Image Synthesis
Image synthesis is the base feature of DiffSynth Studio.
Example: Stable Diffusion
We can generate images with very high resolution. Please see sd_text_to_image.py for more details.
| 512*512 | 1024*1024 | 2048*2048 | 4096*4096 |
|---|---|---|---|
Example: Stable Diffusion XL
Generate images with Stable Diffusion XL. Please see sdxl_text_to_image.py for more details.
| 1024*1024 | 2048*2048 |
|---|---|
Example: Stable Diffusion 3
Generate images with Stable Diffusion 3. High resolution is also supported in this model. See sd3_text_to_image.py.
| 1024*1024 | 2048*2048 |
|---|---|
Example: Stable Diffusion XL Turbo
Generate images with Stable Diffusion XL Turbo. You can see sdxl_turbo.py for more details, but we highly recommend you to use it in the WebUI.
| "black car" | "red car" |
|---|---|
Example: Prompt Processing
If you are not native English user, we provide translation service for you. Our prompter can translate other language to English and refine it using "BeautifulPrompt" models. Please see sd_prompt_refining.py for more details.
Prompt: "一个漂亮的女孩". The translation model will translate it to English.
| seed=0 | seed=1 | seed=2 | seed=3 |
|---|---|---|---|
Prompt: "一个漂亮的女孩". The translation model will translate it to English. Then the refining model will refine the translated prompt for better visual quality.
| seed=0 | seed=1 | seed=2 | seed=3 |
|---|---|---|---|
Example: Stable Diffusion 3 with Textual Inversions (Experimental)
Since Stable Diffusion 3 utilizes the same text encoder as Stable Diffusion 1.x, it supports the textual inversions designed for Stable Diffusion 1.x. However, we found that the textual inversions may cause unpredictable effects to the model. We can only guarantee that these textual inversions can be loaded into the model. The example script is sd3_text_to_image_textual_inversion.py
Prompt: "a girl, highly detailed, absurd res, perfect image". Without any textual inversions.
| seed=0 | seed=1 | seed=2 | seed=3 |
|---|---|---|---|
Prompt: "a girl, highly detailed, absurd res, perfect image". With verybadimagenegative_v1.3 on the negative side.
| seed=0 | seed=1 | seed=2 | seed=3 |
|---|---|---|---|