Stable Diffusion WebUI: 159K+ Stars — The Complete Setup Guide 2026
Stable Diffusion WebUI (AUTOMATIC1111) is the most popular web interface for local AI image generation. Compatible with ControlNet, LoRA, ComfyUI workflows. Covers Windows, Linux, Docker install, extension setup, production hardening, and GPU benchmarks.
- ⭐ 159000
- AGPL-3.0
- Updated 2026-05-19
{{< resource-info >}}
Stable Diffusion WebUI by AUTOMATIC1111 remains the most widely adopted open-source interface for local AI image generation. With 159,000+ GitHub stars, it has accumulated a larger community than any competing interface — ComfyUI, InvokeAI, and Fooocus combined. If you are building a local AI image pipeline, understanding how to install, configure, and extend this tool is a practical necessity, not an option.
This stable diffusion webui tutorial walks through the complete automatic1111 setup across Windows, Linux, and sd webui docker deployment. You will find real commands for each platform, extension configuration for ControlNet and LoRA, production hardening tips, and performance benchmarks on consumer GPUs. Whether you need a quick stable diffusion setup for personal projects or a production image generation tutorial for your team, this guide covers every step with zero theory — only a working machine with an NVIDIA GPU.

What Is Stable Diffusion WebUI? #
Stable Diffusion WebUI is a browser-based interface for running Stable Diffusion models locally. It wraps the underlying inference pipeline in a tabbed web application accessible at http://localhost:7860, providing controls for txt2img, img2img, inpainting, upscaling, model merging, and training LoRA/DreamBooth — all without writing code.
The project is maintained by AUTOMATIC1111 under the AGPL-3.0 license. Version 1.10.1 (released early 2025) added SDXL refinement pass improvements, better memory management for 8GB GPUs, and native support for SD3 medium inference. The extension ecosystem includes over 1,000 community plugins covering everything from prompt auto-completion to batch processing pipelines.
How Stable Diffusion WebUI Works #
The architecture follows a modular Python backend + Gradio frontend pattern:

User Browser (Gradio UI)
|
v
Gradio Server (port 7860) --- API Endpoints (/sdapi/v1/)
|
v
Python Backend (modules/)
|-- Model Loader (safetensors/ckpt)
|-- Sampling (Euler, DPM++, etc.)
|-- VAE Decode/Encode
|-- Post-processing (GFPGAN, CodeFormer)
|
v
PyTorch + CUDA --- GPU (VRAM: 4-24GB)
Key concepts to understand before installation:
- Checkpoint: The main model file (
.safetensorsor.ckpt) containing the trained diffusion weights. SD 1.5 models are ~4GB; SDXL models are ~6-7GB. - VAE (Variational Autoencoder): Handles the encode/decode step between pixel space and latent space. A mismatched VAE produces desaturated or blurry outputs.
- Sampler: The algorithm that progressively denoises latent noise into an image. DPM++ 2M Karras is the most widely recommended for quality/speed balance.
- CFG Scale (Classifier-Free Guidance): Controls how strictly the model follows your prompt. Values of 7-9 work for most use cases; higher values increase contrast but may introduce artifacts.
- Extensions: Python plugins loaded at runtime that add UI tabs, API endpoints, or modify the inference pipeline. ControlNet and Adetailer are the most widely used.
Installation & Setup #
Stable Diffusion WebUI supports Windows, Linux, and macOS. The fastest path on all platforms is via the official install scripts.
System Requirements #
| Component | Minimum | Recommended |
|---|---|---|
| GPU | NVIDIA 4GB VRAM | NVIDIA RTX 3060 12GB+ |
| RAM | 8GB | 16GB |
| Storage | 20GB SSD | 100GB SSD (for models) |
| OS | Windows 10 / Ubuntu 20.04 | Windows 11 / Ubuntu 22.04 |
| Python | 3.10.6 | 3.10.11 |
Windows Installation (Automatic) #
The automatic installer handles Git, Python, and dependency setup:
:: Download sd.webui.zip from the releases page
:: Extract to C:\stable-diffusion-webui
:: Run the updater first
cd C:\stable-diffusion-webui
update.bat
:: Launch the web UI
run.bat
On first launch, the script downloads PyTorch, transformers, and the default SD 1.5 model (~4GB). Subsequent starts take 15-30 seconds. The UI becomes available at http://127.0.0.1:7860.
Windows Command-Line Arguments #
For GPUs with limited VRAM or specific optimization needs, edit webui-user.bat:
@echo off
set PYTHON=python
set GIT=git
set VENV_DIR=venv
set COMMANDLINE_ARGS=--xformers --autolaunch --update-check
:: VRAM optimization options (pick ONE):
:: set COMMANDLINE_ARGS=--medvram &:: 8GB GPUs
:: set COMMANDLINE_ARGS=--lowvram &:: 4GB GPUs
:: set COMMANDLINE_ARGS=--normalvram &:: 12GB+ GPUs
:: For RTX 40-series, add --xformers for 20-30% speedup
:: For black/green images, add --precision full --no-half
call webui.bat
Linux Installation (Manual) #
Manual installation gives full control over the Python environment:
# Install dependencies (Ubuntu/Debian)
sudo apt update && sudo apt install -y wget git python3 python3-venv libgl1 libglib2.0-0
# Clone the repository
git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git
cd stable-diffusion-webui
# Create virtual environment and install
python3 -m venv venv
source venv/bin/activate
pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
pip install -r requirements.txt
# Launch
./webui.sh --xformers --listen
For systems without a display (headless servers), add --listen to expose the UI on all interfaces and --gradio-auth username:password for basic authentication.
Docker Installation (Recommended for Production) #
Docker provides the most reproducible setup, especially for server deployments:
# Dockerfile.stable-diffusion-webui
FROM nvidia/cuda:12.1.1-devel-ubuntu22.04
ENV DEBIAN_FRONTEND=noninteractive
RUN apt-get update && apt-get install -y \
python3 python3-pip python3-venv git wget \
libgl1 libglib2.0-0 libsm6 libxext6 \
&& rm -rf /var/lib/apt/lists/*
RUN useradd -m -s /bin/bash sduser
WORKDIR /home/sduser
RUN git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git
WORKDIR /home/sduser/stable-diffusion-webui
RUN python3 -m venv venv && \
. venv/bin/activate && \
pip install --no-cache-dir torch torchvision torchaudio \
--index-url https://download.pytorch.org/whl/cu121 && \
pip install -r requirements.txt
RUN chown -R sduser:sduser /home/sduser
USER sduser
EXPOSE 7860
ENTRYPOINT ["bash", "-c", \". venv/bin/activate && python3 launch.py --listen --api --xformers"]
Build and run:
# Build the image
docker build -f Dockerfile.stable-diffusion-webui -t sd-webui:latest .
# Run with GPU access and model persistence
docker run -d \
--name stable-diffusion-webui \
--gpus all \
-p 7860:7860 \
-v $(pwd)/models:/home/sduser/stable-diffusion-webui/models/Stable-diffusion \
-v $(pwd)/outputs:/home/sduser/stable-diffusion-webui/outputs \
-v $(pwd)/extensions:/home/sduser/stable-diffusion-webui/extensions \
-e NVIDIA_VISIBLE_DEVICES=all \
sd-webui:latest
For docker-compose users:
# docker-compose.yml
version: '3.8'
services:
stable-diffusion-webui:
build:
context: .
dockerfile: Dockerfile.stable-diffusion-webui
container_name: sd-webui
runtime: nvidia
ports:
- "7860:7860"
volumes:
- ./models:/home/sduser/stable-diffusion-webui/models/Stable-diffusion
- ./outputs:/home/sduser/stable-diffusion-webui/outputs
- ./extensions:/home/sduser/stable-diffusion-webui/extensions
- ./vae:/home/sduser/stable-diffusion-webui/models/VAE
environment:
- NVIDIA_VISIBLE_DEVICES=all
deploy:
resources:
reservations:
devices:
- driver: nvidia
count: all
capabilities: [gpu]
restart: unless-stopped
Deploy with one command:
docker-compose up -d
Cloud GPU Setup with DigitalOcean #
If local hardware is insufficient, DigitalOcean GPU Droplets provide on-demand NVIDIA H100 instances starting at competitive hourly rates. New accounts receive $200 in free credits valid for 60 days — enough to run an H100 instance for extended testing.
# On a DigitalOcean GPU Droplet (Ubuntu 22.04 pre-configured)
sudo apt update && sudo apt install -y git wget
# Clone and setup
git clone https://github.com/AUTOMATIC1111/stable-diffusion-webui.git
cd stable-diffusion-webui
# Install with CUDA 12 support
python3 -m venv venv
source venv/bin/activate
pip install torch torchvision --index-url https://download.pytorch.org/whl/cu121
pip install -r requirements.txt
# Launch with public access (configure firewall rules)
python3 launch.py --listen --port 7860 --xformers --gradio-auth admin:securepassword123
Disclosure: DigitalOcean links are affiliate links. We may earn a commission at no additional cost to you. We only recommend services we would use ourselves.
Integration with ControlNet, LoRA, and Popular Tools #
ControlNet Extension Setup #
ControlNet enables structure-guided generation — pose transfer, depth-aware composition, edge-based control:

# Install via Extensions tab (recommended)
# 1. Open WebUI → Extensions → Available
# 2. Click "Load from"
# 3. Search "ControlNet"
# 4. Click Install on "sd-webui-controlnet"
# 5. Restart UI
# Or install manually:
cd extensions
git clone https://github.com/Mikubill/sd-webui-controlnet.git
Download ControlNet models to models/ControlNet/:
# Essential ControlNet models (SD 1.5)
wget -P models/ControlNet/ https://huggingface.co/lllyasviel/ControlNet-v1-1/resolve/main/control_v11p_sd15_openpose.pth
wget -P models/ControlNet/ https://huggingface.co/lllyasviel/ControlNet-v1-1/resolve/main/control_v11f1p_sd15_depth.pth
wget -P models/ControlNet/ https://huggingface.co/lllyasviel/ControlNet-v1-1/resolve/main/control_v11p_sd15_canny.pth
wget -P models/ControlNet/ https://huggingface.co/lllyasviel/ControlNet-v1-1/resolve/main/control_v11p_sd15_lineart.pth
# SDXL ControlNet models
wget -P models/ControlNet/ https://huggingface.co/lllyasviel/sd_control_collection/resolve/main/diffusers_xl_canny_mid.safetensors
wget -P models/ControlNet/ https://huggingface.co/lllyasviel/sd_control_collection/resolve/main/diffusers_xl_depth_mid.safetensors
Configure ControlNet in the UI:
// settings.json - ControlNet configuration
{
"control_net_max_models_num": 3,
"control_net_model_cache_size": 2,
"control_net_control_transfer": false,
"control_net_detectedmap_dir": "extensions/sd-webui-controlnet/detected_maps",
"control_net_models_path": "models/ControlNet",
"control_net_modules_path": "extensions/sd-webui-controlnet/annotator",
"control_net_unit_mode": false,
"control_net_sync_field_args": true
}
LoRA (Low-Rank Adaptation) Integration #
LoRA files are lightweight adapters (~10-200MB) that fine-tune model behavior without replacing the base checkpoint:
# Download LoRA models to the dedicated directory
# Place .safetensors LoRA files in:
# models/Lora/
# Example: Download a popular style LoRA
wget -P models/Lora/ "https://civitai.com/api/download/models/12345"
Using LoRA in prompts:
<lora:add-detail-xl:1.0>, masterpiece, best quality, portrait of a warrior
<lora:epiCRealismHelper:0.6>, photorealistic, 8k uhd
The syntax is <lora:filename:weight> where weight ranges from 0.0 to 1.0. Multiple LoRAs can be stacked in a single prompt.
ComfyUI Workflow Bridge #
For users who need node-based workflows alongside the WebUI interface:
# Install ComfyUI as a secondary tool (recommended over migration)
git clone https://github.com/comfyanonymous/ComfyUI.git
cd ComfyUI
pip install -r requirements.txt
# Share model directories to avoid duplication
ln -s /path/to/stable-diffusion-webui/models/Stable-diffusion models/checkpoints
ln -s /path/to/stable-diffusion-webui/models/Lora models/loras
ln -s /path/to/stable-diffusion-webui/models/ControlNet models/controlnet
# Launch on a different port
python main.py --port 8188
This setup lets you use Stable Diffusion WebUI for quick prototyping and ComfyUI for complex multi-stage pipelines, sharing the same model library.
Essential Extensions List #
# Install these extensions for a production-ready setup
cd extensions
# Adetailer - automatic face/hand/detail fixing
git clone https://github.com/Bing-su/adetailer.git
# Ultimate SD Upscale - high-resolution upscaling
git clone https://github.com/Coyote-A/ultimate-upscale-for-automatic1111.git
# Tiled VAE - reduce VRAM usage for large images
git clone https://github.com/pkuliyi2015/multidiffusion-upscaler-for-automatic1111.git
# Tag Autocomplete - prompt helper
git clone https://github.com/DominikDoom/a1111-sd-webui-tagcomplete.git
# System Info + Benchmark
git clone https://github.com/vladmandic/sd-extension-system-info.git
# Inpaint Anything - segment anything + inpainting
git clone https://github.com/Uminosachi/sd-webui-inpaint-anything.git
# Restart the WebUI after installing extensions
Benchmarks / Real-World Use Cases #
GPU Performance Comparison #
All benchmarks use Stable Diffusion WebUI v1.10.1, DPM++ 2M Karras sampler, 20 steps, batch size 1:
| GPU | VRAM | SD 1.5 512x512 | SDXL 1024x1024 | SDXL + ControlNet |
|---|---|---|---|---|
| RTX 4060 Ti 16GB | 16 GB | ~4.2s | ~12.0s | ~16.5s |
| RTX 3090 | 24 GB | ~2.4s | ~5.6s | ~9.2s |
| RTX 4090 | 24 GB | ~1.1s | ~3.2s | ~4.8s |
| RTX 5090 | 32 GB | ~0.8s | ~2.2s | ~3.5s |
Sources: Community benchmarks via sd-extension-system-info, averaged over 10 runs.
VRAM Usage by Workflow #
| Workflow | VRAM Usage (RTX 4090) | Notes |
|---|---|---|
| txt2img SD 1.5 @ 512x512 | ~4.5 GB | Fits on any modern GPU |
| txt2img SDXL @ 1024x1024 | ~8.0 GB | Requires 8GB+ VRAM |
| SDXL + 1x ControlNet | ~12.5 GB | Use --medvram on 8GB cards |
| SDXL + Hi-Res Fix 2x | ~14.0 GB | Tiled VAE recommended |
| SDXL + 2x ControlNet + Adetailer | ~20.0 GB | RTX 3090/4090 recommended |
Memory Optimization Flags #
# For 4GB VRAM GPUs (entry-level):
python3 launch.py --lowvram --precision full --no-half --xformers
# For 6-8GB VRAM GPUs (mainstream):
python3 launch.py --medvram --xformers --opt-split-attention
# For 12GB+ VRAM GPUs (high-end):
python3 launch.py --xformers --opt-sdp-attention
# For 24GB VRAM GPUs (enthusiast):
python3 launch.py --xformers --opt-sdp-attention --no-half-vae
Advanced Usage / Production Hardening #
API Integration #
Stable Diffusion WebUI exposes a full REST API at /sdapi/v1/:
# Python client for txt2img API
import requests
import json
url = "http://localhost:7860/sdapi/v1/txt2img"
payload = {
"prompt": "masterpiece, best quality, portrait of a cyberpunk samurai, neon city background, 8k uhd",
"negative_prompt": "blurry, low quality, deformed, ugly, duplicate",
"steps": 25,
"cfg_scale": 7.0,
"width": 1024,
"height": 1024,
"sampler_name": "DPM++ 2M Karras",
"batch_size": 1,
"seed": -1,
"override_settings": {
"sd_model_checkpoint": "sd_xl_base_1.0.safetensors"
}
}
response = requests.post(url, json=payload)
result = response.json()
# Save the generated image
import base64
for i, img_data in enumerate(result['images']):
with open(f"output_{i}.png", "wb") as f:
f.write(base64.b64decode(img_data))
Batch Processing Script #
# batch_generate.py - process multiple prompts
import requests
import csv
import base64
API_URL = "http://localhost:7860/sdapi/v1/txt2img"
def generate_image(prompt, filename, width=1024, height=1024):
payload = {
"prompt": prompt,
"negative_prompt": "blurry, low quality, deformed",
"steps": 25,
"cfg_scale": 7.0,
"width": width,
"height": height,
"sampler_name": "DPM++ 2M Karras"
}
response = requests.post(API_URL, json=payload)
result = response.json()
with open(filename, "wb") as f:
f.write(base64.b64decode(result['images'][0]))
return filename
# Process prompt list from CSV
with open("prompts.csv", "r") as f:
reader = csv.DictReader(f)
for i, row in enumerate(reader):
filename = f"output_{i:04d}.png"
generate_image(row['prompt'], filename)
print(f"Generated: {filename}")
Security Hardening for Public Deployment #
# 1. Enable authentication
python3 launch.py --listen --gradio-auth admin:strongpassword
# 2. Use HTTPS reverse proxy (nginx)
# /etc/nginx/sites-available/sd-webui
server {
listen 443 ssl http2;
server_name sd.example.com;
ssl_certificate /etc/letsencrypt/live/sd.example.com/fullchain.pem;
ssl_certificate_key /etc/letsencrypt/live/sd.example.com/privkey.pem;
location / {
proxy_pass http://127.0.0.1:7860;
proxy_http_version 1.1;
proxy_set_header Host $host;
proxy_set_header X-Real-IP $remote_addr;
proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
proxy_set_header X-Forwarded-Proto $scheme;
# WebSocket support for Gradio
proxy_set_header Upgrade $http_upgrade;
proxy_set_header Connection "upgrade";
}
}
# 3. Firewall rules (ufw)
sudo ufw default deny incoming
sudo ufw allow ssh
sudo ufw allow 443/tcp
sudo ufw enable
# 4. Rate limiting via nginx limit_req
limit_req_zone $binary_remote_addr zone=sd_limit:10m rate=5r/m;
Monitoring and Logging #
# Create a simple health check script
#!/bin/bash
# health_check.sh
STATUS=$(curl -s -o /dev/null -w "%{http_code}" http://localhost:7860/)
if [ "$STATUS" != "200" ]; then
echo "$(date): WebUI is down (HTTP $STATUS), restarting..." >> /var/log/sd-webui.log
systemctl restart sd-webui
fi
# systemd service for auto-start
# /etc/systemd/system/sd-webui.service
[Unit]
Description=Stable Diffusion WebUI
After=network.target
[Service]
Type=simple
User=sduser
WorkingDirectory=/home/sduser/stable-diffusion-webui
ExecStart=/home/sduser/stable-diffusion-webui/venv/bin/python launch.py --xformers --listen --port 7860
Restart=on-failure
RestartSec=30
Environment="PYTHONUNBUFFERED=1"
[Install]
WantedBy=multi-user.target
Enable auto-start:
sudo systemctl daemon-reload
sudo systemctl enable sd-webui
sudo systemctl start sd-webui
Comparison with Alternatives #
| Feature | Stable Diffusion WebUI | ComfyUI | InvokeAI | Fooocus |
|---|---|---|---|---|
| UI Type | Tab-based web interface | Node-based graph editor | Web app with canvas | Minimal single-page |
| GitHub Stars | 159,000+ | 75,000+ | 25,000+ | 42,000+ |
| Extension Ecosystem | 1,000+ extensions | 1,500+ custom nodes | ~100 community nodes | Limited (presets) |
| Learning Curve | Moderate | Steep | Moderate | Minimal |
| ControlNet Support | Excellent (full range) | Extensive (all models) | Built-in (depth, pose, canny) | Basic (limited) |
| LoRA Support | Full (stackable, weight control) | Full | Built-in manager | Basic |
| VRAM Minimum (SDXL) | 4GB (with –lowvram) | 4GB (with tricks) | 6GB | 4GB |
| Batch Processing | Built-in | Queue-based workflows | Queue system | Manual only |
| API | Full REST API (/sdapi/v1/) | REST + WebSocket | Full REST | None |
| Best For | All-around use, extensions | Power users, automation | Artists, canvas editing | Beginners, quick generation |
When to Choose Which Tool #
Stable Diffusion WebUI: Choose when you need the largest extension ecosystem, proven community support, and a balance between ease of use and flexibility. The tabbed interface is intuitive for users coming from Photoshop or similar tools. The 159K stars translate to more tutorials, more extensions, and faster bug fixes.
ComfyUI: Choose when building automated pipelines, video generation workflows, or multi-stage processing. The node system is powerful but requires 2-4 hours of learning. ComfyUI consistently benchmarks 10-20% faster than WebUI on identical hardware due to lower UI overhead.
InvokeAI: Choose when canvas-based inpainting and professional artist workflows are priorities. The unified canvas is the best in-class for manual editing.
Fooocus: Choose for the absolute simplest setup — download, extract, run. No configuration needed. SDXL quality out of the box. Not suitable for production pipelines due to lack of API and limited automation.
Limitations / Honest Assessment #
Stable Diffusion WebUI is not the right tool for every image generation use case. Here are the concrete limitations:
Higher VRAM usage than alternatives: The Gradio-based UI adds ~500MB-1GB of VRAM overhead compared to ComfyUI’s leaner frontend. On 4-6GB GPUs, this difference matters — you may need to use
--lowvramwhile ComfyUI handles the same workflow on the same hardware without optimization flags.Not the fastest option: Benchmarks consistently show ComfyUI outpacing WebUI by 10-20% on identical hardware and models. The gap widens on batch processing where ComfyUI’s queue system is more efficient.
Flux model support is limited: While SD 1.5, SDXL, and SD3 are well-supported, running Flux models requires either the Forge fork or manual configuration. ComfyUI has better day-0 support for new model architectures.
Extension conflicts: With 1,000+ extensions available, incompatibilities between extensions are common. Updates can break existing workflows. The
extensions/folder sometimes needs manual cleanup when things go wrong.Not ideal for video generation: While extensions like AnimateDiff exist, the WebUI interface is fundamentally designed for static images. ComfyUI’s node system is significantly better suited for video and animation workflows.
Single-user design: The WebUI does not have built-in multi-user support. Running a shared instance requires external authentication and session management via reverse proxy.
Frequently Asked Questions #
What GPU do I need for Stable Diffusion WebUI? #
An NVIDIA GPU with at least 4GB VRAM is the minimum for SD 1.5 at 512x512 resolution. For SDXL at 1024x1024, 8GB VRAM is the practical minimum (with --medvram optimization). A 12GB VRAM card (RTX 3060, RTX 4060 Ti) provides comfortable SDXL generation without compromises. For professional use with ControlNet and Hi-Res Fix, 24GB (RTX 3090/4090) is recommended.
How do I update Stable Diffusion WebUI? #
Run git pull in the installation directory to fetch the latest code, then restart the WebUI. On Windows, double-click update.bat. If extensions break after updating, delete the venv folder to force dependency reinstallation. For the dev branch (required for RTX 50-series GPUs), run git checkout dev before updating.
Can I run Stable Diffusion WebUI without an NVIDIA GPU? #
Yes, but with significant limitations. AMD GPUs work on Linux via ROCm (--precision full --no-half). Apple Silicon Macs can run via ./webui.sh with MPS backend, though generation is 3-5x slower than equivalent NVIDIA hardware. CPU-only mode is possible with --use-cpu all but generates a single 512x512 image in 5-10 minutes versus 2-4 seconds on a mid-range GPU.
Why are my generated images black or green? #
This is a half-precision issue on certain GPU/driver combinations. Add --precision full --no-half to your launch arguments. For NaN errors specifically in the VAE, use --no-half-vae instead. RTX 16-series and some GTX cards are more prone to this issue.
How do I fix “CUDA out of memory” errors? #
First, enable xFormers with --xformers (20-30% VRAM reduction). For 8GB cards, add --medvram. For 4-6GB cards, use --lowvram. Install the Tiled VAE extension for high-resolution generation. Consider using FP8 or NF4 quantized models which use 50% less VRAM at a minor quality cost. If all else fails, reduce image resolution or batch size.
Is it safe to expose Stable Diffusion WebUI to the internet? #
No — not without additional security. The built-in --listen flag exposes the UI without authentication. Always combine --gradio-auth username:password with an HTTPS reverse proxy (nginx/Caddy) and firewall rules. The WebUI was designed for local use; public deployment requires treating it as a production service with proper access controls.
Conclusion #
Stable Diffusion WebUI by AUTOMATIC1111 remains the most practical starting point for local AI image generation in 2026. The 159,000+ GitHub stars reflect not just popularity — they represent an extension ecosystem, knowledge base, and community support infrastructure that no competitor has matched. For developers building image generation pipelines, this tool offers the best balance of features, documentation, and proven stability.
Action items to get started:
- Verify your GPU has 8GB+ VRAM and install the WebUI using the automatic installer (Windows) or Docker (Linux/server)
- Install ControlNet + 4 core models (openpose, depth, canny, lineart) for structure-guided generation
- Download 2-3 quality SDXL checkpoints and 5-10 LoRA adapters for your use case
- Configure
--xformersand the appropriate VRAM flag for your hardware - Set up nginx reverse proxy with authentication if deploying beyond localhost
Discuss this guide or get help in our Telegram group: t.me/dibi8opensource
Disclosure: This article contains affiliate links to DigitalOcean. We may receive a commission if you sign up through these links — this comes at no additional cost to you and helps support our open-source content. All recommendations are based on hands-on testing, not affiliate partnerships.
Recommended Hosting & Infrastructure #
Before you deploy any of the tools above into production, you’ll need solid infrastructure. Two options dibi8 actually uses and recommends:
- DigitalOcean — $200 free credit for 60 days across 14+ global regions. The default option for indie devs running open-source AI tools.
- HTStack — Hong Kong VPS with low-latency access from mainland China. This is the same IDC that hosts dibi8.com — battle-tested in production.
Affiliate links — they don’t cost you extra and they help keep dibi8.com running.
Sources & Further Reading #
- AUTOMATIC1111/stable-diffusion-webui GitHub Repository
- Official Wiki — Install and Run on NVIDIA GPUs
- ControlNet Extension Repository
- Stable Diffusion WebUI Docker Setup
- ComfyUI — Node-based Alternative
- InvokeAI — Artist-focused Alternative
- DigitalOcean GPU Droplets Documentation
- Civitai — Model and LoRA Hub
- Hugging Face — Model Downloads
💬 Discussion