https://github.com/chyinan/uvr-headless-runner

Production-ready UVR5 CLI & Docker image. Run SOTA separation models (Roformer, SCNet, MDX, Demucs, VR Architecture) on headless GPU servers without dependency hell.
https://github.com/chyinan/uvr-headless-runner
automated batch-processing cli command-line command-line-interface command-line-tool demucs docker headless mdx mdx-net pytorch roformer scnet ultimate-vocal-remover uvr uvr-cli uvr5 vr-architecture
Last synced: about 1 month ago
JSON representation
Production-ready UVR5 CLI & Docker image. Run SOTA separation models (Roformer, SCNet, MDX, Demucs, VR Architecture) on headless GPU servers without dependency hell.
Host: GitHub
URL: https://github.com/chyinan/uvr-headless-runner
Owner: chyinan
License: mit
Created: 2026-02-03T12:08:22.000Z (5 months ago)
Default Branch: master
Last Pushed: 2026-02-07T11:52:49.000Z (5 months ago)
Last Synced: 2026-05-18T16:09:55.453Z (about 2 months ago)
Topics: automated, batch-processing, cli, command-line, command-line-interface, command-line-tool, demucs, docker, headless, mdx, mdx-net, pytorch, roformer, scnet, ultimate-vocal-remover, uvr, uvr-cli, uvr5, vr-architecture
Language: Python
Homepage:
Size: 2.27 MB
Stars: 86
Watchers: 0
Forks: 1
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project

README

          


  

    

    

  



🎧 Separate vocals, instruments, drums, bass & more from any audio




  Command-line audio source separation powered by UVR





  

    

  

  

    

  

  

    

  

  

    

  

  

    

  





  🇨🇳 中文 | 🇬🇧 English | 🐳 Docker



---

## ✨ Features

### 🎸 MDX-Net Runner

- MDX-Net / MDX-C models

- **Roformer** (MelBandRoformer, BSRoformer)

- **SCNet** (Sparse Compression Network)

- ONNX & PyTorch checkpoints

### 🥁 Demucs Runner

- Demucs v1 / v2 / v3 / v4

- **htdemucs** / **htdemucs_ft**

- **6-stem separation** (Guitar, Piano)

- Auto model download

### 🎤 VR Runner

- VR Architecture models

- **VR 5.1** model support

- Window size / Aggression tuning

- TTA & Post-processing

### 🚀 Highlights

Feature

Description

🎯 GUI-IdenticalExactly replicates UVR GUI behavior

⚡ GPU AcceleratedNVIDIA CUDA & AMD DirectML support

🔧 Zero ConfigAuto-detect model parameters

📦 Batch ReadyPerfect for automation & pipelines

🎚️ Bit Depth Control16/24/32-bit PCM, 32/64-bit float

📥 Auto DownloadOfficial UVR model registry with auto-download

🛡️ Robust Error HandlingGPU fallback, retry, fuzzy matching

🔗 Unified CLIuvr mdx / uvr demucs / uvr vr — one command for all

📦 PyPI Readypip install uvr-headless-runner — instant setup

---

## 📖 Design Philosophy

> 

> 

> **This project is a headless automation layer for [Ultimate Vocal Remover](https://github.com/Anjok07/ultimatevocalremovergui).**

> 

> It does **NOT** reimplement any separation logic.  

> It **EXACTLY REPLICATES** UVR GUI behavior — model loading, parameter fallback, and auto-detection.

> 

> **✅ If a model works in UVR GUI, it works here — no extra config needed.**

---

## 🤔 Why uvr-headless-runner?

> Built for maximum flexibility. Load any custom model without waiting for upstream updates.

### 🎨 Full Custom Model Support

Directly load any `.pth` or `.ckpt` file.  

**Perfect for testing new finetunes or experimental models immediately.**

### 🖥️ Headless & Remote Ready

Built for seamless integration into  

**web services or automation scripts.**

### 👥 By Users, For Users

Designed by audio enthusiasts who  

**prioritize complete control and native UVR compatibility.**

---

## 📋 Requirements

| Component | Requirement |

|-----------|-------------|

| **Python** | 3.9.x (3.10+ not fully tested) |

| **GPU** | NVIDIA CUDA or AMD DirectML *(optional)* |

| **OS** | Windows / Linux / macOS |

---

## 🔧 Installation

🚀 Option 1: pip install from PyPI (Recommended)

```bash

# Install from PyPI

pip install uvr-headless-runner

# GPU support (NVIDIA)

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124

# ONNX GPU (optional)

pip install onnxruntime-gpu

```

After installation, you get the **`uvr` unified CLI** — no need to clone the repo!

```bash

uvr mdx -m "UVR-MDX-NET Inst HQ 3" -i song.wav -o output/

uvr demucs -m htdemucs -i song.wav -o output/

uvr vr -m "UVR-De-Echo-Normal" -i song.wav -o output/

```

📦 Option 2: Poetry (from source)

```bash

# Clone repository

git clone https://github.com/chyinan/uvr-headless-runner.git

cd uvr-headless-runner

# Install dependencies

poetry install

# GPU support (NVIDIA)

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124

# ONNX GPU (optional)

pip install onnxruntime-gpu

```

📦 Option 3: pip + venv (from source)

```bash

# Clone repository

git clone https://github.com/chyinan/uvr-headless-runner.git

cd uvr-headless-runner

# Create virtual environment

python -m venv venv

source venv/bin/activate  # Linux/macOS

# venv\Scripts\activate   # Windows

# Install dependencies

pip install -r requirements.txt

# GPU support (NVIDIA)

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124

```

🔴 AMD GPU (DirectML)

```bash

# Install DirectML support

pip install torch-directml

# Use with --directml flag

python mdx_headless_runner.py -m model.ckpt -i song.wav -o output/ --directml

```

> ⚠️ DirectML is experimental. NVIDIA CUDA recommended for best performance.

### ✅ Verify Installation (Native Python Only)

```bash

python -c "import torch; print(f'PyTorch: {torch.__version__}'); print(f'CUDA: {torch.cuda.is_available()}')"

```

> 💡 Skip this if using Docker - the container includes all dependencies.

🐳 Option 4: Docker Hub (No Build Required!)

**Fastest way to get started - just pull and run!**

```bash

# Pull pre-built image from Docker Hub

docker pull chyinan/uvr-headless-runner:latest

# Run directly (GPU mode)

docker run --rm --gpus all \

  -v ~/.uvr_models:/models \

  -v $(pwd):/data \

  chyinan/uvr-headless-runner:latest \

  uvr-mdx -m "UVR-MDX-NET Inst HQ 3" -i /data/song.wav -o /data/output/

# Run directly (CPU mode)

docker run --rm \

  -v ~/.uvr_models:/models \

  -v $(pwd):/data \

  chyinan/uvr-headless-runner:latest \

  uvr-mdx -m "UVR-MDX-NET Inst HQ 3" -i /data/song.wav -o /data/output/ --cpu

```

**Or install CLI wrappers for native experience:**

```bash

# One-click install (auto-detects GPU)

./docker/install.sh      # Linux/macOS

.\docker\install.ps1     # Windows

# Then use like native commands

uvr-mdx -m "UVR-MDX-NET Inst HQ 3" -i song.wav -o output/

uvr-demucs -m htdemucs -i song.wav -o output/

uvr-vr -m "UVR-De-Echo-Normal" -i song.wav -o output/

```

📖 **[Full Docker Guide →](DOCKER_README.md)**

---

## 🎼 Quick Start

### Unified CLI (pip install / Docker)

After installing via `pip install uvr-headless-runner` or Docker, you can use the **short commands**:

```bash

# MDX-Net / Roformer separation

uvr mdx -m "UVR-MDX-NET Inst HQ 3" -i song.wav -o output/ --gpu

# Demucs separation

uvr demucs -m htdemucs -i song.wav -o output/ --gpu

# VR Architecture separation

uvr vr -m "UVR-De-Echo-Normal" -i song.wav -o output/ --gpu

# List all available models

uvr list all

# Download a model

uvr download "UVR-MDX-NET Inst HQ 3" --arch mdx

# Show system info

uvr info

```

> 💡 You can also use standalone commands: `uvr-mdx`, `uvr-demucs`, `uvr-vr`

### MDX-Net / Roformer / SCNet

```bash

# Basic separation

python mdx_headless_runner.py -m "model.ckpt" -i "song.flac" -o "output/" --gpu

# Vocals only (24-bit)

python mdx_headless_runner.py -m "model.ckpt" -i "song.flac" -o "output/" --gpu --vocals-only --wav-type PCM_24

```

### Demucs

```bash

# All 4 stems

python demucs_headless_runner.py --model htdemucs --input "song.flac" --output "output/" --gpu

# Vocals only

python demucs_headless_runner.py --model htdemucs --input "song.flac" --output "output/" --gpu --stem Vocals --primary-only

```

### VR Architecture

```bash

# Basic separation (model in database)

python vr_headless_runner.py -m "model.pth" -i "song.flac" -o "output/" --gpu

# Custom model (not in database)

python vr_headless_runner.py -m "model.pth" -i "song.flac" -o "output/" --gpu \

    --param 4band_v3 --primary-stem Vocals

```

---

## 📥 Model Download Center

All runners now include **automatic model downloading** from official UVR sources - just like the GUI!

### List Available Models

```bash

# List all MDX-Net models

python mdx_headless_runner.py --list

# List only installed models

python mdx_headless_runner.py --list-installed

# List models not yet downloaded

python mdx_headless_runner.py --list-uninstalled

# Same for Demucs and VR

python demucs_headless_runner.py --list

python vr_headless_runner.py --list

```

### Download Models

```bash

# Download a specific model (without running inference)

python mdx_headless_runner.py --download "UVR-MDX-NET Inst HQ 3"

python demucs_headless_runner.py --download "htdemucs_ft"

python vr_headless_runner.py --download "UVR-De-Echo-Normal by FoxJoy"

```

### Auto-Download on Inference

```bash

# Just use the model name - it will download automatically if not installed!

python mdx_headless_runner.py -m "UVR-MDX-NET Inst HQ 3" -i "song.flac" -o "output/" --gpu

# Demucs models auto-download too

python demucs_headless_runner.py --model htdemucs_ft --input "song.flac" --output "output/" --gpu

```

### Model Info & Fuzzy Matching

```bash

# Get detailed info about a model

python mdx_headless_runner.py --model-info "UVR-MDX-NET Inst HQ 3"

# Typo? Get suggestions!

python mdx_headless_runner.py --model-info "UVR-MDX Inst HQ"

# Output: Did you mean: UVR-MDX-NET Inst HQ 1, UVR-MDX-NET Inst HQ 2, ...

```

### Features

| Feature | Description |

|---------|-------------|

| 🌐 **Official Registry** | Syncs with UVR's official model list |

| 🔄 **Resume Downloads** | Interrupted downloads can be resumed |

| ⏱️ **Retry with Backoff** | Automatic retry on network errors |

| 💾 **Disk Space Check** | Pre-checks available space before download |

| 🔍 **Fuzzy Matching** | Suggests similar model names on typos |

| ✅ **Integrity Check** | Validates downloaded files |

---

## 🛡️ Error Handling & GPU Fallback

All runners include **robust error handling** with automatic GPU-to-CPU fallback:

```bash

# If GPU runs out of memory, automatically falls back to CPU

python mdx_headless_runner.py -m "model.ckpt" -i "song.flac" -o "output/" --gpu

# Output on GPU error:

# ============================================================

# ERROR: GPU memory exhausted

# ============================================================

# Suggestion: Try: (1) Use --cpu flag, (2) Reduce --batch-size...

#

# Attempting to fall back to CPU mode...

```

### Error Messages

Errors now include clear explanations and suggestions:

| Before | After |

|--------|-------|

| `FileNotFoundError` | `Audio file not found: song.wav` |

| `CUDA out of memory` | `GPU memory exhausted. Try: --cpu or reduce --batch-size` |

| `Model not found` | `Model 'xyz' not found. Did you mean: UVR-MDX-NET...?` |

---

## 📊 CLI Progress Display

All runners feature a **professional CLI progress system** with real-time feedback:

```

╭──────────────────────────────────────────────────────────────────────────╮

│                          UVR Audio Separation                            │

├──────────────────────────────────────────────────────────────────────────┤

│  Model         │ UVR-MDX-NET Inst HQ 3                                   │

│  Input         │ song.flac                                               │

│  Output        │ ./output/                                               │

│  Device        │ CUDA:0                                                  │

│  Architecture  │ MDX-Net                                                 │

╰──────────────────────────────────────────────────────────────────────────╯

⠹ Downloading model: UVR-MDX-NET Inst HQ 3

  ████████████████████████████████████████ 100% • 245.3 MB • 12.5 MB/s • 0:00:00

✓ Model downloaded

⠹ Running inference

  ████████████████░░░░░░░░░░░░░░░░░░░░░░░░  42% • 0:01:23 • 0:01:52

✓ Inference complete

╭──────────────────────────────────────────────────────────────────────────╮

│              ✓ Processing completed in 3:15                              │

╰──────────────────────────────────────────────────────────────────────────╯

Output files:

  • output/song_(Vocals).wav

  • output/song_(Instrumental).wav

```

### Features

| Feature | Description |

|---------|-------------|

| 📥 **Download Progress** | Real-time speed, ETA, and transfer stats for model downloads |

| 🎯 **Inference Progress** | Chunk-based progress tracking during audio processing |

| ⏱️ **Time Estimates** | Elapsed time and remaining time (ETA) display |

| 🎨 **Rich Output** | Beautiful terminal UI with `rich` library |

| 🐳 **Docker Compatible** | Works seamlessly inside containers |

| 📉 **Graceful Fallback** | Falls back to basic output if `rich` unavailable |

### Progress Library Support

The system automatically selects the best available library:

1. **`rich`** (preferred) - Full-featured progress bars with colors

2. **`tqdm`** (fallback) - Standard progress bars

3. **Basic** (no deps) - Simple text-based progress

Install `rich` for the best experience:

```bash

pip install rich

```

### Quiet Mode

Disable progress output for scripting:

```bash

python mdx_headless_runner.py -m model.ckpt -i song.wav -o output/ --quiet

```

---

## 🎛️ MDX-Net Runner

### Command Line Arguments

| Argument | Short | Default | Description |

|----------|-------|---------|-------------|

| `--model` | `-m` | **Required** | Model file path (.ckpt/.onnx) |

| `--input` | `-i` | **Required** | Input audio file |

| `--output` | `-o` | **Required** | Output directory |

| `--gpu` | | Auto | Use NVIDIA CUDA |

| `--directml` | | | Use AMD DirectML |

| `--overlap` | | `0.25` | MDX overlap (0.25-0.99) |

| `--overlap-mdxc` | | `2` | MDX-C/Roformer overlap (2-50) |

| `--wav-type` | | `PCM_24` | Output: PCM_16/24/32, FLOAT, DOUBLE |

| `--vocals-only` | | | Output vocals only |

| `--instrumental-only` | | | Output instrumental only |

📋 All Arguments

| Argument | Description |

|----------|-------------|

| `--name` `-n` | Output filename base |

| `--json` | Model JSON config |

| `--cpu` | Force CPU |

| `--device` `-d` | GPU device ID |

| `--segment-size` | Segment size (default: 256) |

| `--batch-size` | Batch size (default: 1) |

| `--primary-only` | Save primary stem only |

| `--secondary-only` | Save secondary stem only |

| `--stem` | MDX-C stem select |

| `--quiet` `-q` | Quiet mode |

### Examples

```bash

# Roformer with custom overlap

python mdx_headless_runner.py \

    -m "MDX23C-8KFFT-InstVoc_HQ.ckpt" \

    -i "song.flac" -o "output/" \

    --gpu --overlap-mdxc 8

# 32-bit float output

python mdx_headless_runner.py \

    -m "model.ckpt" -i "song.flac" -o "output/" \

    --gpu --wav-type FLOAT

```

---

## 🥁 Demucs Runner

### Supported Models

| Model | Version | Stems | Quality |

|-------|---------|-------|---------|

| `htdemucs` | v4 | 4 | ⭐⭐⭐ |

| `htdemucs_ft` | v4 | 4 | ⭐⭐⭐⭐ Fine-tuned |

| `htdemucs_6s` | v4 | 6 | ⭐⭐⭐⭐ +Guitar/Piano |

| `hdemucs_mmi` | v4 | 4 | ⭐⭐⭐ |

| `mdx_extra_q` | v3 | 4 | ⭐⭐⭐ |

### Command Line Arguments

| Argument | Short | Default | Description |

|----------|-------|---------|-------------|

| `--model` | `-m` | **Required** | Model name or path |

| `--input` | `-i` | **Required** | Input audio file |

| `--output` | `-o` | **Required** | Output directory |

| `--gpu` | | Auto | Use NVIDIA CUDA |

| `--segment` | | Default | Segment size (1-100+) |

| `--shifts` | | `2` | Time shifts |

| `--stem` | | | Vocals/Drums/Bass/Other/Guitar/Piano |

| `--wav-type` | | `PCM_24` | Output bit depth |

| `--primary-only` | | | Output primary stem only |

### Stem Selection

| GUI Action | CLI Command |

|------------|-------------|

| All Stems | *(no --stem)* |

| Vocals only | `--stem Vocals --primary-only` |

| Instrumental only | `--stem Vocals --secondary-only` |

### Examples

```bash

# 6-stem separation

python demucs_headless_runner.py \

    --model htdemucs_6s \

    --input "song.flac" --output "output/" \

    --gpu

# High quality with custom segment

python demucs_headless_runner.py \

    --model htdemucs_ft \

    --input "song.flac" --output "output/" \

    --gpu --segment 85

```

---

## 🎤 VR Architecture Runner

### Command Line Arguments

| Argument | Short | Default | Description |

|----------|-------|---------|-------------|

| `--model` | `-m` | **Required** | Model file path (.pth) |

| `--input` | `-i` | **Required** | Input audio file |

| `--output` | `-o` | **Required** | Output directory |

| `--gpu` | | Auto | Use NVIDIA CUDA |

| `--directml` | | | Use AMD DirectML |

| `--window-size` | | `512` | Window size (320/512/1024) |

| `--aggression` | | `5` | Aggression setting (0-50+) |

| `--wav-type` | | `PCM_16` | Output: PCM_16/24/32, FLOAT, DOUBLE |

| `--primary-only` | | | Output primary stem only |

| `--secondary-only` | | | Output secondary stem only |

📋 All Arguments

| Argument | Description |

|----------|-------------|

| `--name` `-n` | Output filename base |

| `--param` | Model param name (e.g., 4band_v3) |

| `--primary-stem` | Primary stem name (Vocals/Instrumental) |

| `--nout` | VR 5.1 nout parameter |

| `--nout-lstm` | VR 5.1 nout_lstm parameter |

| `--cpu` | Force CPU |

| `--device` `-d` | GPU device ID |

| `--batch-size` | Batch size (default: 1) |

| `--tta` | Enable Test-Time Augmentation |

| `--post-process` | Enable post-processing |

| `--post-process-threshold` | Post-process threshold (default: 0.2) |

| `--high-end-process` | Enable high-end mirroring |

| `--list-params` | List available model params |

### Model Parameters

When the model hash is not found in the database, you need to provide parameters manually:

```bash

# List available params

python vr_headless_runner.py --list-params

# Use custom params

python vr_headless_runner.py -m "model.pth" -i "song.flac" -o "output/" \

    --param 4band_v3 --primary-stem Vocals

# VR 5.1 model with nout/nout_lstm

python vr_headless_runner.py -m "model.pth" -i "song.flac" -o "output/" \

    --param 4band_v3 --primary-stem Vocals --nout 48 --nout-lstm 128

```

### Examples

```bash

# High quality with TTA

python vr_headless_runner.py \

    -m "UVR-MDX-NET-Voc_FT.pth" \

    -i "song.flac" -o "output/" \

    --gpu --tta --window-size 1024

# Aggressive separation

python vr_headless_runner.py \

    -m "model.pth" -i "song.flac" -o "output/" \

    --gpu --aggression 15 --post-process

# 24-bit output

python vr_headless_runner.py \

    -m "model.pth" -i "song.flac" -o "output/" \

    --gpu --wav-type PCM_24

```

---

## 📁 Output Structure

```

output/

├── song_(Vocals).wav        # Vocals

├── song_(Instrumental).wav  # Instrumental (MDX)

├── song_(Drums).wav         # Drums (Demucs)

├── song_(Bass).wav          # Bass (Demucs)

├── song_(Other).wav         # Other (Demucs)

├── song_(Guitar).wav        # Guitar (6-stem)

└── song_(Piano).wav         # Piano (6-stem)

```

---

## 🐍 Python API

```python

from mdx_headless_runner import run_mdx_headless

from demucs_headless_runner import run_demucs_headless

from vr_headless_runner import run_vr_headless

# MDX separation

run_mdx_headless(

    model_path='model.ckpt',

    audio_file='song.wav',

    export_path='output',

    use_gpu=True,

    verbose=True  # Print progress

)

# Output: output/song_(Vocals).wav, output/song_(Instrumental).wav

# Demucs separation (vocals only)

run_demucs_headless(

    model_path='htdemucs',

    audio_file='song.wav',

    export_path='output',

    use_gpu=True,

    demucs_stems='Vocals',  # or 'All Stems' for all

    primary_only=True,

    verbose=True

)

# Output: output/song_(Vocals).wav

# VR Architecture separation

run_vr_headless(

    model_path='model.pth',

    audio_file='song.wav',

    export_path='output',

    use_gpu=True,

    window_size=512,

    aggression_setting=5,

    is_tta=False,

    # For unknown models, provide params manually:

    # user_vr_model_param='4band_v3',

    # user_primary_stem='Vocals'

)

# Output: output/song_(Vocals).wav, output/song_(Instrumental).wav

```

> 💡 **Note**: Functions process audio and save to `export_path`. Check output directory for results.

---

## 🔍 Troubleshooting

❌ GPU not detected

```bash

# Check CUDA

python -c "import torch; print(torch.cuda.is_available())"

# Reinstall PyTorch with CUDA

pip uninstall torch torchvision torchaudio

pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu124

```

❌ Model not found

**Option 1: Use automatic download (recommended)**

```bash

# List available models

python mdx_headless_runner.py --list

# Download the model

python mdx_headless_runner.py --download "UVR-MDX-NET Inst HQ 3"

# Or just use it - auto-downloads!

python mdx_headless_runner.py -m "UVR-MDX-NET Inst HQ 3" -i song.wav -o output/

```

**Option 2: Manual download**

Default locations:

- **MDX**: `./models/MDX_Net_Models/`

- **Demucs**: `./models/Demucs_Models/v3_v4_repo/`

- **VR**: `./models/VR_Models/`

❌ Network/Download errors

```bash

# Force refresh model registry

python model_downloader.py --sync

# Check network connectivity

python -c "import urllib.request; urllib.request.urlopen('https://github.com')"

```

The downloader includes:

- Automatic retry (3 attempts with exponential backoff)

- Resume interrupted downloads

- Fallback to cached registry

❌ VR model hash not found

If your VR model isn't in the database, provide parameters manually:

```bash

# List available params

python vr_headless_runner.py --list-params

# Specify param and primary stem

python vr_headless_runner.py -m "model.pth" -i "song.wav" -o "output/" \

    --param 4band_v3 --primary-stem Vocals

```

Common params: `4band_v3`, `4band_v2`, `1band_sr44100_hl512`, `3band_44100`

❌ Poor output quality

- Try increasing `--overlap` or `--overlap-mdxc`

- For Demucs, increase `--segment` (e.g., 85)

- Ensure correct model config with `--json`

---

## 🙏 Acknowledgments










_{Anjok07 & aufr33}










_{Facebook Research}










_{Woosung Choi}










_tsurumeso

Special thanks to **[ZFTurbo](https://github.com/ZFTurbo)** for MDX23C & SCNet.

---

## 📄 License

```

MIT License

Copyright (c) 2022 Anjok07 (Ultimate Vocal Remover)

Copyright (c) 2026 UVR Headless Runner Contributors

```



  View Full License



---

## Contributing & Support

**Pull Requests** and **Issues** are welcome! Whether it's bug reports, feature suggestions, or code contributions, we greatly appreciate them all.

If you find this project helpful, please give us a **Star** ⭐ - it's the best support for us!

---



  _{Made with ❤️ for the audio separation community}