https://github.com/billwuhao/ComfyUI_SparkTTS
Using Spark-TTS in ComfyUI. Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/billwuhao/ComfyUI_SparkTTS
- Owner: billwuhao
- License: apache-2.0
- Created: 2025-03-06T14:07:39.000Z (4 months ago)
- Default Branch: master
- Last Pushed: 2025-03-25T11:11:52.000Z (3 months ago)
- Last Synced: 2025-03-25T12:24:42.554Z (3 months ago)
- Language: Python
- Size: 4.11 MB
- Stars: 25
- Watchers: 1
- Forks: 1
- Open Issues: 1
Metadata Files:
- Readme: README-CN.md
- License: LICENSE
Awesome Lists containing this project
- awesome-comfyui - **ComfyUI_SparkTTS** - TTS in ComfyUI. Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens (All Workflows Sorted by GitHub Stars)
README
[中文](README.md) | [English](README-en.md)
# ComfyUI Nodes for Spark-TTS
Use Spark-TTS in ComfyUI. Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens.

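## Updates
[2025-03-21]⚒️: Refactored the code. Unloading the model is now optional, and generation is faster. Added more adjustable parameters. Added support for cloning voices across languages.

[2025-03-07]⚒️: Released v1.0.0. The recording node `MW Audio Recorder for Spark` records audio from a microphone, with a progress bar showing recording progress.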

## Installation
```
cd ComfyUI/custom_nodes
git clone https://github.com/billwuhao/ComfyUI_SparkTTS.git
cd ComfyUI_SparkTTS
pip install -r requirements.txt

# If you use the embedded Python (python_embeded):
./python_embeded/python.exe -m pip install -r requirements.txt
```

## Model Download
Download the following model into the `ComfyUI\models\TTS` folder:

[Spark-TTS-0.5B](https://huggingface.co/SparkAudio/Spark-TTS-0.5B)
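
If you would rather script the download than fetch the files by hand, the following is a minimal sketch using `snapshot_download` from the `huggingface_hub` package. It assumes `huggingface_hub` is installed and that you pass the path to your own ComfyUI folder. The helper names `spark_tts_dir` and `download_spark_tts` are illustrative and are not part of this repository.

```python
from pathlib import Path

REPO_ID = "SparkAudio/Spark-TTS-0.5B"  # Hugging Face repo linked above


def spark_tts_dir(comfy_root):
    """Return <comfy_root>/models/TTS/Spark-TTS-0.5B (hypothetical helper)."""
    return Path(comfy_root) / "models" / "TTS" / REPO_ID.split("/")[-1]


def download_spark_tts(comfy_root="ComfyUI"):
    """Download the model snapshot into the TTS folder (needs network access)."""
    # Imported lazily so the path helper works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    target = spark_tts_dir(comfy_root)
    target.mkdir(parents=True, exist_ok=True)
    snapshot_download(repo_id=REPO_ID, local_dir=str(target))
    return target
```

Calling `download_spark_tts("path/to/ComfyUI")` should place the model files in a `Spark-TTS-0.5B` folder under `models/TTS`, matching the folder name this README expects.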
Move the `Step-Audio-speakers` folder from this repository into the `ComfyUI\models\TTS` folder.

The resulting layout is:
```
ComfyUI\models\TTS
├── Spark-TTS-0.5B
├── Step-Audio-speakers
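```

**Note**: If you have already installed [ComfyUI_StepAudioTTS](https://github.com/billwuhao/ComfyUI_StepAudioTTS), you do not need to move the folder; the two node packs share the same audio and configuration files.

You can then add custom speakers under the `ComfyUI\models\TTS\Step-Audio-speakers` folder and use them right away. Make sure each speaker's name matches its configuration.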
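
To confirm the folders ended up in the right place, a small check like the one below can help. It is only a sketch: the function name `missing_tts_folders` is hypothetical, and it checks only that the two folders from the layout above exist, not that their contents are complete.

```python
from pathlib import Path

# Folder names taken from the layout shown above.
REQUIRED_FOLDERS = ("Spark-TTS-0.5B", "Step-Audio-speakers")


def missing_tts_folders(comfy_root):
    """Return the required folders that are absent from <comfy_root>/models/TTS."""
    tts_dir = Path(comfy_root) / "models" / "TTS"
    return [name for name in REQUIRED_FOLDERS if not (tts_dir / name).is_dir()]
```

An empty list from `missing_tts_folders("path/to/ComfyUI")` means both folders are present; any names returned are the ones still to be downloaded or moved.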

## Acknowledgements
[Spark-TTS](https://github.com/SparkAudio/Spark-TTS.git)