https://github.com/BIMer-99/ComfyUI_FishSpeech_EX

This plugin is optimized for Fish-Speech-1.5 version and is only applicable to version 1.5
https://github.com/BIMer-99/ComfyUI_FishSpeech_EX

Last synced: about 2 months ago
JSON representation

This plugin is optimized for Fish-Speech-1.5 version and is only applicable to version 1.5

Host: GitHub
URL: https://github.com/BIMer-99/ComfyUI_FishSpeech_EX
Owner: BIMer-99
Created: 2024-12-20T02:56:22.000Z (10 months ago)
Default Branch: main
Last Pushed: 2024-12-20T03:44:39.000Z (10 months ago)
Last Synced: 2024-12-20T04:18:07.824Z (10 months ago)
Language: Python
Size: 421 KB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-comfyui - **ComfyUI_FishSpeech_EX** - Speech-1.5 version and is only applicable to version 1.5 (All Workflows Sorted by GitHub Stars)
awesome-comfyui - **ComfyUI_FishSpeech_EX** - Speech-1.5 version and is only applicable to version 1.5 (Workflows (3283) sorted by GitHub Stars)
awesome-comfyui - **ComfyUI_FishSpeech_EX** - Speech-1.5 version and is only applicable to version 1.5 (Workflows (2905) sorted by GitHub Stars)

README

[English](README.md) | [简体中文](README_zh_CN.md)

# ComfyUI_FishSpeech_EX

This plugin is optimized for Fish-Speech-1.5 version and is only applicable to version 1.5:
1. The plugin references the ComfyUI-fish-speech plugin for optimization, and modifies the overall configuration address and installation method.
2. The required Python libraries for the plugin have been improved, mainly vector-quantize-pytorch. If this library is not installed, the audio quality will be poor. `This problem has been bothering me for a few days, and I searched the entire FishSpeech plugin to find the sampling step issue. If this problem has also been bothering you, please give it a like, thank you!`

#### Specific nodes：

- **EX_AudioToPrompt**
1. **audio**: ComfyUI audio.
2. **vqgan**: VQGAN model.
3. **restored_audio**: Decoded audio.
4. **prompt_tokens**: Tokens corresponding to the prompt audio.

- **EX_Prompt2Semantic**
1. **prompt_tokens**: The token corresponding to the input prompt audio.
2. **codes**: The generated audio Code.

- **EX_LoadVQGAN**
Load the VQGAN model, input the model path, and output the model.

- **EX_Semantic2Image**
Analyze audio Code, output corresponding audio.

- **EX_SaveAudioToMp3**
Save the audio to an MP3 file.

## Work flow
![workflow.png](./workflow/show.png)

## Reference materials
- [AnyaCoder/ComfyUI-fish-speech](https://github.com/AnyaCoder/ComfyUI-fish-speech) - Official Implementaion
- [fishaudio/fish-speech](https://github.com/AnyaCoder/ComfyUI-fish-speech) - SOTA Open Source TTS.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/BIMer-99/ComfyUI_FishSpeech_EX

Awesome Lists containing this project

README