https://github.com/BIMer-99/ComfyUI_FishSpeech_EX
This plugin is optimized for Fish-Speech-1.5 and works only with version 1.5.
- Host: GitHub
- URL: https://github.com/BIMer-99/ComfyUI_FishSpeech_EX
- Owner: BIMer-99
- Created: 2024-12-20T02:56:22.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-12-20T03:44:39.000Z (4 months ago)
- Last Synced: 2024-12-20T04:18:07.824Z (4 months ago)
- Language: Python
- Size: 421 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-comfyui - **ComfyUI_FishSpeech_EX** - This plugin is optimized for Fish-Speech-1.5 version and is only applicable to version 1.5 (All Workflows Sorted by GitHub Stars)
README
[English](README.md) | [简体中文](README_zh_CN.md)
# ComfyUI_FishSpeech_EX
This plugin is optimized for Fish-Speech-1.5 and works only with version 1.5:
1. The plugin builds on the ComfyUI-fish-speech plugin, with changes to the overall configuration path and the installation method.
2. The list of required Python libraries has been completed, most importantly vector-quantize-pytorch. If this library is not installed, the audio quality will be poor. `This problem bothered me for several days, and I searched through the entire FishSpeech plugin before tracing it to the sampling step. If it has been bothering you too, please give the project a like, thank you!`

#### Specific nodes:
- **EX_AudioToPrompt**
1. **audio**: ComfyUI audio.
2. **vqgan**: VQGAN model.
3. **restored_audio**: Decoded audio.
4. **prompt_tokens**: Tokens corresponding to the prompt audio.
- **EX_Prompt2Semantic**
1. **prompt_tokens**: Tokens corresponding to the input prompt audio.
2. **codes**: The generated audio codes.
- **EX_LoadVQGAN**
Load the VQGAN model: takes the model path as input and outputs the model.
- **EX_Semantic2Image**
Decode the audio codes and output the corresponding audio.
- **EX_SaveAudioToMp3**
Save the audio to an MP3 file.

## Workflow
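Before wiring these nodes into a workflow, it may help to confirm that vector-quantize-pytorch is visible to the Python environment that runs ComfyUI, since a missing install degrades audio quality without an obvious error. The following is a minimal sketch, not part of the plugin: the helper `missing_modules` and the list `REQUIRED` are hypothetical names, and it assumes the package is imported as `vector_quantize_pytorch`.

```python
import importlib.util


def missing_modules(module_names):
    """Return the top-level module names that the import system cannot find."""
    return [
        name for name in module_names
        if importlib.util.find_spec(name) is None
    ]


# Assumption: the PyPI package vector-quantize-pytorch is imported
# under the module name `vector_quantize_pytorch`.
REQUIRED = ["vector_quantize_pytorch"]

if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are importable.")
```

If the script reports the module as missing, installing it with `pip install vector-quantize-pytorch` in the same Python environment that ComfyUI uses should resolve it.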
## Reference materials
- [AnyaCoder/ComfyUI-fish-speech](https://github.com/AnyaCoder/ComfyUI-fish-speech) - Official Implementation
- [fishaudio/fish-speech](https://github.com/fishaudio/fish-speech) - SOTA Open Source TTS.