Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/quickvc/QuickVC-VoiceConversion
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
https://github.com/quickvc/QuickVC-VoiceConversion
Last synced: 2 months ago
JSON representation
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
- Host: GitHub
- URL: https://github.com/quickvc/QuickVC-VoiceConversion
- Owner: quickvc
- License: mit
- Created: 2023-02-14T07:20:29.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2023-07-13T07:43:54.000Z (over 1 year ago)
- Last Synced: 2024-08-03T23:03:18.160Z (6 months ago)
- Language: Python
- Homepage:
- Size: 9.22 MB
- Stars: 213
- Watchers: 22
- Forks: 24
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# QuickVC
This repository contains the open source code, audio samples and pretrained models of my paper: QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
## [Demo Page](https://quickvc.github.io/quickvc-demo)## [Pretrained Model](https://drive.google.com/drive/folders/1DF6RgIHHkn2aoyyUMt4_hPitKSc2YR9d?usp=share_link)
Put pretrained model into logs/quickvc## Inference with pretrained model
```python
python convert.py
```
You can change convert.txt to select the target and source
## Preprocess
1. Hubert-Soft
```python
cd dataset
python encode.py soft dataset/VCTK-16K dataset/VCTK-16K
```
2. Spectrogram resize data augumentation, please refer to [FreeVC](https://github.com/OlaWod/FreeVC).## Train
```python
python train.py
```If you want to change the config and model name, change:
```python
parser.add_argument('-c', '--config', type=str, default="./configs/quickvc.json",help='JSON file for configuration')
parser.add_argument('-m', '--model', type=str,default="quickvc",help='Model name')
```
in utils.pyIn order to use the sr during training, change [this part](https://github.com/quickvc/QuickVC-VoiceConversion/blob/277118de9c81d1689e16be8a43408eda4223553d/data_utils_new_new.py#L70) to
```python
i = random.randint(68,92)
c_filename = filename.replace(".wav", f"_{i}.npy")
```
## References
If you have any question about the decoder, refer to [MS-ISTFT-VITS](https://github.com/MasayaKawamura/MB-iSTFT-VITS).If you have any question about the Hubert-soft, refer to [Soft-VC](https://github.com/bshall/hubert).
If you have any question about the data augumentation, refer to [FreeVC](https://github.com/OlaWod/FreeVC).
## If you meet any problem, welcome to contact with me.