Projects in Awesome Lists tagged with vocoder

https://github.com/coqui-ai/TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

deep-learning glow-tts hifigan melgan multi-speaker-tts python pytorch speaker-encoder speaker-encodings speech speech-synthesis tacotron text-to-speech tts tts-model vocoder voice-cloning voice-conversion voice-synthesis

Last synced: 25 Oct 2024

https://github.com/coqui-ai/tts

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

deep-learning glow-tts hifigan melgan multi-speaker-tts python pytorch speaker-encoder speaker-encodings speech speech-synthesis tacotron text-to-speech tts tts-model vocoder voice-cloning voice-conversion voice-synthesis

Last synced: 16 Dec 2024

https://github.com/paddlepaddle/paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 16 Dec 2024

https://github.com/PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 29 Oct 2024

https://github.com/mozilla/tts

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder

Last synced: 17 Dec 2024

https://github.com/mozilla/TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder

Last synced: 25 Oct 2024

https://github.com/fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

agent bert bert-vits bert-vits2 fish fish-speech llm tts vits vits2 vocoder

Last synced: 30 Oct 2024

https://github.com/open-mmlab/amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion

Last synced: 17 Dec 2024

https://github.com/tensorspeech/tensorflowtts

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts

Last synced: 16 Dec 2024

https://github.com/TensorSpeech/TensorflowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts

Last synced: 28 Nov 2024

https://github.com/TensorSpeech/TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts

Last synced: 29 Oct 2024

https://github.com/jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

deep-learning gan hifi-gan pytorch speech-synthesis text-to-speech tts vocoder

Last synced: 18 Dec 2024

https://github.com/kan-bayashi/parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

hifigan melgan neural-vocoder parallel-wavenet pytorch realtime speech-synthesis style-melgan text-to-speech tts vocoder wavenet

Last synced: 19 Dec 2024

https://github.com/kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

hifigan melgan neural-vocoder parallel-wavenet pytorch realtime speech-synthesis style-melgan text-to-speech tts vocoder wavenet

Last synced: 06 Nov 2024

https://github.com/mmorise/world

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 19 Dec 2024

https://github.com/mmorise/World

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 13 Nov 2024

https://github.com/haoheliu/voicefixer

General Speech Restoration

declipping denoise dereverberation mel speech speech-analysis speech-enhancement speech-processing speech-synthesis super-resolution tts vocoder

Last synced: 17 Dec 2024

https://github.com/gemelo-ai/vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

vocoder vocos

Last synced: 05 Nov 2024

https://github.com/rongjiehuang/fastdiff

PyTorch Implementation of FastDiff (IJCAI'22)

ijcai2022 neural-vocoder speech-synthesis text-to-speech vocoder

Last synced: 15 Dec 2024

https://github.com/Rongjiehuang/FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

ijcai2022 neural-vocoder speech-synthesis text-to-speech vocoder

Last synced: 28 Nov 2024

https://github.com/szechyjs/mbelib

P25 Phase 1 and ProVoice vocoder

c vocoder

Last synced: 16 Dec 2024

https://github.com/maum-ai/univnet

Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

deep-learning gan pytorch speech-synthesis text-to-speech tts vocoder

Last synced: 11 Nov 2024

https://github.com/sh123/codec2_talkie

Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)

amateur-radio amateurradio aprs bluetooth codec2 digital digital-voice dv fm freedv ham-radio hf kiss lora opus radio uhf vhf vocoder walkie-talkie

Last synced: 20 Dec 2024

https://github.com/maum-ai/phaseaug

ICASSP 2023 Accepted

gan speech-synthesis vocoder

Last synced: 18 Dec 2024

https://github.com/descriptinc/cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

audio autoregression gan vocoder

Last synced: 02 Oct 2024

https://github.com/k2kobayashi/crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

adversarial-learning cyclic-constraints speech-synthesis vocoder voice-conversion vqvae

Last synced: 17 Dec 2024

https://github.com/xcmyz/fastvocoder

Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

hifigan melgan speech-synthesis vocoder

Last synced: 01 Dec 2024

https://github.com/ncsoft/avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

avocodo gan pytorch vocoder

Last synced: 13 Nov 2024

https://github.com/x-lance/unicats-ctx-vec2wav

[AAAI 2024] Code for CTX-vec2wav in UniCATS

self-supervised-speech semantic-token speech-synthesis unicats vocoder vocoding

Last synced: 15 Dec 2024

https://github.com/jurihock/stftpitchshift

STFT based real-time pitch and timbre shifting in C++ and Python

algorithms audio audio-effect audio-processing cpp dafx dsp fft formants pitch pitch-shifting plugin python realtime smbpitchshift stft stftpitchshift timbre vocoder voice

Last synced: 15 Dec 2024

https://github.com/iamycy/golf

A DDSP-based neural voice synthesiser.

ddsp glottal-flow-model iir-filters linear-predictive-coding pytorch-implementation vocoder

Last synced: 15 Dec 2024

https://github.com/yoyololicon/golf

A DDSP-based neural voice synthesiser.

ddsp glottal-flow-model iir-filters linear-predictive-coding pytorch-implementation vocoder

Last synced: 03 Oct 2024

https://github.com/erogol/fftnet

FFTNet vocoder implementation

deep-learning fftnet pytorch text2speech vocoder

Last synced: 22 Oct 2024

https://github.com/yl4579/hiftnet

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

deep-learning speech-synthesis text-to-speech tts vocoder vocoders

Last synced: 14 Nov 2024

https://github.com/zzw922cn/lpc_for_tts

Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.

audiocompression lpc lpcnet mel-spectrogram tts vocoder wavernn

Last synced: 11 Nov 2024

https://github.com/vtuber-plan/nsf-hifigan

Vocoder NSF-HiFiGAN (Moved into deepaudio)

vocoder

Last synced: 08 Dec 2024

https://github.com/yoyololicon/pytorch_fftnet

A pytorch implementation of FFTNet.

cnn fftnet vocoder

Last synced: 22 Oct 2024

https://github.com/revsic/tf-diffwave

Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis

diffusion diffwave tensorflow tts vocoder wavenet

Last synced: 30 Nov 2024

https://github.com/jurihock/stftpitchshiftplugin

Official JUCE plugin for stftPitchShift

au audio audio-effect audio-processing dafx dsp fft formants juce-plugins low-latency lv2 pitch-shifting plugin real-time stft stftpitchshift timbre vocoder voice vst

Last synced: 27 Oct 2024

https://github.com/ttop32/coqui_tts_korea

Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS

coqui coqui-ai deep-learning glow-tts half-life korea korean korean-language korean-letters korean-text-processing korean-tokenizer korean-tts multiband-melgan pytorch speech speech-synthesis text-to-speech tts vocoder voice-cloning

Last synced: 11 Nov 2024

https://github.com/yuzukitsuru/world.js

World.JS is a JavaScript Wrapper for World Vocoder Powered by Emscripten

audio-processing d4c dsp emscripten f0-estimation javascript javascript-library morise speech synthesis vocoder world wrapper

Last synced: 23 Nov 2024

https://github.com/vtuber-plan/hifi-gan

An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.

vocoder

Last synced: 08 Dec 2024

https://github.com/yoyololicon/wavenet-like-vocoder

Basic wavenet and fftnet vocoder model.

fftnet mel-spectrogram pytorch vocoder wavenet

Last synced: 23 Oct 2024

https://github.com/mmorise/noisegenerators

Noise generators for vocoder

gaussian-white-noise modified-velvet-noise speech-synthesis velvet-noise vocoder

Last synced: 08 Nov 2024

https://github.com/jurihock/voyx

Standalone real time dynamic vocal harmonizer

algorithms audio audio-effect audio-processing cpp dsp fft harmonizer live midi pitch-detection pitch-shifting smbpitchshift standalone stft stftpitchshift vocoder voice voyx

Last synced: 12 Oct 2024

https://github.com/vtuber-plan/istftnet

iSTFTNet Vocoder PyTorch Implement

vocoder

Last synced: 08 Dec 2024

https://github.com/bycob/harmonizer

Jacob Collier-like harmonizer, because I'm jealous and I want a choir for myself too

audio-processing dsp harmonizer music realtime-audio vocoder

Last synced: 19 Nov 2024

https://github.com/revsic/speechset

Numpy-librosa implementation of Speech dataset pipeline

preprocessor speech-dataset tts vocoder

Last synced: 30 Nov 2024

https://github.com/will-rice/diffwave

TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis. (WIP)

diffusion speech speech-synthesis tensorflow text-to-speech tts vocoder

Last synced: 14 Oct 2024

https://github.com/egorsmkv/radtts-hifigan

RADTTS + HiFiGAN vocoder

conversational-ai hifigan speech-synthesis text-to-speech tts ukrainian vocoder

Last synced: 18 Oct 2024

https://github.com/34j/neural-source-filter

Python package for NSF and NSF-HiFi-GAN (unofficial)

hifi-gan mypy neural-source-filter nsf python pytorch tts vocoder voice-conversion

Last synced: 01 Nov 2024

https://github.com/yas-sim/csm_voice_encode_synthesis_python

Expermental code for CSM voice synthesis + CSM data generation

audio-codec audio-processing composite-sinusoidal-modeling csm fm-sound vocoder voice voice-synthesis voice-synthesizer yamaha ym2203

Last synced: 16 Nov 2024

https://github.com/egorsmkv/radtts-uk

🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model

conversational-ai hifigan speech-ai speech-synthesis text-to-speech tts ukrainian vocoder

Last synced: 07 Dec 2024

https://github.com/monocasual/vocoder

Probably one of the best text-to-speech online apps in the world (if your browser supports it).

speechsynthesis text-to-speech vocoder voice-conversion voicetext

Last synced: 14 Nov 2024

https://github.com/egorsmkv/istftnet-pytorch

Patched original code with some developer additions, don't use in prod

pytorch vocoder

Last synced: 18 Oct 2024

https://github.com/egorsmkv/radtts-istftnet

RADTTS + iSTFTNet vocoder

conversational-ai istfnet speech-synthesis text-to-speech tts ukrainian vocoder

Last synced: 07 Dec 2024

https://github.com/tuan3w/ddsp-pytorch

Incomplete DDSP implementation in Pytorch

ddsp music pytorch tts vocoder

Last synced: 11 Nov 2024

https://github.com/isadrtdinov/wavenet

WaveNet vocoder implementation for speech synthesis task

deep-learning ljspeech pytorch speech-synthesis vocoder wavenet

Last synced: 19 Nov 2024

https://github.com/jurihock/pitchsheep

A pitch shifting sheep living in your browser...

audio audio-effects dafx dsp fft formants javascript pitch pitch-shifting stft stftpitchshift timbre vocoder wasm webassembly

Last synced: 19 Nov 2024

https://github.com/yjg30737/coquitts-kaggle

Using coquiTTS in kaggle notebook

coqui coquitts jupyter-notebook kaggle vocoder voice-cloning

Last synced: 06 Dec 2024

https://github.com/egorsmkv/radtts-waveode

RADTTS + WaveODE vocoder

conversational-ai speech-synthesis text-to-speech tts ukrainian vocoder waveode

Last synced: 07 Dec 2024