Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with vocoder
A curated list of projects in awesome lists tagged with vocoder.
https://github.com/coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
deep-learning glow-tts hifigan melgan multi-speaker-tts python pytorch speaker-encoder speaker-encodings speech speech-synthesis tacotron text-to-speech tts tts-model vocoder voice-cloning voice-conversion voice-synthesis
Last synced: 25 Oct 2024
https://github.com/paddlepaddle/paddlespeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper
Last synced: 16 Dec 2024
https://github.com/mozilla/tts
:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder
Last synced: 17 Dec 2024
https://github.com/fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
agent bert bert-vits bert-vits2 fish fish-speech llm tts vits vits2 vocoder
Last synced: 30 Oct 2024
https://github.com/open-mmlab/amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion
Last synced: 17 Dec 2024
https://github.com/tensorspeech/tensorflowtts
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German; easy to adapt to other languages)
chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts
Last synced: 16 Dec 2024
https://github.com/jik876/hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
deep-learning gan hifi-gan pytorch speech-synthesis text-to-speech tts vocoder
Last synced: 18 Dec 2024
https://github.com/kan-bayashi/parallelwavegan
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
hifigan melgan neural-vocoder parallel-wavenet pytorch realtime speech-synthesis style-melgan text-to-speech tts vocoder wavenet
Last synced: 19 Dec 2024
https://github.com/mmorise/world
A high-quality speech analysis, manipulation and synthesis system
speech-analysis speech-synthesis vocoder
Last synced: 19 Dec 2024
https://github.com/haoheliu/voicefixer
General Speech Restoration
declipping denoise dereverberation mel speech speech-analysis speech-enhancement speech-processing speech-synthesis super-resolution tts vocoder
Last synced: 17 Dec 2024
https://github.com/gemelo-ai/vocos
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Last synced: 05 Nov 2024
https://github.com/rongjiehuang/fastdiff
PyTorch Implementation of FastDiff (IJCAI'22)
ijcai2022 neural-vocoder speech-synthesis text-to-speech vocoder
Last synced: 15 Dec 2024
https://github.com/maum-ai/univnet
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
deep-learning gan pytorch speech-synthesis text-to-speech tts vocoder
Last synced: 11 Nov 2024
https://github.com/sh123/codec2_talkie
Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)
amateur-radio amateurradio aprs bluetooth codec2 digital digital-voice dv fm freedv ham-radio hf kiss lora opus radio uhf vhf vocoder walkie-talkie
Last synced: 20 Dec 2024
https://github.com/descriptinc/cargan
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
audio autoregression gan vocoder
Last synced: 02 Oct 2024
https://github.com/k2kobayashi/crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
adversarial-learning cyclic-constraints speech-synthesis vocoder voice-conversion vqvae
Last synced: 17 Dec 2024
https://github.com/xcmyz/fastvocoder
Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future.
hifigan melgan speech-synthesis vocoder
Last synced: 01 Dec 2024
https://github.com/ncsoft/avocodo
Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)
Last synced: 13 Nov 2024
https://github.com/x-lance/unicats-ctx-vec2wav
[AAAI 2024] Code for CTX-vec2wav in UniCATS
self-supervised-speech semantic-token speech-synthesis unicats vocoder vocoding
Last synced: 15 Dec 2024
https://github.com/jurihock/stftpitchshift
STFT based real-time pitch and timbre shifting in C++ and Python
algorithms audio audio-effect audio-processing cpp dafx dsp fft formants pitch pitch-shifting plugin python realtime smbpitchshift stft stftpitchshift timbre vocoder voice
Last synced: 15 Dec 2024
https://github.com/iamycy/golf
A DDSP-based neural voice synthesiser.
ddsp glottal-flow-model iir-filters linear-predictive-coding pytorch-implementation vocoder
Last synced: 15 Dec 2024
https://github.com/yoyololicon/golf
A DDSP-based neural voice synthesiser.
ddsp glottal-flow-model iir-filters linear-predictive-coding pytorch-implementation vocoder
Last synced: 03 Oct 2024
https://github.com/erogol/fftnet
FFTNet vocoder implementation
deep-learning fftnet pytorch text2speech vocoder
Last synced: 22 Oct 2024
https://github.com/yl4579/hiftnet
HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform
deep-learning speech-synthesis text-to-speech tts vocoder vocoders
Last synced: 14 Nov 2024
https://github.com/zzw922cn/lpc_for_tts
Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.
audiocompression lpc lpcnet mel-spectrogram tts vocoder wavernn
Last synced: 11 Nov 2024
https://github.com/vtuber-plan/nsf-hifigan
Vocoder NSF-HiFiGAN (Moved into deepaudio)
Last synced: 08 Dec 2024
https://github.com/yoyololicon/pytorch_fftnet
A pytorch implementation of FFTNet.
Last synced: 22 Oct 2024
https://github.com/revsic/tf-diffwave
Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis
diffusion diffwave tensorflow tts vocoder wavenet
Last synced: 30 Nov 2024
https://github.com/jurihock/stftpitchshiftplugin
Official JUCE plugin for stftPitchShift
au audio audio-effect audio-processing dafx dsp fft formants juce-plugins low-latency lv2 pitch-shifting plugin real-time stft stftpitchshift timbre vocoder voice vst
Last synced: 27 Oct 2024
https://github.com/ttop32/coqui_tts_korea
Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS
coqui coqui-ai deep-learning glow-tts half-life korea korean korean-language korean-letters korean-text-processing korean-tokenizer korean-tts multiband-melgan pytorch speech speech-synthesis text-to-speech tts vocoder voice-cloning
Last synced: 11 Nov 2024
https://github.com/yuzukitsuru/world.js
World.JS is a JavaScript Wrapper for World Vocoder Powered by Emscripten
audio-processing d4c dsp emscripten f0-estimation javascript javascript-library morise speech synthesis vocoder world wrapper
Last synced: 23 Nov 2024
https://github.com/vtuber-plan/hifi-gan
A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.
Last synced: 08 Dec 2024
https://github.com/yoyololicon/wavenet-like-vocoder
Basic wavenet and fftnet vocoder model.
fftnet mel-spectrogram pytorch vocoder wavenet
Last synced: 23 Oct 2024
https://github.com/mmorise/noisegenerators
Noise generators for vocoder
gaussian-white-noise modified-velvet-noise speech-synthesis velvet-noise vocoder
Last synced: 08 Nov 2024
https://github.com/jurihock/voyx
Standalone real time dynamic vocal harmonizer
algorithms audio audio-effect audio-processing cpp dsp fft harmonizer live midi pitch-detection pitch-shifting smbpitchshift standalone stft stftpitchshift vocoder voice voyx
Last synced: 12 Oct 2024
https://github.com/bycob/harmonizer
Jacob Collier-like harmonizer, because I'm jealous and I want a choir for myself too
audio-processing dsp harmonizer music realtime-audio vocoder
Last synced: 19 Nov 2024
https://github.com/revsic/speechset
Numpy-librosa implementation of Speech dataset pipeline
preprocessor speech-dataset tts vocoder
Last synced: 30 Nov 2024
https://github.com/will-rice/diffwave
TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis. (WIP)
diffusion speech speech-synthesis tensorflow text-to-speech tts vocoder
Last synced: 14 Oct 2024
https://github.com/egorsmkv/radtts-hifigan
RADTTS + HiFiGAN vocoder
conversational-ai hifigan speech-synthesis text-to-speech tts ukrainian vocoder
Last synced: 18 Oct 2024
https://github.com/34j/neural-source-filter
Python package for NSF and NSF-HiFi-GAN (unofficial)
hifi-gan mypy neural-source-filter nsf python pytorch tts vocoder voice-conversion
Last synced: 01 Nov 2024
https://github.com/yas-sim/csm_voice_encode_synthesis_python
Experimental code for CSM voice synthesis + CSM data generation
audio-codec audio-processing composite-sinusoidal-modeling csm fm-sound vocoder voice voice-synthesis voice-synthesizer yamaha ym2203
Last synced: 16 Nov 2024
https://github.com/egorsmkv/radtts-uk
🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model
conversational-ai hifigan speech-ai speech-synthesis text-to-speech tts ukrainian vocoder
Last synced: 07 Dec 2024
https://github.com/monocasual/vocoder
Probably one of the best text-to-speech online apps in the world (if your browser supports it).
speechsynthesis text-to-speech vocoder voice-conversion voicetext
Last synced: 14 Nov 2024
https://github.com/egorsmkv/istftnet-pytorch
Patched original code with some developer additions; don't use in production.
Last synced: 18 Oct 2024
https://github.com/egorsmkv/radtts-istftnet
RADTTS + iSTFTNet vocoder
conversational-ai istfnet speech-synthesis text-to-speech tts ukrainian vocoder
Last synced: 07 Dec 2024
https://github.com/isadrtdinov/wavenet
WaveNet vocoder implementation for speech synthesis task
deep-learning ljspeech pytorch speech-synthesis vocoder wavenet
Last synced: 19 Nov 2024
https://github.com/jurihock/pitchsheep
A pitch shifting sheep living in your browser...
audio audio-effects dafx dsp fft formants javascript pitch pitch-shifting stft stftpitchshift timbre vocoder wasm webassembly
Last synced: 19 Nov 2024
https://github.com/yjg30737/coquitts-kaggle
Using coquiTTS in kaggle notebook
coqui coquitts jupyter-notebook kaggle vocoder voice-cloning
Last synced: 06 Dec 2024
https://github.com/egorsmkv/radtts-waveode
RADTTS + WaveODE vocoder
conversational-ai speech-synthesis text-to-speech tts ukrainian vocoder waveode
Last synced: 07 Dec 2024