An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with vocoder

A curated list of projects in awesome lists tagged with vocoder .

https://github.com/paddlepaddle/paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 12 May 2025

https://github.com/PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 24 Mar 2025

https://github.com/mozilla/tts

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder

Last synced: 13 May 2025

https://github.com/open-mmlab/amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion

Last synced: 12 May 2025

https://github.com/mozilla/TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder

Last synced: 14 Mar 2025

https://github.com/open-mmlab/Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion

Last synced: 28 Mar 2025

https://github.com/fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

agent bert bert-vits bert-vits2 fish fish-speech llm tts vits vits2 vocoder

Last synced: 27 Mar 2025

https://github.com/TensorSpeech/TensorflowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts

Last synced: 21 Jul 2025

https://github.com/tensorspeech/tensorflowtts

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts

Last synced: 09 Apr 2025

https://github.com/TensorSpeech/TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts

Last synced: 24 Mar 2025

https://github.com/jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

deep-learning gan hifi-gan pytorch speech-synthesis text-to-speech tts vocoder

Last synced: 14 May 2025

https://github.com/kan-bayashi/parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

hifigan melgan neural-vocoder parallel-wavenet pytorch realtime speech-synthesis style-melgan text-to-speech tts vocoder wavenet

Last synced: 12 Apr 2025

https://github.com/kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

hifigan melgan neural-vocoder parallel-wavenet pytorch realtime speech-synthesis style-melgan text-to-speech tts vocoder wavenet

Last synced: 06 Apr 2025

https://github.com/mmorise/world

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 14 May 2025

https://github.com/mmorise/World

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 04 May 2025

https://github.com/gemelo-ai/vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

vocoder vocos

Last synced: 04 Apr 2025

https://github.com/lmnt-com/diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

deep-learning diffwave machine-learning neural-network paper pretrained-models pytorch speech speech-synthesis text-to-speech tts vocoder

Last synced: 03 Oct 2025

https://github.com/Rongjiehuang/FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

ijcai2022 neural-vocoder speech-synthesis text-to-speech vocoder

Last synced: 21 Jul 2025

https://github.com/rongjiehuang/fastdiff

PyTorch Implementation of FastDiff (IJCAI'22)

ijcai2022 neural-vocoder speech-synthesis text-to-speech vocoder

Last synced: 05 Apr 2025

https://github.com/szechyjs/mbelib

P25 Phase 1 and ProVoice vocoder

c vocoder

Last synced: 06 Apr 2025

https://github.com/sh123/codec2_talkie

Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)

amateur-radio amateurradio aprs bluetooth codec2 digital digital-voice dv fm freedv ham-radio hf kiss lora opus radio uhf vhf vocoder walkie-talkie

Last synced: 24 Feb 2026

https://github.com/maum-ai/univnet

Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

deep-learning gan pytorch speech-synthesis text-to-speech tts vocoder

Last synced: 09 Apr 2025

https://github.com/maum-ai/phaseaug

ICASSP 2023 Accepted

gan speech-synthesis vocoder

Last synced: 10 Apr 2025

https://github.com/descriptinc/cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

audio autoregression gan vocoder

Last synced: 06 Apr 2025

https://github.com/k2kobayashi/crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

adversarial-learning cyclic-constraints speech-synthesis vocoder voice-conversion vqvae

Last synced: 06 Apr 2025

https://github.com/yl4579/hiftnet

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

deep-learning speech-synthesis text-to-speech tts vocoder vocoders

Last synced: 05 Apr 2025

https://github.com/xcmyz/fastvocoder

Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

hifigan melgan speech-synthesis vocoder

Last synced: 31 Oct 2025

https://github.com/ncsoft/avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

avocodo gan pytorch vocoder

Last synced: 23 Jul 2025

https://github.com/erogol/fftnet

FFTNet vocoder implementation

deep-learning fftnet pytorch text2speech vocoder

Last synced: 27 Jun 2025

https://github.com/zzw922cn/lpc_for_tts

Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.

audiocompression lpc lpcnet mel-spectrogram tts vocoder wavernn

Last synced: 26 Apr 2025

https://github.com/vtuber-plan/nsf-hifigan

Vocoder NSF-HiFiGAN (Moved into deepaudio)

vocoder

Last synced: 06 Oct 2025

https://github.com/revsic/tf-diffwave

Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis

diffusion diffwave tensorflow tts vocoder wavenet

Last synced: 05 May 2025

https://github.com/yoyolicoris/pytorch_FFTNet

A pytorch implementation of FFTNet.

cnn fftnet vocoder

Last synced: 09 Mar 2025

https://github.com/vtuber-plan/hifi-gan

An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.

vocoder

Last synced: 22 Jul 2025

https://github.com/yuzukitsuru/world.js

World.JS is a JavaScript Wrapper for World Vocoder Powered by Emscripten

audio-processing d4c dsp emscripten f0-estimation javascript javascript-library morise speech synthesis vocoder world wrapper

Last synced: 15 Jul 2025

https://github.com/yoyolicoris/wavenet-like-vocoder

Basic wavenet and fftnet vocoder model.

fftnet mel-spectrogram pytorch vocoder wavenet

Last synced: 05 May 2025

https://github.com/vtuber-plan/istftnet

iSTFTNet Vocoder PyTorch Implement

vocoder

Last synced: 10 Apr 2025

https://github.com/revsic/speechset

Numpy-librosa implementation of Speech dataset pipeline

preprocessor speech-dataset tts vocoder

Last synced: 26 Jun 2025

https://github.com/bycob/harmonizer

Jacob Collier-like harmonizer, because I'm jealous and I want a choir for myself too

audio-processing dsp harmonizer music realtime-audio vocoder

Last synced: 16 May 2025

https://github.com/will-rice/diffwave

TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis. (WIP)

diffusion speech speech-synthesis tensorflow text-to-speech tts vocoder

Last synced: 13 Apr 2025

https://github.com/34j/neural-source-filter

Python package for NSF and NSF-HiFi-GAN (unofficial)

hifi-gan mypy neural-source-filter nsf python pytorch tts vocoder voice-conversion

Last synced: 07 Mar 2026

https://github.com/monocasual/vocoder

Probably one of the best text-to-speech online apps in the world (if your browser supports it).

speechsynthesis text-to-speech vocoder voice-conversion voicetext

Last synced: 02 Feb 2026

https://github.com/egorsmkv/radtts-uk

🇺🇦 Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model

conversational-ai hifigan speech-ai speech-synthesis text-to-speech tts ukrainian vocoder

Last synced: 08 Jan 2026

https://github.com/tuan3w/ddsp-pytorch

Incomplete DDSP implementation in Pytorch

ddsp music pytorch tts vocoder

Last synced: 13 May 2026

https://github.com/egorsmkv/istftnet-pytorch

Patched original code with some developer additions, don't use in prod

pytorch vocoder

Last synced: 03 Mar 2025

https://github.com/isadrtdinov/wavenet

WaveNet vocoder implementation for speech synthesis task

deep-learning ljspeech pytorch speech-synthesis vocoder wavenet

Last synced: 27 Apr 2026

https://github.com/khaykingleb/hifi-gan

Vocoder for TTS

gan hifi-gan pytorch tts vocoder

Last synced: 28 Apr 2026

https://github.com/yu2924/channelvocoder

channel vocoder audio effect

audio-processing experimental juce-plugin vocoder

Last synced: 22 Jun 2026

https://github.com/yu2924/vowel2d

simple vowel simulation experiment

audio-processing experimental juce-plugin vocoder

Last synced: 25 Jun 2026

https://github.com/yjg30737/coquitts-kaggle

Using coquiTTS in kaggle notebook

coqui coquitts jupyter-notebook kaggle vocoder voice-cloning

Last synced: 16 Apr 2026

https://github.com/yrom/mlx-bigvgan

MLX implementation of https://github.com/NVIDIA/BigVGAN

gan mlx python3 vocoder

Last synced: 17 Mar 2026