Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with vocoder

A curated list of projects in awesome lists tagged with vocoder .

https://github.com/paddlepaddle/paddlespeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 16 Dec 2024

https://github.com/PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

asr code-switch conformer kws punctuation-restoration self-supervised-learning sound-classification speech-alignment speech-recognition speech-synthesis speech-translation streaming-asr streaming-tts transformer tts vocoder voice-cloning voice-recognition wav2vec2 whisper

Last synced: 29 Oct 2024

https://github.com/mozilla/tts

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder

Last synced: 17 Dec 2024

https://github.com/mozilla/TTS

:robot: :speech_balloon: Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

dataset-analysis deep-learning gantts glow-tts melgan multiband-melgan python pytorch speaker-encoder speech tacotron tacotron2 tensorflow2 text-to-speech tts vocoder

Last synced: 25 Oct 2024

https://github.com/fishaudio/Bert-VITS2

vits2 backbone with multilingual-bert

agent bert bert-vits bert-vits2 fish fish-speech llm tts vits vits2 vocoder

Last synced: 30 Oct 2024

https://github.com/open-mmlab/amphion

Amphion (/Γ¦mˈfaΙͺΙ™n/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion

Last synced: 17 Dec 2024

https://github.com/tensorspeech/tensorflowtts

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts

Last synced: 16 Dec 2024

https://github.com/TensorSpeech/TensorflowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts

Last synced: 28 Nov 2024

https://github.com/TensorSpeech/TensorFlowTTS

:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)

chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts

Last synced: 29 Oct 2024

https://github.com/jik876/hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

deep-learning gan hifi-gan pytorch speech-synthesis text-to-speech tts vocoder

Last synced: 18 Dec 2024

https://github.com/kan-bayashi/parallelwavegan

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

hifigan melgan neural-vocoder parallel-wavenet pytorch realtime speech-synthesis style-melgan text-to-speech tts vocoder wavenet

Last synced: 19 Dec 2024

https://github.com/kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

hifigan melgan neural-vocoder parallel-wavenet pytorch realtime speech-synthesis style-melgan text-to-speech tts vocoder wavenet

Last synced: 06 Nov 2024

https://github.com/mmorise/world

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 19 Dec 2024

https://github.com/mmorise/World

A high-quality speech analysis, manipulation and synthesis system

speech-analysis speech-synthesis vocoder

Last synced: 13 Nov 2024

https://github.com/gemelo-ai/vocos

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

vocoder vocos

Last synced: 05 Nov 2024

https://github.com/rongjiehuang/fastdiff

PyTorch Implementation of FastDiff (IJCAI'22)

ijcai2022 neural-vocoder speech-synthesis text-to-speech vocoder

Last synced: 15 Dec 2024

https://github.com/Rongjiehuang/FastDiff

PyTorch Implementation of FastDiff (IJCAI'22)

ijcai2022 neural-vocoder speech-synthesis text-to-speech vocoder

Last synced: 28 Nov 2024

https://github.com/szechyjs/mbelib

P25 Phase 1 and ProVoice vocoder

c vocoder

Last synced: 16 Dec 2024

https://github.com/maum-ai/univnet

Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)

deep-learning gan pytorch speech-synthesis text-to-speech tts vocoder

Last synced: 11 Nov 2024

https://github.com/sh123/codec2_talkie

Turn your Android phone into Amateur Radio Codec2/OPUS APRS enabled DV handheld transceiver (Bluetooth/BLE/USB/TCPIP KISS/Sound modem client for DV digital voice communication)

amateur-radio amateurradio aprs bluetooth codec2 digital digital-voice dv fm freedv ham-radio hf kiss lora opus radio uhf vhf vocoder walkie-talkie

Last synced: 20 Dec 2024

https://github.com/maum-ai/phaseaug

ICASSP 2023 Accepted

gan speech-synthesis vocoder

Last synced: 18 Dec 2024

https://github.com/descriptinc/cargan

Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"

audio autoregression gan vocoder

Last synced: 02 Oct 2024

https://github.com/k2kobayashi/crank

A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder

adversarial-learning cyclic-constraints speech-synthesis vocoder voice-conversion vqvae

Last synced: 17 Dec 2024

https://github.com/xcmyz/fastvocoder

Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

hifigan melgan speech-synthesis vocoder

Last synced: 01 Dec 2024

https://github.com/ncsoft/avocodo

Official implementation of "Avocodo: Generative Adversarial Network for Artifact-Free Vocoder" (AAAI2023)

avocodo gan pytorch vocoder

Last synced: 13 Nov 2024

https://github.com/erogol/fftnet

FFTNet vocoder implementation

deep-learning fftnet pytorch text2speech vocoder

Last synced: 22 Oct 2024

https://github.com/yl4579/hiftnet

HiFTNet: A Fast High-Quality Neural Vocoder with Harmonic-plus-Noise Filter and Inverse Short Time Fourier Transform

deep-learning speech-synthesis text-to-speech tts vocoder vocoders

Last synced: 14 Nov 2024

https://github.com/zzw922cn/lpc_for_tts

Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.

audiocompression lpc lpcnet mel-spectrogram tts vocoder wavernn

Last synced: 11 Nov 2024

https://github.com/vtuber-plan/nsf-hifigan

Vocoder NSF-HiFiGAN (Moved into deepaudio)

vocoder

Last synced: 08 Dec 2024

https://github.com/yoyololicon/pytorch_fftnet

A pytorch implementation of FFTNet.

cnn fftnet vocoder

Last synced: 22 Oct 2024

https://github.com/revsic/tf-diffwave

Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis

diffusion diffwave tensorflow tts vocoder wavenet

Last synced: 30 Nov 2024

https://github.com/yuzukitsuru/world.js

World.JS is a JavaScript Wrapper for World Vocoder Powered by Emscripten

audio-processing d4c dsp emscripten f0-estimation javascript javascript-library morise speech synthesis vocoder world wrapper

Last synced: 23 Nov 2024

https://github.com/vtuber-plan/hifi-gan

An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.

vocoder

Last synced: 08 Dec 2024

https://github.com/yoyololicon/wavenet-like-vocoder

Basic wavenet and fftnet vocoder model.

fftnet mel-spectrogram pytorch vocoder wavenet

Last synced: 23 Oct 2024

https://github.com/vtuber-plan/istftnet

iSTFTNet Vocoder PyTorch Implement

vocoder

Last synced: 08 Dec 2024

https://github.com/bycob/harmonizer

Jacob Collier-like harmonizer, because I'm jealous and I want a choir for myself too

audio-processing dsp harmonizer music realtime-audio vocoder

Last synced: 19 Nov 2024

https://github.com/revsic/speechset

Numpy-librosa implementation of Speech dataset pipeline

preprocessor speech-dataset tts vocoder

Last synced: 30 Nov 2024

https://github.com/will-rice/diffwave

TensorFlow 2.0 Implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis. (WIP)

diffusion speech speech-synthesis tensorflow text-to-speech tts vocoder

Last synced: 14 Oct 2024

https://github.com/34j/neural-source-filter

Python package for NSF and NSF-HiFi-GAN (unofficial)

hifi-gan mypy neural-source-filter nsf python pytorch tts vocoder voice-conversion

Last synced: 01 Nov 2024

https://github.com/egorsmkv/radtts-uk

πŸ‡ΊπŸ‡¦ Ukrainian RAD-TTS++ models (decoder + models with 3 voices) and HiFiGAN model

conversational-ai hifigan speech-ai speech-synthesis text-to-speech tts ukrainian vocoder

Last synced: 07 Dec 2024

https://github.com/monocasual/vocoder

Probably one of the best text-to-speech online apps in the world (if your browser supports it).

speechsynthesis text-to-speech vocoder voice-conversion voicetext

Last synced: 14 Nov 2024

https://github.com/egorsmkv/istftnet-pytorch

Patched original code with some developer additions, don't use in prod

pytorch vocoder

Last synced: 18 Oct 2024

https://github.com/tuan3w/ddsp-pytorch

Incomplete DDSP implementation in Pytorch

ddsp music pytorch tts vocoder

Last synced: 11 Nov 2024

https://github.com/isadrtdinov/wavenet

WaveNet vocoder implementation for speech synthesis task

deep-learning ljspeech pytorch speech-synthesis vocoder wavenet

Last synced: 19 Nov 2024

https://github.com/yjg30737/coquitts-kaggle

Using coquiTTS in kaggle notebook

coqui coquitts jupyter-notebook kaggle vocoder voice-cloning

Last synced: 06 Dec 2024