Projects in Awesome Lists tagged with fastspeech2
A curated list of projects in awesome lists tagged with fastspeech2 .
https://github.com/open-mmlab/amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion
Last synced: 12 May 2025
https://github.com/open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
audio-generation audio-synthesis audioldm audit emilia fastspeech2 maskgct music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e vits vocoder voice-conversion
Last synced: 28 Mar 2025
https://github.com/tensorspeech/tensorflowtts
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts
Last synced: 09 Apr 2025
https://github.com/TensorSpeech/TensorFlowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts
Last synced: 24 Mar 2025
https://github.com/TensorSpeech/TensorflowTTS
:stuck_out_tongue_closed_eyes: TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, German and Easy to adapt for other languages)
chinese-tts fastspeech fastspeech2 german-tts japanese-tts korea-tts melgan mobile-tts multi-speaker-tts multiband-melgan parallel-wavegan real-time speech-synthesis tacotron2 tensorflow2 text-to-speech tflite tts vocoder zh-tts
Last synced: 28 Nov 2024
https://github.com/zdisket/tensorvox
Desktop application for neural speech synthesis written in C++
desktop fastspeech2 mb-melgan multiband-melgan phoneme real-time speech-synthesis tacotron2 text-to-speech tts voice-synthesis
Last synced: 08 May 2025
https://github.com/xcmyz/fastspeech2
The Implementation of FastSpeech2 Based on Pytorch.
fastspeech fastspeech2 pytorch speech-synthesis tts
Last synced: 01 Dec 2024
https://github.com/dathudeptrai/fastspeech2
A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
fastspeech fastspeech2 real-time tensorflow tensorflow2
Last synced: 14 Apr 2025
https://github.com/gagan3012/image2audio
Convert Image to audio using ViT, GPT and FastSpeech
fastspeech2 gpt-2 image-captioning imagecaptioning pytorch speech-to-text vit
Last synced: 07 Apr 2025
https://github.com/mariatepei/vt_thesis_mtepei
This repository accompanies my MSc Thesis for the degree Voice Technology, storing all referenced data and other relevant resources.
data-augmentation fastspeech2 speech-recognition whisper
Last synced: 05 Apr 2025