Projects in Awesome Lists tagged with torchaudio
A curated list of projects in awesome lists tagged with torchaudio.
https://github.com/2noise/chattts
A generative speech model for daily dialogue.
agent chat chatgpt chattts chinese chinese-language english english-language gpt llm llm-agent natural-language-inference python text-to-speech torch torchaudio tts
Last synced: 12 May 2025
https://github.com/kentonishi/torch-pitch-shift
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
audio-augmentation augmentation gpu-support pitch-shift pytorch sound-processing torch torchaudio
Last synced: 09 Apr 2025
https://github.com/kentonishi/torch-time-stretch
Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
audio-augmentation augmentation gpu-support pytorch sound-processing time-stretch torch torchaudio
Last synced: 21 Dec 2024
https://github.com/pinto0309/pytorch4raspberrypi
Cross-compilation of PyTorch for armv7l (32-bit) on Raspberry Pi OS
armv7l pytorch raspberry-pi torchaudio torchvision
Last synced: 05 May 2025
https://github.com/eonu/torch-fsdd
A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.
audio audio-dataset data-loader free-spoken-digit-dataset fsdd pytorch-dataloader pytorch-dataset pytorch-dataset-split torch torchaudio
Last synced: 07 May 2025
https://github.com/crispengari/animal-sound-classification
A simple artificial neural network model that uses deep learning and torchaudio to classify cat and dog sounds.
artificial-intelligence artificial-neural-networks audio audio-processing deep-learning deep-neural-networks machine-learning python pytorch rnn torchaudio
Last synced: 01 May 2025
https://github.com/glefundes/misophonia-bot
🤖 Telegram bot powered by deep learning. Automatically assesses the safety of audio clips and voice messages for people suffering from misophonia.
audio audio-classification deep-learning pytorch telegram telegram-bot telegram-bot-api torchaudio
Last synced: 10 Feb 2025
https://github.com/mehdihosseinimoghadam/signal-processing
Signal Processing with Python and Librosa
griffinlim librosa melspectrogram python signal-processing spectrogram torchaudio variational-autoencoder vector-quantization voice voice-reconstruction voice-synthesis vq-vae
Last synced: 14 Jan 2025
https://github.com/crispengari/emotionai
(😞 😨 😄 😮 😍 😠 😐 🤮) A simple deep learning API that classifies human emotions from audio and text.
artificial-intelligence deeplearning flask machine-learning python pytorch torch torchaudio torchvision
Last synced: 16 Dec 2024
https://github.com/vectominist/switchboard-wsj-utils
Utilities for preprocessing the Switchboard and WSJ corpora in Python3
python speech-corpus switchboard torchaudio wsj wtimit
Last synced: 02 Dec 2024
https://github.com/crispengari/torch-audio
🎶🎼 This repository contains notebooks used to train audio classification models in PyTorch using torchaudio.
artificial-intelligence artificial-neural-networks audio-processing classification deep-learning machine-learning python pytorch torchaudio
Last synced: 03 Apr 2025
https://github.com/nhassl3/detect-russian-road-signs
A recognition system for road signs of the Russian Federation that uses a pretrained model for real-time object detection and image segmentation to improve road safety
machine-learning opencv-python roboflow-dataset sign-recognition torch torchaudio torchvision ultralytics
Last synced: 24 Feb 2025
https://github.com/crispengari/hbsc
🩺♥ Heart Beat Sound Classification (HBSC) is a GraphQL API for classifying heartbeat sounds in real time.
ariadne artificial-intelligence graphql machine-learning python pytorch sound-classification torch torchaudio uvicorn
Last synced: 16 Dec 2024
https://github.com/philipamadasun/ser-model-for-dimensional-attribute-prediction
Speaker Emotion Recognition model for multi-attribute prediction
deep-learning speaker-emotion-recogntion torch torchaudio transformer
Last synced: 10 Apr 2025
https://github.com/nomadsdev/math-gen-ai
MathGenAI is a Python project that generates math problems and provides AI-generated explanations
ai math math-gen-ai proleak-innovation python torch torchaudio torchvision transformers
Last synced: 01 Apr 2025
https://github.com/chris-santiago/emonet
CNN-LSTM model for audio emotion detection in children with adverse childhood events.
adverse-childhood-events audio-classification cnn-lstm emotion-detection emotion-recognition melspectrogram pytorch pytorch-lightning torchaudio torchvision
Last synced: 02 Apr 2025
https://github.com/thekartikeyamishra/voicecloner
The Voice Cloner is a Python-based project that leverages Tacotron 2 and WaveGlow models for text-to-speech (TTS) synthesis and basic voice cloning. This project supports 22 official Indian languages, including Sanskrit, making it versatile for multilingual text input.
ai indic-transliteration librosa machine-learning numpy nvidia-pyindex nvidia-tacotron2 nvidia-waveglow python torch torchaudio
Last synced: 20 Feb 2025
https://github.com/d-f/nylon-amt
Automatic music transcription for classical guitar with hierarchical frequency-time transformers and the MAESTRO dataset
huggingface-transformers music-information-retrieval music-transcription torchaudio transformer
Last synced: 07 Apr 2025
https://github.com/baonguyen6742/uv-install-torch
Tutorial to install torch/pytorch with cuda using uv
cuda install installation package python pytorch resolver torch torchaudio torchvision tutorial uv
Last synced: 06 Apr 2025
https://github.com/efenstor/pytorch-rocm-gfx1010
Instructions on how to build PyTorch on Debian 12 with support for the AMD gfx1010 architecture
5600xt 5700xt amdgpu building-instructions chainner comfyui gfx1010 pytorch radeon rocm torchaudio torchvision
Last synced: 13 Apr 2025
https://github.com/capjamesg/taylor-swift
Find how similar your voice is to Taylor Swift (WIP) ✨
audio-analysis spectrograms taylor-swift torchaudio
Last synced: 03 Apr 2025
https://github.com/philipamadasun/whisper_torch
An implementation of the Whisper "turbo" model in torch
deep-learning fine-tuning torch torchaudio whisper
Last synced: 31 Mar 2025