awesome-python-audio
Awesome Python resources related to audio and music
https://github.com/andreimatveyeu/awesome-python-audio
-
Datasets
-
Music
- NSynth - Large-scale dataset of annotated musical notes
- The Million Song Dataset
- MusicCaps
- AcousticBrainz
- Beatport EDM Key
- DEAM
- IRMAS
- MAESTRO
- MedleyDB - Multi-track mixing research
- Musdb18
- Open MIC
- RWC Music Database
- FMA
- DALI
- Jamendo Audio Tagging - Multi-label audio tagging dataset
- GiantMIDI-Piano - Large-scale MIDI dataset of classical piano music
- LAION-Audio-630K - Audio-text pairs for CLAP training
- hsmusic
- symbolic-music-datasets
- MagnaTagATune
- MusicNet
-
Audio
- AudioSet - Large-scale dataset of manually annotated audio events
- VoxCeleb - Large-scale speaker identification dataset
- VCTK
- LibriSpeech
- UrbanSound8K
- Birdsong
- Freesound Dataset
- RAVDESS - Audio-visual dataset of emotional speech and song
- TIDIGITS
- Speech Commands
- ESC-50
- Free Spoken Digit Dataset
- Common Voice
-
-
Analysis & Feature Extraction
-
Audio Processing & I/O
- numpy & scipy.io.wavfile
- wave
- audioread - Cross-library audio decoding (GStreamer + Core Audio + MAD + FFmpeg)
- matchering
- PyDub
- pedalboard
- pyAudioProcessing
- noisereduce
- torchaudio
- audiomentations
- Matchering-cli
- babycat
- SoundDevice
- torch-audiomentations
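
For orientation, here is a minimal sketch of basic WAV I/O using only the standard-library `wave` module listed above. The helper names `write_sine` and `read_info` are hypothetical, chosen for illustration, and the tone parameters (440 Hz, 8 kHz, mono 16-bit PCM) are arbitrary assumptions, not recommendations from any of the listed projects.

```python
import math
import os
import struct
import tempfile
import wave


def write_sine(path, freq_hz=440.0, seconds=0.5, rate=8000):
    """Write a mono 16-bit PCM sine tone; return the number of frames written."""
    n_frames = int(seconds * rate)
    # Half of full scale to leave headroom; values fit in a signed 16-bit int.
    samples = (
        int(32767 * 0.5 * math.sin(2 * math.pi * freq_hz * i / rate))
        for i in range(n_frames)
    )
    with wave.open(path, "wb") as wf:
        wf.setnchannels(1)      # mono
        wf.setsampwidth(2)      # 2 bytes per sample = 16-bit
        wf.setframerate(rate)
        wf.writeframes(b"".join(struct.pack("<h", s) for s in samples))
    return n_frames


def read_info(path):
    """Return (channels, sample width in bytes, sample rate, frame count)."""
    with wave.open(path, "rb") as wf:
        return (
            wf.getnchannels(),
            wf.getsampwidth(),
            wf.getframerate(),
            wf.getnframes(),
        )


if __name__ == "__main__":
    demo_path = os.path.join(tempfile.mkdtemp(), "tone.wav")
    write_sine(demo_path)
    print(read_info(demo_path))
```

The other libraries in this section generally offer higher-level interfaces for the same kind of task (more formats, array-based access, effects); consult each project's documentation for its actual API.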
-
Music Generation & AI
-
Text-to-Speech
- NSynth
- AudioCraft
- musicautobot
- Magenta
- Stable Audio Tools
- Riffusion - Real-time music generation using stable diffusion on spectrograms
-
-
Tutorials
-
Music
-
librosa
-
PyDub
-
-
Source Separation
-
Text-to-Speech
- Spleeter
- Demucs - State-of-the-art music source separation from Meta
- pydsm
- Asteroid - PyTorch-based audio source separation toolkit for researchers
- audio-separator - MDX-Net, VR Arch and Demucs models
-
-
Playback & Services
-
Text-to-Speech
- beets
- discord.py
- pygame.mixer
- Mopidy
- Spotipy
- pyglet - Cross-platform windowing and multimedia library
- pyradio
- freesound-python
- mpv
- Mopidy-YouTube
- audiostream
- MusicBot
- miniaudio
- pyAV
-
-
Speech Processing
-
Text-to-Speech
-
Speech-to-Text
- WhisperX - Word-level timestamps and speaker diarization
- SpeechBrain
- Whisper
- pyannote-audio
- SpeechRecognition
- faster-whisper
- Vosk
-
-
Synthesis and Generation
-
Music
- FoxDot - Python-based live coding environment for sound synthesis
- WaveGAN
- Audioguide
- PySynth
- Python-musical
- Nsynth
-
-
Music Theory & Composition
-
Music Transcription & Pitch
-
Text-to-Speech
- basic-pitch
- piano_transcription_inference - High-resolution piano transcription with pedal detection
- torchcrepe
- CREPE
- MT3 - Multi-instrument automatic music transcription from Google Magenta
-
-
Analysis and Visualization
- Realtime_PyAudio_FFT
- audio-fingerprint-identifying-python - Shazam-like app that identifies songs using audio fingerprints, spectrum analysis, and the Fast Fourier transform
- AudioLazy
- AudioOwl
- BregmanToolkit
- paura
-
Music Theory and Composition
-
Music
- PyTheory
- Musical-scales
- MusicMaker - Audio Description Language
- pyHarmonySearch
-
-
Audio Manipulation
-
Synthesis & Sound Design
-
Text-to-Speech
-
-
Audio Embeddings & Representations
- OpenL3 - Open-source deep audio and image embeddings
- CLAP (LAION) - Contrastive Language-Audio Pretraining for zero-shot audio classification
- CLAP (Microsoft)
- wav2vec2 - Self-supervised speech representations from Facebook AI
- panns-inference
-
Playback and Streaming
-
Music
-
Categories
- Datasets: 43
- Audio Processing & I/O: 14
- Playback & Services: 14
- Analysis & Feature Extraction: 11
- Speech Processing: 10
- Music Theory & Composition: 10
- Tutorials: 9
- Synthesis & Sound Design: 7
- Synthesis and Generation: 6
- Music Generation & AI: 6
- Analysis and Visualization: 6
- Source Separation: 5
- Music Transcription & Pitch: 5
- Audio Embeddings & Representations: 5
- Music Theory and Composition: 4
- Audio Manipulation: 4
- Playback and Streaming: 1
Keywords
- python: 40
- audio: 26
- music: 18
- deep-learning: 13
- music-information-retrieval: 11
- machine-learning: 11
- audio-processing: 8
- pytorch: 8
- dsp: 8
- sound: 7
- dataset: 7
- speech-recognition: 6
- speech-to-text: 5
- signal-processing: 4
- mir: 4
- audio-analysis: 4
- midi: 4
- scipy: 3
- pretrained-models: 3
- speaker-verification: 3
- spectral-analysis: 3
- speech: 3
- pitch: 3
- asr: 3
- sound-processing: 3
- mfcc: 3
- python-library: 3
- synthesis: 3
- python3: 3
- augmentation: 3
- gamedev: 2
- whisper: 2
- mopidy: 2
- audio-data: 2
- speaker-diarization: 2
- vst: 2
- speaker-recognition: 2
- audio-data-augmentation: 2
- speech-enhancement: 2
- speech-processing: 2
- speech-separation: 2
- audio-effects: 2
- data-augmentation: 2
- voice-recognition: 2
- text-to-speech: 2
- discord: 2
- c: 2
- feature-extraction: 2
- algorithmic-composition: 2
- audio-features: 2