An open API service indexing awesome lists of open source software.

awesome-python-audio-science

A curated, public-facing list of Python audio science, scientific audio research, music information retrieval, DSP, machine learning audio, datasets, notebooks, and creator-audio tooling.
https://github.com/GareBear99/awesome-python-audio-science

Last synced: 5 days ago
JSON representation

  • Academic / Research SEO Positioning

    • Citation-ready ecosystem anchors

      • FreeEQ8 - source JUCE/C++ EQ plugin for DSP education, plugin testing, analyzer research, M/S processing examples, and citable audio-software packaging. |
      • FreeVox8 - resynthesis R&D anchor for STFT, phase vocoder, formant, masking, and spectral-processing research direction. |
      • Instrudio - source-of-truth virtual instrument runtime with physically informed Studio Violin, Web Audio synthesis, MIDI routing, and runtime metrics. |
      • Voxel Audio - facing media tooling useful for audio-visual research workflows. |
      • TizWildinEntertainmentHUB
  • Audio Plugin / Creator Tooling

    • IO router / deconstructed loops anchor

      • pedalboard - Python audio effects and VST3/AU plugin hosting from Spotify.
      • DawDreamer - Python DAW-style rendering and plugin hosting.
      • Vamp plugins / VamPy host - Python interface for Vamp audio-analysis plugins.
      • JUCE - C++ plugin framework; relevant for Python-assisted testing, packaging, and analysis pipelines.
  • Contributing

    • IO router / deconstructed loops anchor

  • Core Python Audio Packages

    • IO router / deconstructed loops anchor

      • librosa - Audio and music analysis library for Python, widely used for MIR and research workflows.
      • Essentia - C++/Python library for audio and music analysis, feature extraction, and large-scale research.
      • aubio - Audio feature extraction, onset detection, pitch detection, beat tracking, and segmentation.
      • audioFlux - Audio and music analysis library with spectral transforms and feature extraction.
      • torchaudio - PyTorch audio I/O, transforms, datasets, and deep-learning audio utilities.
      • JAX audio examples - Useful base for research pipelines that need accelerated differentiable audio experiments.
  • Datasets & Dataloaders

    • IO router / deconstructed loops anchor

      • MUSDB18 - Music source-separation dataset.
      • MedleyDB - Multitrack dataset for MIR research.
      • FMA - Free Music Archive dataset for music analysis.
      • AudioSet - Large-scale audio event dataset.
      • Common Voice - Open speech dataset.
      • Magenta Datasets - Music and symbolic datasets for creative ML.
  • DSP, Transforms & Spectral Analysis

    • IO router / deconstructed loops anchor

      • NumPy - Array computation base for most audio analysis and DSP pipelines.
      • SciPy Signal - Filtering, transforms, windows, spectral analysis, and signal-processing utilities.
      • pyFFTW - Python wrapper around FFTW for faster FFT-heavy workflows.
      • pyroomacoustics - Room acoustics simulation, beamforming, and sound-field experimentation.
      • acoustics - Acoustics calculations, room acoustics, decibels, filters, and standards-oriented utilities.
      • pyloudnorm - Loudness meter and normalization library based on ITU-R BS.1770.
      • matchering - Automated reference audio mastering in Python.
  • Feature Extraction & MIR

    • IO router / deconstructed loops anchor

      • mir_eval - Standard evaluation tools for music information retrieval tasks.
      • mirdata - Common loaders for MIR datasets.
      • msaf - Music structure analysis framework.
      • pretty_midi - MIDI analysis and manipulation utilities.
      • music21 - Computer-aided musicology and symbolic music analysis.
      • mido - MIDI message, port, and file utilities.
      • partitura - Symbolic music processing for score-performance research.
  • GareBear99 / TizWildin Audio Discovery Network

  • Loudness, Metering & Perceptual Audio

    • IO router / deconstructed loops anchor

      • pystoi - Short-Time Objective Intelligibility measure.
      • pesq - PESQ speech quality metric wrapper.
      • cochlea - Inner-ear auditory models.
  • Machine Learning Audio

    • IO router / deconstructed loops anchor

      • PyTorch - Deep-learning framework commonly used for audio ML, MIR, source separation, and speech.
      • TensorFlow Audio / Keras - Audio loading, preprocessing, and model training paths in the TensorFlow ecosystem.
      • Kapre - Keras audio preprocessing layers.
      • nnAudio - GPU-accelerated audio transforms using neural-network operations.
      • Hugging Face Audio - Models and datasets for classification, speech, separation, and generation tasks.
      • Asteroid - PyTorch-based audio source-separation toolkit.
      • SpeechBrain - Speech and audio toolkit for ASR, speaker recognition, separation, enhancement, and more.
  • Quick answer

  • Read / Write / Metadata

    • IO router / deconstructed loops anchor

      • audioread - Decode audio using available backends such as GStreamer, Core Audio, MAD, FFmpeg, and others.
      • pydub - Simple high-level audio manipulation, conversion, slicing, and export.
      • tinytag - Lightweight metadata reader for MP3, OGG, FLAC, WAV, and more.
      • stempeg - Read and write multi-stream audio, especially useful for stems and source-separation datasets.
  • Realtime Audio & MIDI

    • IO router / deconstructed loops anchor

      • Jupylet - Python game/audio environment with synth and sample support.
      • python-rtmidi - Realtime MIDI I/O bindings.
  • Source Separation

    • IO router / deconstructed loops anchor

      • Demucs - Music source separation with deep learning.
      • Spleeter - Source-separation library by Deezer.
      • Open-Unmix - Music source separation toolkit and model family.
      • nussl - Holistic source-separation framework.
      • musdb - MUSDB dataset tooling for source separation.
  • Speech, Voice & Alignment

    • IO router / deconstructed loops anchor

      • aeneas - Forced alignment of audio and text.
      • pyannote.audio - Neural speaker diarization and speaker-analysis toolkit.
      • SpeechRecognition - Wrapper for multiple speech recognition engines and APIs.
      • python_speech_features - Classic speech features such as MFCC and filterbanks.
      • spafe - Speech and audio feature extraction.
      • WORLD / PyWorld - WORLD vocoder Python wrapper for speech analysis/synthesis experiments.
  • Visualization, Notebooks & Web Audio

    • IO router / deconstructed loops anchor

      • matplotlib - Plot waveforms, spectrograms, features, and research figures.
      • plotly - Interactive audio-analysis plots and dashboards.
      • Jupyter - Notebook environment for audio experiments, MIR notebooks, and reproducible analysis.
      • Streamlit - Quick interactive audio analysis apps and dashboards.
      • Gradio - Simple ML/audio demos and web UIs.
      • TimeSide - Audio analysis, imaging, transcoding, streaming, and labelling tools.