awesome-python-scientific-audio

Curated list of python software and packages related to scientific research in audio
https://github.com/faroit/awesome-python-scientific-audio

Last synced: about 5 hours ago
JSON representation

Audio Related Packages
- mutagen - Reads and writes all kind of audio metadata for various formats.
- pyAV - PyAV is a Pythonic binding for FFmpeg or Libav.
- AudioTSM - real-time audio time-scale modification procedures.
- NSGT - Non-stationary gabor transform, constant-q.
- pydub - Manipulate audio with a simple and easy high level interface.
- pytftb - signal/pytftb) - Implementation of the MATLAB Time-Frequency Toolbox.
- PyWavelets - Discrete Wavelet Transform in Python.
- Resampy - Sample rate conversion.
- STFT - werner/stft) [:package:](https://pypi.python.org/pypi/stft) - Standalone package for Short-Time Fourier Transform.
- aubio - Feature extractor, written in C, Python interface.
- essentia - Music related low level and high level feature extractor, C++ based, includes Python bindings.
- muda - Musical Data Augmentation.
- aeneas - Forced aligner, based on MFCC+DTW, 35+ languages.
- persephone - tools/persephone) [:package:](https://pypi.org/project/persephone/) - Automatic phoneme transcription tool.
- SIDEKIT - Speaker and Language recognition.
- Brian2 - team/brian2) [:package:](https://pypi.python.org/pypi/Brian2) - Spiking neural networks simulator, includes cochlea model.
- pyloudnorm - Audio loudness meter and normalization, implements ITU-R BS.1770-4.
- Sound Field Synthesis Toolbox - python) [:package:](https://pypi.python.org/pypi/sfs/) - Sound Field Synthesis Toolbox.
- NUSSL - Holistic source separation framework including DSP methods and deep learning methods.
- NIMFA - Several flavors of non-negative-matrix factorization.
- Madmom - MIR packages with strong focus on beat detection, onset detection and chord recognition.
- msaf - Music Structure Analysis Framework.
- librosa - General audio and music analysis.
- Music21 - Toolkit for Computer-Aided Musicology.
- Mido - Realtime MIDI wrapper.
- PYO - Realtime audio dsp engine.
- beets - Music library manager and [MusicBrainz](https://musicbrainz.org/) tagger.
- musdb - mus-db) [:package:](https://pypi.python.org/pypi/musdb) - Parse and process the MUSDB18 dataset.
- medleydb - Parse [medleydb](http://medleydb.weebly.com/) audio + annotations.
- mirdata - dataset-loaders/mirdata) [:package:](https://pypi.python.org/pypi/mirdata) - Common loaders for Music Information Retrieval (MIR) datasets.
- VamPy Host - Interface compiled vamp plugins.
- (Py)Soundfile - Library based on libsndfile, CFFI, and NumPy.
- audiolazy - Expressive Digital Signal Processing (DSP) package for Python.
- audioread - Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.
- stempeg - read/write of STEMS multistream audio.
- acoustics - acoustics/python-acoustics/) [:package:](https://pypi.python.org/pypi/acoustics) - useful tools for acousticians.
- AudioTK - DSP filter toolbox (lots of filters).
- Gammatone - Gammatone filterbank implementation.
- pyFFTW - Wrapper for FFTW(3).
- matchering - Automated reference audio mastering.
- MDCT - werner/mdct) [:package:](https://pypi.python.org/pypi/mdct) - MDCT transform.
- pydub - Manipulate audio with a simple and easy high level interface.
- pyroomacoustics - Room Acoustics Simulation (RIR generator)
- PyRubberband - Wrapper for [rubberband](http://breakfastquay.com/rubberband/) to do pitch-shifting and time-stretching.
- SFS-Python - python) [:package:](https://pypi.python.org/pypi/sfs/) - Sound Field Synthesis Toolbox.
- sound_field_analysis - py) [:package:](https://pypi.org/project/sound-field-analysis/) - Analyze, visualize and process sound field data recorded by spherical microphone arrays.
- pytftb - signal/pytftb) - Implementation of the MATLAB Time-Frequency Toolbox.
- TimeSide (Beta) - high level audio analysis, imaging, transcoding, streaming and labelling.
- tinytag - reading music meta data of MP3, OGG, FLAC and Wave files.
- sed_eval - ARG/sed_eval) [:package:](https://pypi.org/project/sed_eval/) - Evaluation toolbox for Sound Event Detection
- Pretty-MIDI - midi) [:package:](https://pypi.python.org/pypi/pretty-midi) - Utility functions for handling MIDI data in a nice/intuitive way.
- acoustics - acoustics/python-acoustics/) [:package:](https://pypi.python.org/pypi/acoustics) - useful tools for acousticians.
- pyFFTW - Wrapper for FFTW(3).
- SFS-Python - python) [:package:](https://pypi.python.org/pypi/sfs/) - Sound Field Synthesis Toolbox.
- audioFlux - A library for audio and music analysis, feature extraction.
- python_speech_features - Common speech features for ASR.
- pyYAAFE - Python bindings for YAAFE feature extractor.
- speechpy - Library for Speech Processing and Recognition, mostly feature extraction for now.
- spafe - Python library for features extraction from audio files.
- audiomentations - Audio Data Augmentation.
- pydiogment - Audio Data Augmentation.
- deepspeech - Pretrained automatic speech recognition.
- Parselmouth - parselmouth/) - Python interface to the [Praat](http://www.praat.org) phonetics and speech analysis, synthesis, and manipulation software.
- pyannote.audio - audio) [:package:](https://pypi.org/project/pyannote-audio/) - Neural building blocks for speaker diarization.
- pyAudioAnalysis - Feature Extraction, Classification, Diarization.
- py-webrtcvad - webrtcvad) [:package:](https://pypi.python.org/pypi/webrtcvad/) - Interface to the WebRTC Voice Activity Detector.
- pypesq - pesq) - Wrapper for the PESQ score calculation.
- pystoi - Short Term Objective Intelligibility measure (STOI).
- PyWorldVocoder - Wrapper-for-World-Vocoder) - Wrapper for Morise's World Vocoder.
- Montreal Forced Aligner - Forced-Aligner) - Forced aligner, based on Kaldi (HMM), English (others can be trained).
- SpeechRecognition - Wrapper for several ASR engines and APIs, online and offline.
- sed_eval - ARG/sed_eval) [:package:](https://pypi.org/project/sed_eval/) - Evaluation toolbox for Sound Event Detection
- cochlea - Inner ear models.
- Loudness - Perceived loudness, includes Zwicker, Moore/Glasberg model.
- NTFLib - Sparse Beta-Divergence Tensor Factorization.
- NIMFA - Several flavors of non-negative-matrix factorization.
- Catchy - Corpus Analysis Tools for Computational Hook Discovery.
- chord-detection - detection) - Algorithms for chord detection and key estimation.
- mir_eval - Common scores for various MIR tasks. Also includes bss_eval implementation.
- Kapre - Keras Audio Preprocessors
- TorchAudio - PyTorch Audio Loaders
- nnAudio - Accelerated audio processing using 1D convolution networks in PyTorch.
- Music21 - Toolkit for Computer-Aided Musicology.
- mingus - mingus) [:package:](https://pypi.org/project/mingus) - Advanced music theory and notation package with MIDI file and playback support.
- Pretty-MIDI - midi) [:package:](https://pypi.python.org/pypi/pretty-midi) - Utility functions for handling MIDI data in a nice/intuitive way.
- Jupylet - Subtractive, additive, FM, and sample-based sound synthesis.
- python-sounddevice - sounddevice.readthedocs.io) [:package:](https://pypi.python.org/pypi/sounddevice) - PortAudio wrapper providing realtime audio I/O with NumPy.
- ReTiSAR - Binarual rendering of streamed or IR-based high-order spherical microphone array signals.
- Soundcloud API - python) [:package:](https://pypi.python.org/pypi/soundcloud) - Wrapper for [Soundcloud API](https://developers.soundcloud.com/).
- audiomate - Loading different types of audio datasets.
- mutagen - Reads and writes all kind of audio metadata for various formats.
- pyAV - PyAV is a Pythonic binding for FFmpeg or Libav.
- (Py)Soundfile - Library based on libsndfile, CFFI, and NumPy.
- AudioTSM - real-time audio time-scale modification procedures.
- pydub - Manipulate audio with a simple and easy high level interface.
- PyWavelets - Discrete Wavelet Transform in Python.
- Resampy - Sample rate conversion.
- STFT - werner/stft) [:package:](https://pypi.python.org/pypi/stft) - Standalone package for Short-Time Fourier Transform.
- aubio - Feature extractor, written in C, Python interface.
- essentia - Music related low level and high level feature extractor, C++ based, includes Python bindings.
- gentle - Forced-aligner built on Kaldi.
- SIDEKIT - Speaker and Language recognition.
- Brian2 - team/brian2) [:package:](https://pypi.python.org/pypi/Brian2) - Spiking neural networks simulator, includes cochlea model.
- commonfate - Common Fate Model and Transform.
- mir_eval - Common scores for various MIR tasks. Also includes bss_eval implementation.
- msaf - Music Structure Analysis Framework.
- librosa - General audio and music analysis.
- musdb - mus-db) [:package:](https://pypi.python.org/pypi/musdb) - Parse and process the MUSDB18 dataset.
- medleydb - Parse [medleydb](http://medleydb.weebly.com/) audio + annotations.
Tutorials
- Introduction to Numpy and Scipy - lectures/scipy-lecture-notes) - Highly recommended tutorial, covers large parts of the scientific Python ecosystem.
- Numpy for MATLAB® Users - Short overview of equivalent python functions for switchers.
- MIR Notebooks - mir) - collection of instructional iPython Notebooks for music information retrieval (MIR).
- Live-coding a music synthesizer - coding video showing how to use the SoundDevice library to reproduce realistic sounds. [Code](https://github.com/cool-RR/python_synthesizer).
- Whirlwind Tour Of Python
- Selected Topics in Audio Signal Processing - Exercises as iPython notebooks.
- Numpy for MATLAB® Users - Short overview of equivalent python functions for switchers.
- MIR Notebooks - mir) - collection of instructional iPython Notebooks for music information retrieval (MIR).
Books
- Fundamentals of Music Processing - Meinard Müller, comes with Python exercises.
- Python Data Science Handbook - Jake Vanderplas, Excellent Book and accompanying tutorial notebooks.
Scientific Papers
- pyannote.audio: neural building blocks for speaker diarization - Hervé Bredin, Ruiqing Yin, Juan Manuel Coria, Gregory Gelly, Pavel Korshunov, Marvin Lavechin, Diego Fustes, Hadrien Titeux, Wassim Bouaziz, Marie-Philippe Gill, ICASSP 2020.
- Python for audio signal processing - John C. Glover, Victor Lazzarini and Joseph Timoney, Linux Audio Conference 2011.
- Python for audio signal processing - John C. Glover, Victor Lazzarini and Joseph Timoney, Linux Audio Conference 2011.
- librosa: Audio and Music Signal Analysis in Python - Brian McFee, Colin Raffel, Dawen Liang, Daniel P.W. Ellis, Matt McVicar, Eric Battenberg, Oriol Nieto, Scipy 2015.
Other Resources
- Coursera Course - Audio Signal Processing, Python based course from UPF of Barcelona and Stanford University.
- Slack Channel - Music Information Retrieval Community.
- Digital Signal Processing Course - Masters Course Material (University of Rostock) with many Python examples.
- Slack Channel - Music Information Retrieval Community.
Related lists
- PythonInMusic - Python](https://github.com/vinta/awesome-python) is large curated list of python packages. However, the audio section is very small.

Programming Languages

Python 30 C++ 5 Jupyter Notebook 3 C 2 Cython 1 MATLAB 1 Matlab 1

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

awesome-python-scientific-audio

Tutorials

Books

Scientific Papers

Other Resources

awesome-python-scientific-audio

Audio Related Packages

Tutorials

Books

Scientific Papers

Other Resources

Related lists