An open API service indexing awesome lists of open source software.

awesome-python-scientific-audio

Curated list of python software and packages related to scientific research in audio
https://github.com/faroit/awesome-python-scientific-audio

Last synced: about 5 hours ago
JSON representation

    • mutagen - Reads and writes all kind of audio metadata for various formats.
    • pyAV - PyAV is a Pythonic binding for FFmpeg or Libav.
    • AudioTSM - real-time audio time-scale modification procedures.
    • NSGT - Non-stationary gabor transform, constant-q.
    • pydub - Manipulate audio with a simple and easy high level interface.
    • pytftb - signal/pytftb) - Implementation of the MATLAB Time-Frequency Toolbox.
    • PyWavelets - Discrete Wavelet Transform in Python.
    • Resampy - Sample rate conversion.
    • STFT - werner/stft) [:package:](https://pypi.python.org/pypi/stft) - Standalone package for Short-Time Fourier Transform.
    • aubio - Feature extractor, written in C, Python interface.
    • essentia - Music related low level and high level feature extractor, C++ based, includes Python bindings.
    • muda - Musical Data Augmentation.
    • aeneas - Forced aligner, based on MFCC+DTW, 35+ languages.
    • persephone - tools/persephone) [:package:](https://pypi.org/project/persephone/) - Automatic phoneme transcription tool.
    • SIDEKIT - Speaker and Language recognition.
    • Brian2 - team/brian2) [:package:](https://pypi.python.org/pypi/Brian2) - Spiking neural networks simulator, includes cochlea model.
    • pyloudnorm - Audio loudness meter and normalization, implements ITU-R BS.1770-4.
    • Sound Field Synthesis Toolbox - python) [:package:](https://pypi.python.org/pypi/sfs/) - Sound Field Synthesis Toolbox.
    • NUSSL - Holistic source separation framework including DSP methods and deep learning methods.
    • NIMFA - Several flavors of non-negative-matrix factorization.
    • Madmom - MIR packages with strong focus on beat detection, onset detection and chord recognition.
    • msaf - Music Structure Analysis Framework.
    • librosa - General audio and music analysis.
    • Music21 - Toolkit for Computer-Aided Musicology.
    • Mido - Realtime MIDI wrapper.
    • PYO - Realtime audio dsp engine.
    • beets - Music library manager and [MusicBrainz](https://musicbrainz.org/) tagger.
    • musdb - mus-db) [:package:](https://pypi.python.org/pypi/musdb) - Parse and process the MUSDB18 dataset.
    • medleydb - Parse [medleydb](http://medleydb.weebly.com/) audio + annotations.
    • mirdata - dataset-loaders/mirdata) [:package:](https://pypi.python.org/pypi/mirdata) - Common loaders for Music Information Retrieval (MIR) datasets.
    • VamPy Host - Interface compiled vamp plugins.
    • (Py)Soundfile - Library based on libsndfile, CFFI, and NumPy.
    • audiolazy - Expressive Digital Signal Processing (DSP) package for Python.
    • audioread - Cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.
    • stempeg - read/write of STEMS multistream audio.
    • acoustics - acoustics/python-acoustics/) [:package:](https://pypi.python.org/pypi/acoustics) - useful tools for acousticians.
    • AudioTK - DSP filter toolbox (lots of filters).
    • Gammatone - Gammatone filterbank implementation.
    • pyFFTW - Wrapper for FFTW(3).
    • matchering - Automated reference audio mastering.
    • MDCT - werner/mdct) [:package:](https://pypi.python.org/pypi/mdct) - MDCT transform.
    • pydub - Manipulate audio with a simple and easy high level interface.
    • pyroomacoustics - Room Acoustics Simulation (RIR generator)
    • PyRubberband - Wrapper for [rubberband](http://breakfastquay.com/rubberband/) to do pitch-shifting and time-stretching.
    • SFS-Python - python) [:package:](https://pypi.python.org/pypi/sfs/) - Sound Field Synthesis Toolbox.
    • sound_field_analysis - py) [:package:](https://pypi.org/project/sound-field-analysis/) - Analyze, visualize and process sound field data recorded by spherical microphone arrays.
    • pytftb - signal/pytftb) - Implementation of the MATLAB Time-Frequency Toolbox.
    • TimeSide (Beta) - high level audio analysis, imaging, transcoding, streaming and labelling.
    • tinytag - reading music meta data of MP3, OGG, FLAC and Wave files.
    • sed_eval - ARG/sed_eval) [:package:](https://pypi.org/project/sed_eval/) - Evaluation toolbox for Sound Event Detection
    • Pretty-MIDI - midi) [:package:](https://pypi.python.org/pypi/pretty-midi) - Utility functions for handling MIDI data in a nice/intuitive way.
    • acoustics - acoustics/python-acoustics/) [:package:](https://pypi.python.org/pypi/acoustics) - useful tools for acousticians.
    • pyFFTW - Wrapper for FFTW(3).
    • SFS-Python - python) [:package:](https://pypi.python.org/pypi/sfs/) - Sound Field Synthesis Toolbox.
    • audioFlux - A library for audio and music analysis, feature extraction.
    • python_speech_features - Common speech features for ASR.
    • pyYAAFE - Python bindings for YAAFE feature extractor.
    • speechpy - Library for Speech Processing and Recognition, mostly feature extraction for now.
    • spafe - Python library for features extraction from audio files.
    • audiomentations - Audio Data Augmentation.
    • pydiogment - Audio Data Augmentation.
    • deepspeech - Pretrained automatic speech recognition.
    • Parselmouth - parselmouth/) - Python interface to the [Praat](http://www.praat.org) phonetics and speech analysis, synthesis, and manipulation software.
    • pyannote.audio - audio) [:package:](https://pypi.org/project/pyannote-audio/) - Neural building blocks for speaker diarization.
    • pyAudioAnalysis - Feature Extraction, Classification, Diarization.
    • py-webrtcvad - webrtcvad) [:package:](https://pypi.python.org/pypi/webrtcvad/) - Interface to the WebRTC Voice Activity Detector.
    • pypesq - pesq) - Wrapper for the PESQ score calculation.
    • pystoi - Short Term Objective Intelligibility measure (STOI).
    • PyWorldVocoder - Wrapper-for-World-Vocoder) - Wrapper for Morise's World Vocoder.
    • Montreal Forced Aligner - Forced-Aligner) - Forced aligner, based on Kaldi (HMM), English (others can be trained).
    • SpeechRecognition - Wrapper for several ASR engines and APIs, online and offline.
    • sed_eval - ARG/sed_eval) [:package:](https://pypi.org/project/sed_eval/) - Evaluation toolbox for Sound Event Detection
    • cochlea - Inner ear models.
    • Loudness - Perceived loudness, includes Zwicker, Moore/Glasberg model.
    • NTFLib - Sparse Beta-Divergence Tensor Factorization.
    • NIMFA - Several flavors of non-negative-matrix factorization.
    • Catchy - Corpus Analysis Tools for Computational Hook Discovery.
    • chord-detection - detection) - Algorithms for chord detection and key estimation.
    • mir_eval - Common scores for various MIR tasks. Also includes bss_eval implementation.
    • Kapre - Keras Audio Preprocessors
    • TorchAudio - PyTorch Audio Loaders
    • nnAudio - Accelerated audio processing using 1D convolution networks in PyTorch.
    • Music21 - Toolkit for Computer-Aided Musicology.
    • mingus - mingus) [:package:](https://pypi.org/project/mingus) - Advanced music theory and notation package with MIDI file and playback support.
    • Pretty-MIDI - midi) [:package:](https://pypi.python.org/pypi/pretty-midi) - Utility functions for handling MIDI data in a nice/intuitive way.
    • Jupylet - Subtractive, additive, FM, and sample-based sound synthesis.
    • python-sounddevice - sounddevice.readthedocs.io) [:package:](https://pypi.python.org/pypi/sounddevice) - PortAudio wrapper providing realtime audio I/O with NumPy.
    • ReTiSAR - Binarual rendering of streamed or IR-based high-order spherical microphone array signals.
    • Soundcloud API - python) [:package:](https://pypi.python.org/pypi/soundcloud) - Wrapper for [Soundcloud API](https://developers.soundcloud.com/).
    • audiomate - Loading different types of audio datasets.
    • mutagen - Reads and writes all kind of audio metadata for various formats.
    • pyAV - PyAV is a Pythonic binding for FFmpeg or Libav.
    • (Py)Soundfile - Library based on libsndfile, CFFI, and NumPy.
    • AudioTSM - real-time audio time-scale modification procedures.
    • pydub - Manipulate audio with a simple and easy high level interface.
    • PyWavelets - Discrete Wavelet Transform in Python.
    • Resampy - Sample rate conversion.
    • STFT - werner/stft) [:package:](https://pypi.python.org/pypi/stft) - Standalone package for Short-Time Fourier Transform.
    • aubio - Feature extractor, written in C, Python interface.
    • essentia - Music related low level and high level feature extractor, C++ based, includes Python bindings.
    • gentle - Forced-aligner built on Kaldi.
    • SIDEKIT - Speaker and Language recognition.
    • Brian2 - team/brian2) [:package:](https://pypi.python.org/pypi/Brian2) - Spiking neural networks simulator, includes cochlea model.
    • commonfate - Common Fate Model and Transform.
    • mir_eval - Common scores for various MIR tasks. Also includes bss_eval implementation.
    • msaf - Music Structure Analysis Framework.
    • librosa - General audio and music analysis.
    • musdb - mus-db) [:package:](https://pypi.python.org/pypi/musdb) - Parse and process the MUSDB18 dataset.
    • medleydb - Parse [medleydb](http://medleydb.weebly.com/) audio + annotations.
  • Tutorials

  • Books

  • Scientific Papers

  • Other Resources

    • PythonInMusic - Python](https://github.com/vinta/awesome-python) is large curated list of python packages. However, the audio section is very small.