An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with audio-processing

A curated list of projects in awesome lists tagged with audio-processing .

https://github.com/deezer/spleeter

Deezer source separation library including pretrained models.

audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals

Last synced: 12 May 2025

https://github.com/Deezer/spleeter

Deezer source separation library including pretrained models.

audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals

Last synced: 05 May 2025

https://github.com/tenacityteam/tenacity-legacy

THIS REPO IS NOT MAINTAINED ANYMORE. Please see https://codeberg.org/tenacityteam/tenacity for Tenacity, which is maintained.

audacity audio audio-applications audio-processing floss hacktoberfest libre privacy-friendly privacy-preserving recorder recording-app

Last synced: 27 Sep 2025

https://github.com/nvidia/dali

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch

Last synced: 13 May 2025

https://github.com/NVIDIA/DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch

Last synced: 15 Mar 2025

https://github.com/cgzirim/seek-tune

An implementation of Shazam's song recognition algorithm.

audio-fingerprinting audio-processing go golang not-shazam shazam song song-recognition-algorithm

Last synced: 17 Jul 2025

https://github.com/wyattblue/auto-editor

Auto-Editor: Efficient media analysis and rendering

audio audio-editing audio-processing automatic python3 video video-editing video-processing

Last synced: 12 May 2025

https://github.com/WyattBlue/auto-editor

Auto-Editor: Efficient media analysis and rendering

audio audio-editing audio-processing automatic python3 video video-editing video-processing

Last synced: 29 Mar 2025

https://github.com/stemrollerapp/stemroller

Isolate vocals, drums, bass, and other instrumental stems from any song

audio-processing bass deep-learning demucs drums electron javascript machine-learning python source-separation vocals

Last synced: 14 May 2025

https://github.com/scottlawsonbc/audio-reactive-led-strip

:musical_note: :rainbow: Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi

arduino audio-processing esp8266 music-visualizer python raspberry-pi signal-processing

Last synced: 15 May 2025

https://github.com/bitfieldaudio/OTTO

Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]

audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design

Last synced: 23 Apr 2025

https://github.com/bitfieldaudio/otto

Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]

audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design

Last synced: 15 May 2025

https://github.com/pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio audio-processing io machine-learning python pytorch speech

Last synced: 05 May 2025

https://github.com/blaizzy/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

apple-silicon audio-processing mlx multimodal speech-recognition speech-synthesis speech-to-text text-to-speech transformers

Last synced: 29 Jun 2025

https://github.com/faiface/beep

A little package that brings sound to any Go application. Suitable for playback and audio-processing.

audio audio-playback audio-processing go golang

Last synced: 14 May 2025

https://github.com/julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

audio-processing recognition speech speech-recognition

Last synced: 14 Jun 2025

https://github.com/gauravbh1010tt/deeplearn

Implementation of research papers on Deep Learning+ NLP+ CV in Python using Keras, Tensorflow and Scikit Learn.

audio-processing computer-vision deep-learning nlp

Last synced: 15 May 2025

https://github.com/GauravBh1010tt/DeepLearn

Implementation of research papers on Deep Learning+ NLP+ CV in Python using Keras, Tensorflow and Scikit Learn.

audio-processing computer-vision deep-learning nlp

Last synced: 14 Mar 2025

https://github.com/kfrlib/kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

audio audio-processing avx avx512 clang cplusplus cplusplus-14 cplusplus-17 cpp14 cpp17 cxx dft digital-signal-processing discrete-fourier-transform dsp fast-fourier-transform fft header-only simd

Last synced: 14 May 2025

https://github.com/audiamus/AaxAudioConverter

Convert Audible aax files to mp3 and m4a/m4b

aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3

Last synced: 10 May 2025

https://github.com/audiamus/aaxaudioconverter

Convert Audible aax files to mp3 and m4a/m4b

aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3

Last synced: 01 Apr 2025

https://github.com/ledfx/ledfx

LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.

audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled

Last synced: 13 May 2025

https://github.com/LedFx/LedFx

LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.

audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled

Last synced: 09 Apr 2025

https://github.com/guitarml/smartguitaramp

Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.

audio-processing guitar juce machinelearning neuralnetworks

Last synced: 16 May 2025

https://github.com/flutydeer/audio-slicer

A simple GUI application that slices audio with silence detection

audio-processing gui pyside6 qt6

Last synced: 03 Oct 2025

https://github.com/GuitarML/SmartGuitarAmp

Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.

audio-processing guitar juce machinelearning neuralnetworks

Last synced: 30 Apr 2025

https://github.com/mikeroyal/pipewire-guide

PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.

alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst

Last synced: 16 May 2025

https://github.com/mikeroyal/PipeWire-Guide

PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.

alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst

Last synced: 09 May 2025

https://github.com/timschneeb/rootlessjamesdsp

An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices

android audio audio-processing convolution dsp effects equalizer non-root rootless

Last synced: 14 May 2025

https://github.com/funaudiollm/inspiremusic

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

audio-generation audio-processing music-generation pytorch

Last synced: 15 May 2025

https://github.com/dbraun/dawdreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host

Last synced: 14 May 2025

https://github.com/DBraun/DawDreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host

Last synced: 16 Mar 2025

https://github.com/addictedcs/soundfingerprinting

Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.

acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam

Last synced: 23 Oct 2025

https://github.com/AddictedCS/soundfingerprinting

Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.

acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam

Last synced: 05 Apr 2025

https://github.com/acoustid/chromaprint

C library for generating audio fingerprints used by AcoustID

acoustid audio audio-analysis audio-fingerprinting audio-processing chromaprint

Last synced: 15 Mar 2025

https://github.com/timschneeb/RootlessJamesDSP

An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices

android audio audio-processing convolution dsp effects equalizer non-root rootless

Last synced: 08 Apr 2025

https://github.com/spotify/klio

Smarter data pipelines for audio.

audio-processing data-pipeline media-processing signal-processing

Last synced: 15 May 2025

https://github.com/f90/Wave-U-Net

Implementation of the Wave-U-Net for audio source separation

audio-processing deep-learning mit-license waveform-analysis

Last synced: 14 Jul 2025

https://github.com/x-lance/slam-llm

Speech, Language, Audio, Music Processing with Large Language Model

audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing

Last synced: 15 May 2025

https://github.com/Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

apple-silicon audio-processing mlx multimodal speech-recognition speech-synthesis speech-to-text text-to-speech transformers

Last synced: 04 May 2025

https://github.com/rnchg/APT

AI Productivity Tool - Free and open source, improve user productivity, and protect privacy and data security. Including but not limited to: built-in local exclusive ChatGPT, DeepSeek, Phi, Qwen and other models, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 14 Aug 2025

https://github.com/rnchg/apt

AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 15 May 2025

https://github.com/rnchg/Apt

AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 24 Mar 2025

https://github.com/X-LANCE/SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing

Last synced: 11 Sep 2025

https://github.com/open-mmlab/foleycrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio

Last synced: 04 Apr 2025

https://github.com/RelevanceAI/vectorhub

Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 27 Apr 2025

https://github.com/relevanceai/vectorhub

Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 04 Apr 2025

https://github.com/lagmoellertim/unsilence

Console Interface and Library to remove silent parts of a media file 🔈

audio-processing contributions-welcome hacktoberfest media python silence-speedup silencedetect video-processing

Last synced: 12 Jul 2025

https://github.com/open-mmlab/FoleyCrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio

Last synced: 30 Oct 2025

https://github.com/josephernest/samplerbox

SamplerBox is a sampler musical instrument based on RaspberryPi.

audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer

Last synced: 13 Apr 2025

https://github.com/josephernest/SamplerBox

SamplerBox is a sampler musical instrument based on RaspberryPi.

audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer

Last synced: 29 Mar 2025

https://github.com/sfluor/musig

A shazam like tool to store songs fingerprints and retrieve them

audio audio-processing digital-signal-processing go golang microphone musig shazam song

Last synced: 11 Oct 2025

https://github.com/wofwca/jumpcutter

⏩ Fast-forwards long pauses between sentences — watch lectures ~1.5x faster (browser extension)

agpl audio audio-processing browser-extension chrome-extension firefox-addon firefox-extension productivity video web-audio-api webextension youtube

Last synced: 16 May 2025

https://github.com/adobe-research/deepafx-st

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

adaptive-presets afx ai audio audio-processing audio-production compressor deeplearning drc effects eq music-production styletransfer

Last synced: 06 Apr 2025

https://github.com/Parisson/TimeSide

scalable audio processing framework and server written in Python

audio-processing python web

Last synced: 13 Mar 2025

https://github.com/scopeInfinity/Video2Description

Video to Text: Natural language description generator for some given video. [Video Captioning]

audio-processing cnn-keras deep-neural-networks image-captioning lstm-neural-networks video-captioning video-processing video-to-text

Last synced: 07 Apr 2025

https://github.com/busyyang/python_sound_open

语音信号处理试验教程,Python代码

audio-processing blog matlab python

Last synced: 06 Apr 2025

https://github.com/YuanGongND/whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

audio audio-classification audio-processing audio-tagging speech-recognition

Last synced: 01 Apr 2025

https://github.com/etienneab3d/whisperhallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

asr audio-processing noise-removal sound-processing text-to-speech vad vocals whisper

Last synced: 16 May 2025

https://github.com/fabiogra/moseca

A Streamilt web app for music source separation & karaoke

audio-processing demucs huggingface music-separation streamlit vocal-remover

Last synced: 06 Apr 2025

https://github.com/aetaric/checkrr

Checkrr Scans your library files for corrupt media and optionally replaces the files via sonarr and radarr

audio audio-processing ffprobe media radarr-api sonarr-api video video-processing

Last synced: 31 Mar 2025

https://github.com/Yuan-ManX/audio-development-tools

This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.

artificial-intelligence audio audio-generation audio-processing deep-learning dsp machine-learning music music-generation signal-processing speech speech-processing speech-synthesis

Last synced: 17 Mar 2025