An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with audio-processing

A curated list of projects in awesome lists tagged with audio-processing.

https://github.com/Deezer/spleeter

Deezer source separation library including pretrained models.

audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals

Last synced: 05 May 2025

https://github.com/deezer/spleeter

Deezer source separation library including pretrained models.

audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals

Last synced: 12 May 2025

https://github.com/tenacityteam/tenacity-legacy

THIS REPO IS NOT MAINTAINED ANYMORE. Please see https://codeberg.org/tenacityteam/tenacity for Tenacity, which is maintained.

audacity audio audio-applications audio-processing floss hacktoberfest libre privacy-friendly privacy-preserving recorder recording-app

Last synced: 27 Sep 2025

https://github.com/nvidia/dali

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch

Last synced: 13 May 2025

https://github.com/NVIDIA/DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch

Last synced: 15 Mar 2025

https://github.com/cgzirim/seek-tune

An implementation of Shazam's song recognition algorithm.

audio-fingerprinting audio-processing go golang not-shazam shazam song song-recognition-algorithm

Last synced: 17 Jul 2025

https://github.com/wyattblue/auto-editor

Auto-Editor: Efficient media analysis and rendering

audio audio-editing audio-processing automatic python3 video video-editing video-processing

Last synced: 16 Jan 2026

https://github.com/WyattBlue/auto-editor

Auto-Editor: Efficient media analysis and rendering

audio audio-editing audio-processing automatic python3 video video-editing video-processing

Last synced: 29 Mar 2025

https://github.com/stemrollerapp/stemroller

Isolate vocals, drums, bass, and other instrumental stems from any song

audio-processing bass deep-learning demucs drums electron javascript machine-learning python source-separation vocals

Last synced: 14 May 2025

https://github.com/scottlawsonbc/audio-reactive-led-strip

🎵 🌈 Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi

arduino audio-processing esp8266 music-visualizer python raspberry-pi signal-processing

Last synced: 15 May 2025

https://github.com/bitfieldaudio/OTTO

Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]

audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design

Last synced: 23 Apr 2025

https://github.com/bitfieldaudio/otto

Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]

audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design

Last synced: 15 May 2025

https://github.com/pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio audio-processing io machine-learning python pytorch speech

Last synced: 05 May 2025

https://github.com/blaizzy/mlx-audio

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

apple-silicon audio-processing mlx multimodal speech-recognition speech-synthesis speech-to-text text-to-speech transformers

Last synced: 23 Jan 2026

https://github.com/faiface/beep

A little package that brings sound to any Go application. Suitable for playback and audio-processing.

audio audio-playback audio-processing go golang

Last synced: 14 May 2025

https://github.com/julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

audio-processing recognition speech speech-recognition

Last synced: 14 Jun 2025

https://github.com/gauravbh1010tt/deeplearn

Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow and scikit-learn.

audio-processing computer-vision deep-learning nlp

Last synced: 15 May 2025

https://github.com/GauravBh1010tt/DeepLearn

Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow and scikit-learn.

audio-processing computer-vision deep-learning nlp

Last synced: 14 Mar 2025

https://github.com/kfrlib/kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

audio audio-processing avx avx512 clang cplusplus cplusplus-14 cplusplus-17 cpp14 cpp17 cxx dft digital-signal-processing discrete-fourier-transform dsp fast-fourier-transform fft header-only simd

Last synced: 14 May 2025

https://github.com/audiamus/AaxAudioConverter

Convert Audible aax files to mp3 and m4a/m4b

aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3

Last synced: 10 May 2025

https://github.com/audiamus/aaxaudioconverter

Convert Audible aax files to mp3 and m4a/m4b

aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3

Last synced: 01 Apr 2025

https://github.com/ledfx/ledfx

LedFx is a network-based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.

audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled

Last synced: 13 May 2025

https://github.com/LedFx/LedFx

LedFx is a network-based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.

audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled

Last synced: 09 Apr 2025

https://github.com/guitarml/smartguitaramp

Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.

audio-processing guitar juce machinelearning neuralnetworks

Last synced: 16 May 2025

https://github.com/flutydeer/audio-slicer

A simple GUI application that slices audio with silence detection

audio-processing gui pyside6 qt6

Last synced: 03 Oct 2025

https://github.com/GuitarML/SmartGuitarAmp

Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.

audio-processing guitar juce machinelearning neuralnetworks

Last synced: 30 Apr 2025

https://github.com/mikeroyal/pipewire-guide

PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.

alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst

Last synced: 16 May 2025

https://github.com/mikeroyal/PipeWire-Guide

PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.

alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst

Last synced: 09 May 2025

https://github.com/acoustid/chromaprint

C library for generating audio fingerprints used by AcoustID

acoustid audio audio-analysis audio-fingerprinting audio-processing chromaprint

Last synced: 17 Jan 2026

https://github.com/timschneeb/rootlessjamesdsp

An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices

android audio audio-processing convolution dsp effects equalizer non-root rootless

Last synced: 14 May 2025

https://github.com/funaudiollm/inspiremusic

InspireMusic: A Unified Framework for Music, Song, Audio Generation.

audio-generation audio-processing music-generation pytorch

Last synced: 15 May 2025

https://github.com/dbraun/dawdreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host

Last synced: 14 May 2025

https://github.com/DBraun/DawDreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host

Last synced: 16 Mar 2025

https://github.com/addictedcs/soundfingerprinting

Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.

acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam

Last synced: 23 Oct 2025

https://github.com/AddictedCS/soundfingerprinting

Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.

acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam

Last synced: 05 Apr 2025

https://github.com/timschneeb/RootlessJamesDSP

An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices

android audio audio-processing convolution dsp effects equalizer non-root rootless

Last synced: 08 Apr 2025

https://github.com/spotify/klio

Smarter data pipelines for audio.

audio-processing data-pipeline media-processing signal-processing

Last synced: 15 May 2025

https://github.com/f90/Wave-U-Net

Implementation of the Wave-U-Net for audio source separation

audio-processing deep-learning mit-license waveform-analysis

Last synced: 14 Jul 2025

https://github.com/x-lance/slam-llm

Speech, Language, Audio, Music Processing with Large Language Model

audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing

Last synced: 15 May 2025

https://github.com/Blaizzy/mlx-audio

A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.

apple-silicon audio-processing mlx multimodal speech-recognition speech-synthesis speech-to-text text-to-speech transformers

Last synced: 04 May 2025

https://github.com/rnchg/APT

AI Productivity Tool - Free and open source, improve user productivity, and protect privacy and data security. Including but not limited to: built-in local exclusive ChatGPT, DeepSeek, Phi, Qwen and other models, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 14 Aug 2025

https://github.com/rnchg/apt

AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 15 May 2025

https://github.com/rnchg/Apt

AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.

ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing

Last synced: 24 Mar 2025

https://github.com/X-LANCE/SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing

Last synced: 11 Sep 2025

https://github.com/open-mmlab/foleycrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley artist that adds vivid, synchronized sound effects to your silent videos 😝

aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio

Last synced: 04 Apr 2025

https://github.com/RelevanceAI/vectorhub

Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 27 Apr 2025

https://github.com/relevanceai/vectorhub

Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 04 Apr 2025

https://github.com/lagmoellertim/unsilence

Console Interface and Library to remove silent parts of a media file 🔈

audio-processing contributions-welcome hacktoberfest media python silence-speedup silencedetect video-processing

Last synced: 02 Jan 2026

https://github.com/open-mmlab/FoleyCrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley artist that adds vivid, synchronized sound effects to your silent videos 😝

aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio

Last synced: 30 Oct 2025

https://github.com/YuanGongND/ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

audio audio-processing deep-learning large-language-models speech-recognition

Last synced: 24 Jan 2026

https://github.com/josephernest/samplerbox

SamplerBox is a sampler musical instrument based on RaspberryPi.

audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer

Last synced: 13 Apr 2025

https://github.com/josephernest/SamplerBox

SamplerBox is a sampler musical instrument based on RaspberryPi.

audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer

Last synced: 29 Mar 2025

https://github.com/sfluor/musig

A Shazam-like tool to store song fingerprints and retrieve them

audio audio-processing digital-signal-processing go golang microphone musig shazam song

Last synced: 11 Oct 2025

https://github.com/wofwca/jumpcutter

⏩ Fast-forwards long pauses between sentences — watch lectures ~1.5x faster (browser extension)

agpl audio audio-processing browser-extension chrome-extension firefox-addon firefox-extension productivity video web-audio-api webextension youtube

Last synced: 16 May 2025

https://github.com/adobe-research/deepafx-st

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

adaptive-presets afx ai audio audio-processing audio-production compressor deeplearning drc effects eq music-production styletransfer

Last synced: 06 Apr 2025

https://github.com/alnitak/flutter_soloud

Flutter low-level audio plugin using SoLoud C++ library and FFI

audio audio-player audio-processing audio-visualizer dart-ffi flutter flutter-plugin miniaudio soloud

Last synced: 21 Jan 2026

https://github.com/Parisson/TimeSide

Scalable audio processing framework and server written in Python

audio-processing python web

Last synced: 13 Mar 2025

https://github.com/scopeInfinity/Video2Description

Video to Text: Natural language description generator for some given video. [Video Captioning]

audio-processing cnn-keras deep-neural-networks image-captioning lstm-neural-networks video-captioning video-processing video-to-text

Last synced: 07 Apr 2025

https://github.com/busyyang/python_sound_open

Speech signal processing experiment tutorial, with Python code

audio-processing blog matlab python

Last synced: 06 Apr 2025

https://github.com/etienneab3d/whisperhallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

asr audio-processing noise-removal sound-processing text-to-speech vad vocals whisper

Last synced: 16 May 2025

https://github.com/YuanGongND/whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

audio audio-classification audio-processing audio-tagging speech-recognition

Last synced: 01 Apr 2025

https://github.com/fabiogra/moseca

A Streamlit web app for music source separation & karaoke

audio-processing demucs huggingface music-separation streamlit vocal-remover

Last synced: 06 Apr 2025