Projects in Awesome Lists tagged with audio-processing
A curated list of projects in awesome lists tagged with audio-processing.
https://github.com/google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
android audio-processing c-plus-plus calculator computer-vision deep-learning framework graph-based graph-framework inference machine-learning mediapipe mobile-development perception pipeline-framework stream-processing video-processing
Last synced: 30 Jan 2026
https://google.github.io/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
android audio-processing c-plus-plus calculator computer-vision deep-learning framework graph-based graph-framework inference machine-learning mediapipe mobile-development perception pipeline-framework stream-processing video-processing
Last synced: 02 Apr 2025
https://github.com/Deezer/spleeter
Deezer source separation library including pretrained models.
audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals
Last synced: 05 May 2025
https://github.com/deezer/spleeter
Deezer source separation library including pretrained models.
audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals
Last synced: 12 May 2025
https://github.com/speechbrain/speechbrain
A PyTorch-based Speech Toolkit
asr audio audio-processing deep-learning huggingface language-model pytorch speaker-diarization speaker-recognition speaker-verification speech-enhancement speech-processing speech-recognition speech-separation speech-to-text speech-toolkit speechrecognition spoken-language-understanding transformers voice-recognition
Last synced: 13 May 2025
https://github.com/tenacityteam/tenacity-legacy
THIS REPO IS NOT MAINTAINED ANYMORE. Please see https://codeberg.org/tenacityteam/tenacity for Tenacity, which is maintained.
audacity audio audio-applications audio-processing floss hacktoberfest libre privacy-friendly privacy-preserving recorder recording-app
Last synced: 27 Sep 2025
https://github.com/spotify/pedalboard
🎛 🔊 A Python library for audio.
audio audio-processing audio-production audio-research audio-unit augmentation juce machine-learning pybind11 python vst3 vst3-host
Last synced: 16 Jan 2026
https://github.com/bitgapp/eqMac
macOS System-wide Audio Equalizer & Volume Mixer 🎧
angular audio audio-applications audio-effect audio-processing avaudioengine coreaudio eq equalizer hal macos osx swift volume-control volume-mixer
Last synced: 14 Mar 2025
https://github.com/bitgapp/eqmac
macOS System-wide Audio Equalizer & Volume Mixer 🎧
angular audio audio-applications audio-effect audio-processing avaudioengine coreaudio eq equalizer hal macos osx swift volume-control volume-mixer
Last synced: 14 May 2025
https://github.com/nvidia/dali
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch
Last synced: 13 May 2025
https://github.com/NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch
Last synced: 15 Mar 2025
https://github.com/cgzirim/seek-tune
An implementation of Shazam's song recognition algorithm.
audio-fingerprinting audio-processing go golang not-shazam shazam song song-recognition-algorithm
Last synced: 17 Jul 2025
https://github.com/wyattblue/auto-editor
Auto-Editor: Efficient media analysis and rendering
audio audio-editing audio-processing automatic python3 video video-editing video-processing
Last synced: 16 Jan 2026
https://github.com/WyattBlue/auto-editor
Auto-Editor: Efficient media analysis and rendering
audio audio-editing audio-processing automatic python3 video video-editing video-processing
Last synced: 29 Mar 2025
https://github.com/libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
audio audio-analysis audio-features audio-processing deep-learning machine-learning mfcc mir music music-analysis music-information-retrieval pitch python signal-processing spectral-analysis spectrogram time-frequency-analysis wavelet-analysis wavelet-transform
Last synced: 13 Mar 2025
https://github.com/libaudioflux/audioflux
A library for audio and music analysis, feature extraction.
audio audio-analysis audio-features audio-processing deep-learning machine-learning mfcc mir music music-analysis music-information-retrieval pitch python signal-processing spectral-analysis spectrogram time-frequency-analysis wavelet-analysis wavelet-transform
Last synced: 14 May 2025
https://github.com/stemrollerapp/stemroller
Isolate vocals, drums, bass, and other instrumental stems from any song
audio-processing bass deep-learning demucs drums electron javascript machine-learning python source-separation vocals
Last synced: 14 May 2025
https://github.com/scottlawsonbc/audio-reactive-led-strip
:musical_note: :rainbow: Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi
arduino audio-processing esp8266 music-visualizer python raspberry-pi signal-processing
Last synced: 15 May 2025
https://github.com/bitfieldaudio/OTTO
Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]
audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design
Last synced: 23 Apr 2025
https://github.com/bitfieldaudio/otto
Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]
audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design
Last synced: 15 May 2025
https://github.com/pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
audio audio-processing io machine-learning python pytorch speech
Last synced: 05 May 2025
https://github.com/blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
apple-silicon audio-processing mlx multimodal speech-recognition speech-synthesis speech-to-text text-to-speech transformers
Last synced: 23 Jan 2026
https://github.com/axinc-ai/ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
action-recognition anomaly-detection audio-processing background-removal crowd-counting deep-learning embeddings face-detection face-recognition fashion-ai gan hand-detection image-classification image-segmentation llm neural-network object-detection object-recognition object-tracking pose-estimation
Last synced: 13 May 2025
https://github.com/faiface/beep
A little package that brings sound to any Go application. Suitable for playback and audio-processing.
audio audio-playback audio-processing go golang
Last synced: 14 May 2025
https://github.com/julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
audio-processing recognition speech speech-recognition
Last synced: 14 Jun 2025
https://github.com/gauravbh1010tt/deeplearn
Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow, and scikit-learn.
audio-processing computer-vision deep-learning nlp
Last synced: 15 May 2025
https://github.com/GauravBh1010tt/DeepLearn
Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow, and scikit-learn.
audio-processing computer-vision deep-learning nlp
Last synced: 14 Mar 2025
https://github.com/monocasual/giada
Your Hardcore Loop Machine.
audio audio-processing audio-production beatmaking cpp20 daw drum-machine giada giadaloopmachine hardcore-loopmachine juce linux loop-machine macos midi midi-device music music-composition vst3 windows
Last synced: 14 May 2025
https://github.com/kfrlib/kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
audio audio-processing avx avx512 clang cplusplus cplusplus-14 cplusplus-17 cpp14 cpp17 cxx dft digital-signal-processing discrete-fourier-transform dsp fast-fourier-transform fft header-only simd
Last synced: 14 May 2025
https://github.com/letoram/arcan
Arcan - [Display Server, Multimedia Framework, Game Engine] -> "Desktop Engine"
audio-processing c desktop-environment display-server freebsd game-engine linux lua multimedia-graphic-library openbsd video-processing virtual-reality visualization wayland
Last synced: 14 May 2025
https://github.com/audiamus/AaxAudioConverter
Convert Audible aax files to mp3 and m4a/m4b
aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3
Last synced: 10 May 2025
https://github.com/audiamus/aaxaudioconverter
Convert Audible aax files to mp3 and m4a/m4b
aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3
Last synced: 01 Apr 2025
https://github.com/mltframework/mlt
MLT Multimedia Framework
audio audio-processing c c-plus-plus ffmpeg framework frei0r ladspa multimedia opengl qt sdl2 video video-processing
Last synced: 14 May 2025
https://github.com/ledfx/ledfx
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled
Last synced: 13 May 2025
https://github.com/LedFx/LedFx
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled
Last synced: 09 Apr 2025
https://github.com/cycfi/Q
C++ Library for Audio Digital Signal Processing
audio audio-processing c-plus-plus cpp cpp-library cpp20 dsp dsp-library effects frequency function-composition guitar-processor modern-cpp music pitch-detection pitch-tracking synth
Last synced: 11 May 2025
https://github.com/guitarml/smartguitaramp
Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.
audio-processing guitar juce machinelearning neuralnetworks
Last synced: 16 May 2025
https://github.com/cycfi/q
C++ Library for Audio Digital Signal Processing
audio audio-processing c-plus-plus cpp cpp-library cpp20 dsp dsp-library effects frequency function-composition guitar-processor modern-cpp music pitch-detection pitch-tracking synth
Last synced: 14 May 2025
https://github.com/tracktion/tracktion_engine
Tracktion Engine module
audio audio-processing c-plus-plus cpp daw framework juce
Last synced: 13 Apr 2025
https://github.com/flutydeer/audio-slicer
A simple GUI application that slices audio with silence detection
audio-processing gui pyside6 qt6
Last synced: 03 Oct 2025
https://github.com/GuitarML/SmartGuitarAmp
Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.
audio-processing guitar juce machinelearning neuralnetworks
Last synced: 30 Apr 2025
https://github.com/mikeroyal/pipewire-guide
PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst
Last synced: 16 May 2025
https://github.com/unosquare/ffmediaelement
FFME: The Advanced WPF MediaElement (based on FFmpeg)
audio audio-processing codec dotnet dotnet-framework ffmpeg ffmpeg-binaries ffplay h264 macos media-playback mediaelement mp3 mp4 mpeg video volume wpf xamarin
Last synced: 14 May 2025
https://github.com/r3gm/sonitranslate
Synchronized Translation for Videos. Video dubbing
asr audio-processing automatic-dubbing diarization document-translator dubbing speech-to-text stt subtitle-to-speech text-to-speech translate-audio translate-video translation tts video-dubbing
Last synced: 12 Oct 2025
https://github.com/Tracktion/tracktion_engine
Tracktion Engine module
audio audio-processing c-plus-plus cpp daw framework juce
Last synced: 08 Apr 2025
https://github.com/bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
audio audio-processing bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university
Last synced: 14 Apr 2025
https://github.com/bytedance/salmonn
SALMONN: Speech Audio Language Music Open Neural Network
audio audio-processing bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university
Last synced: 13 Apr 2025
https://github.com/mikeroyal/PipeWire-Guide
PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst
Last synced: 09 May 2025
https://github.com/acoustid/chromaprint
C library for generating audio fingerprints used by AcoustID
acoustid audio audio-analysis audio-fingerprinting audio-processing chromaprint
Last synced: 17 Jan 2026
https://github.com/mravanelli/sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
artificial-intelligence asr audio audio-processing cnn convolutional-neural-networks deep-learning digital-signal-processing filtering neural-networks python pytorch signal-processing speaker-identification speaker-recognition speaker-verification speech-processing speech-recognition timit waveform
Last synced: 16 May 2025
https://github.com/timschneeb/rootlessjamesdsp
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
android audio audio-processing convolution dsp effects equalizer non-root rootless
Last synced: 14 May 2025
https://github.com/mravanelli/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
artificial-intelligence asr audio audio-processing cnn convolutional-neural-networks deep-learning digital-signal-processing filtering neural-networks python pytorch signal-processing speaker-identification speaker-recognition speaker-verification speech-processing speech-recognition timit waveform
Last synced: 26 Apr 2025
https://github.com/midas-research/audino
Open source audio annotation tool for humans
annotation-tool audio-annotation audio-processing datasets machine-learning python speech-processing
Last synced: 13 Apr 2025
https://github.com/funaudiollm/inspiremusic
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
audio-generation audio-processing music-generation pytorch
Last synced: 15 May 2025
https://github.com/KinWaiCheuk/nnAudio
Audio processing using PyTorch 1D convolution networks
1d-convolution audio-processing cqt-spectrogram melspectrogram neural-network preprocessing pytorch spectrogram spectrogram-conversion-toolbox stft
Last synced: 14 Jul 2025
https://github.com/ictnlp/streamspeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
all-in-one asr audio-processing machine-translation non-autoregressive seamless simultaneous-translation speech speech-enhancement speech-processing speech-recognition speech-synthesis speech-to-text speech-translation streaming-audio text-to-audio text-to-speech translation tts voice
Last synced: 16 May 2025
https://github.com/kinwaicheuk/nnaudio
Audio processing using PyTorch 1D convolution networks
1d-convolution audio-processing cqt-spectrogram melspectrogram neural-network preprocessing pytorch spectrogram spectrogram-conversion-toolbox stft
Last synced: 15 May 2025
https://github.com/dbraun/dawdreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors
ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host
Last synced: 14 May 2025
https://github.com/DBraun/DawDreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors
ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host
Last synced: 16 Mar 2025
https://github.com/addictedcs/soundfingerprinting
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam
Last synced: 23 Oct 2025
https://github.com/AddictedCS/soundfingerprinting
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam
Last synced: 05 Apr 2025
https://github.com/timschneeb/RootlessJamesDSP
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
android audio audio-processing convolution dsp effects equalizer non-root rootless
Last synced: 08 Apr 2025
https://github.com/spotify/klio
Smarter data pipelines for audio.
audio-processing data-pipeline media-processing signal-processing
Last synced: 15 May 2025
https://github.com/vadymmarkov/beethoven
:guitar: A maestro of pitch detection.
audio audio-processing ios pitch-detection pitch-engine pitch-estimation swift tuner
Last synced: 16 May 2025
https://github.com/vadymmarkov/Beethoven
:guitar: A maestro of pitch detection.
audio audio-processing ios pitch-detection pitch-engine pitch-estimation swift tuner
Last synced: 06 Aug 2025
https://github.com/f90/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
audio-processing deep-learning mit-license waveform-analysis
Last synced: 14 Jul 2025
https://github.com/x-lance/slam-llm
Speech, Language, Audio, Music Processing with Large Language Model
audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing
Last synced: 15 May 2025
https://github.com/goxr3plus/xr3player
🎧 🎼 The MOST ADVANCED JavaFX Media Player
audio-formats audio-player audio-processing audio-recorder audio-visualizer dropbox-client java-speech java-stream-player javafx mp3 spectrum-analyzer speech stream-player web-browser
Last synced: 14 Apr 2025
https://github.com/Blaizzy/mlx-audio
A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.
apple-silicon audio-processing mlx multimodal speech-recognition speech-synthesis speech-to-text text-to-speech transformers
Last synced: 04 May 2025
https://github.com/rnchg/APT
AI Productivity Tool - Free and open source, improve user productivity, and protect privacy and data security. Including but not limited to: built-in local exclusive ChatGPT, DeepSeek, Phi, Qwen and other models, one-click batch intelligent processing of pictures, videos, audio, etc.
ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing
Last synced: 14 Aug 2025
https://github.com/rnchg/apt
AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.
ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing
Last synced: 15 May 2025
https://github.com/rnchg/Apt
AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.
ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing
Last synced: 24 Mar 2025
https://github.com/X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing
Last synced: 11 Sep 2025
https://github.com/omeryusufyagci/fast-music-remover
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
audio-cleaner audio-enhancement audio-extractor audio-processing cpp deepfilternet ffmpeg flask machine-learning media-editor media-processing music-remover noise-removal processing realtime speech-extractor vocal-extractor youtube yt-dlp
Last synced: 15 May 2025
https://github.com/open-mmlab/foleycrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝
aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio
Last synced: 04 Apr 2025
https://github.com/RelevanceAI/vectorhub
Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)
artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec
Last synced: 27 Apr 2025
https://github.com/relevanceai/vectorhub
Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)
artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec
Last synced: 04 Apr 2025
https://github.com/lagmoellertim/unsilence
Console Interface and Library to remove silent parts of a media file 🔈
audio-processing contributions-welcome hacktoberfest media python silence-speedup silencedetect video-processing
Last synced: 02 Jan 2026
https://github.com/open-mmlab/FoleyCrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝
aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio
Last synced: 30 Oct 2025
https://github.com/YuanGongND/ltu
Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".
audio audio-processing deep-learning large-language-models speech-recognition
Last synced: 24 Jan 2026
https://github.com/josephernest/samplerbox
SamplerBox is a sampler musical instrument based on RaspberryPi.
audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer
Last synced: 13 Apr 2025
https://github.com/josephernest/SamplerBox
SamplerBox is a sampler musical instrument based on RaspberryPi.
audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer
Last synced: 29 Mar 2025
https://github.com/sfluor/musig
A Shazam-like tool to store song fingerprints and retrieve them
audio audio-processing digital-signal-processing go golang microphone musig shazam song
Last synced: 11 Oct 2025
https://github.com/novoic/surfboard
Novoic's audio feature extraction library
alzheimers-disease audio audio-processing feature-extraction healthcare machine-learning parkinsons-disease python signal-processing speech-processing
Last synced: 03 Apr 2025
https://github.com/opencodewin/MediaEditor
Non-linear editing software that helps you make nice videos.
audio audio-mixing audio-processing filter imgui media-decode media-encode non-linear-editing subtitle-editing video video-editor video-effects video-processing vulkan-shader
Last synced: 05 Apr 2025
https://github.com/marcogdepinto/emotion-classification-from-audio-files
Understanding emotions from audio files using neural networks and multiple datasets.
audio audio-processing classification-report datascience deep-learning deep-neural-networks emotion emotion-classification-ravdess keras keras-neural-networks librosa livingstone machine-learning python python3 ravdess-dataset song songs speech tensorflow
Last synced: 05 Apr 2025
https://github.com/marcogdepinto/Emotion-Classification-Ravdess
Understanding emotions from audio files using neural networks and multiple datasets.
audio audio-processing classification-report datascience deep-learning deep-neural-networks emotion emotion-classification-ravdess keras keras-neural-networks librosa livingstone machine-learning python python3 ravdess-dataset song songs speech tensorflow
Last synced: 12 Mar 2025
https://github.com/wofwca/jumpcutter
⏩ Fast-forwards long pauses between sentences — watch lectures ~1.5x faster (browser extension)
agpl audio audio-processing browser-extension chrome-extension firefox-addon firefox-extension productivity video web-audio-api webextension youtube
Last synced: 16 May 2025
https://github.com/adobe-research/deepafx-st
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
adaptive-presets afx ai audio audio-processing audio-production compressor deeplearning drc effects eq music-production styletransfer
Last synced: 06 Apr 2025
https://github.com/justinsalamon/scaper
A library for soundscape synthesis and augmentation
audio audio-processing data-augmentation machine-learning machine-listening soundscape soundscape-synthesis sox synthesis
Last synced: 04 Apr 2025
https://github.com/alnitak/flutter_soloud
Flutter low-level audio plugin using SoLoud C++ library and FFI
audio audio-player audio-processing audio-visualizer dart-ffi flutter flutter-plugin miniaudio soloud
Last synced: 21 Jan 2026
https://github.com/Parisson/TimeSide
Scalable audio processing framework and server written in Python
Last synced: 13 Mar 2025
https://github.com/gkonovalov/android-vad
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
android audio-processing deep-neural-networks dnn gmm neural-networks offline on-device-ai onnx-models real-time silero silero-vad speech-detection speech-recoginition vad voice-activity-detection voice-activity-detector voice-detection webrtc yamnet
Last synced: 16 May 2025
https://github.com/scopeInfinity/Video2Description
Video to Text: Natural language description generator for some given video. [Video Captioning]
audio-processing cnn-keras deep-neural-networks image-captioning lstm-neural-networks video-captioning video-processing video-to-text
Last synced: 07 Apr 2025
https://github.com/Carleslc/AudioToText
Transcribe and translate audio to text using Whisper and DeepL.
audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api
Last synced: 13 Apr 2025
https://github.com/busyyang/python_sound_open
Speech signal processing experiment tutorial, with Python code
audio-processing blog matlab python
Last synced: 06 Apr 2025
https://github.com/carleslc/audiototext
Transcribe and translate audio to text using Whisper and DeepL.
audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api
Last synced: 06 Apr 2025
https://github.com/etienneab3d/whisperhallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
asr audio-processing noise-removal sound-processing text-to-speech vad vocals whisper
Last synced: 16 May 2025
https://github.com/YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
audio audio-classification audio-processing audio-tagging speech-recognition
Last synced: 01 Apr 2025
https://github.com/fabiogra/moseca
A Streamlit web app for music source separation & karaoke
audio-processing demucs huggingface music-separation streamlit vocal-remover
Last synced: 06 Apr 2025