Projects in Awesome Lists tagged with audio-processing
A curated list of projects in awesome lists tagged with audio-processing .
https://github.com/google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
android audio-processing c-plus-plus calculator computer-vision deep-learning framework graph-based graph-framework inference machine-learning mediapipe mobile-development perception pipeline-framework stream-processing video-processing
Last synced: 12 May 2025
https://google.github.io/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
android audio-processing c-plus-plus calculator computer-vision deep-learning framework graph-based graph-framework inference machine-learning mediapipe mobile-development perception pipeline-framework stream-processing video-processing
Last synced: 02 Apr 2025
https://github.com/deezer/spleeter
Deezer source separation library including pretrained models.
audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals
Last synced: 12 May 2025
https://github.com/Deezer/spleeter
Deezer source separation library including pretrained models.
audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals
Last synced: 05 May 2025
https://github.com/speechbrain/speechbrain
A PyTorch-based Speech Toolkit
asr audio audio-processing deep-learning huggingface language-model pytorch speaker-diarization speaker-recognition speaker-verification speech-enhancement speech-processing speech-recognition speech-separation speech-to-text speech-toolkit speechrecognition spoken-language-understanding transformers voice-recognition
Last synced: 13 May 2025
https://github.com/tenacityteam/tenacity-legacy
THIS REPO IS NOT MAINTAINED ANYMORE. Please see https://codeberg.org/tenacityteam/tenacity for Tenacity, which is maintained.
audacity audio audio-applications audio-processing floss hacktoberfest libre privacy-friendly privacy-preserving recorder recording-app
Last synced: 27 Sep 2025
https://github.com/bitgapp/eqMac
macOS System-wide Audio Equalizer & Volume Mixer 🎧
angular audio audio-applications audio-effect audio-processing avaudioengine coreaudio eq equalizer hal macos osx swift volume-control volume-mixer
Last synced: 14 Mar 2025
https://github.com/bitgapp/eqmac
macOS System-wide Audio Equalizer & Volume Mixer 🎧
angular audio audio-applications audio-effect audio-processing avaudioengine coreaudio eq equalizer hal macos osx swift volume-control volume-mixer
Last synced: 14 May 2025
https://github.com/spotify/pedalboard
🎛 🔊 A Python library for audio.
audio audio-processing audio-production audio-research audio-unit augmentation juce machine-learning pybind11 python vst3 vst3-host
Last synced: 14 May 2025
https://github.com/nvidia/dali
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch
Last synced: 13 May 2025
https://github.com/NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch
Last synced: 15 Mar 2025
https://github.com/cgzirim/seek-tune
An implementation of Shazam's song recognition algorithm.
audio-fingerprinting audio-processing go golang not-shazam shazam song song-recognition-algorithm
Last synced: 17 Jul 2025
https://github.com/wyattblue/auto-editor
Auto-Editor: Efficient media analysis and rendering
audio audio-editing audio-processing automatic python3 video video-editing video-processing
Last synced: 12 May 2025
https://github.com/WyattBlue/auto-editor
Auto-Editor: Efficient media analysis and rendering
audio audio-editing audio-processing automatic python3 video video-editing video-processing
Last synced: 29 Mar 2025
https://github.com/libAudioFlux/audioFlux
A library for audio and music analysis, feature extraction.
audio audio-analysis audio-features audio-processing deep-learning machine-learning mfcc mir music music-analysis music-information-retrieval pitch python signal-processing spectral-analysis spectrogram time-frequency-analysis wavelet-analysis wavelet-transform
Last synced: 13 Mar 2025
https://github.com/libaudioflux/audioflux
A library for audio and music analysis, feature extraction.
audio audio-analysis audio-features audio-processing deep-learning machine-learning mfcc mir music music-analysis music-information-retrieval pitch python signal-processing spectral-analysis spectrogram time-frequency-analysis wavelet-analysis wavelet-transform
Last synced: 14 May 2025
https://github.com/stemrollerapp/stemroller
Isolate vocals, drums, bass, and other instrumental stems from any song
audio-processing bass deep-learning demucs drums electron javascript machine-learning python source-separation vocals
Last synced: 14 May 2025
https://github.com/scottlawsonbc/audio-reactive-led-strip
:musical_note: :rainbow: Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi
arduino audio-processing esp8266 music-visualizer python raspberry-pi signal-processing
Last synced: 15 May 2025
https://github.com/bitfieldaudio/OTTO
Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]
audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design
Last synced: 23 Apr 2025
https://github.com/bitfieldaudio/otto
Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]
audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design
Last synced: 15 May 2025
https://github.com/pytorch/audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
audio audio-processing io machine-learning python pytorch speech
Last synced: 05 May 2025
https://github.com/blaizzy/mlx-audio
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
apple-silicon audio-processing mlx multimodal speech-recognition speech-synthesis speech-to-text text-to-speech transformers
Last synced: 29 Jun 2025
https://github.com/axinc-ai/ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
action-recognition anomaly-detection audio-processing background-removal crowd-counting deep-learning embeddings face-detection face-recognition fashion-ai gan hand-detection image-classification image-segmentation llm neural-network object-detection object-recognition object-tracking pose-estimation
Last synced: 13 May 2025
https://github.com/faiface/beep
A little package that brings sound to any Go application. Suitable for playback and audio-processing.
audio audio-playback audio-processing go golang
Last synced: 14 May 2025
https://github.com/julius-speech/julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
audio-processing recognition speech speech-recognition
Last synced: 14 Jun 2025
https://github.com/gauravbh1010tt/deeplearn
Implementation of research papers on Deep Learning+ NLP+ CV in Python using Keras, Tensorflow and Scikit Learn.
audio-processing computer-vision deep-learning nlp
Last synced: 15 May 2025
https://github.com/GauravBh1010tt/DeepLearn
Implementation of research papers on Deep Learning+ NLP+ CV in Python using Keras, Tensorflow and Scikit Learn.
audio-processing computer-vision deep-learning nlp
Last synced: 14 Mar 2025
https://github.com/monocasual/giada
Your Hardcore Loop Machine.
audio audio-processing audio-production beatmaking cpp20 daw drum-machine giada giadaloopmachine hardcore-loopmachine juce linux loop-machine macos midi midi-device music music-composition vst3 windows
Last synced: 14 May 2025
https://github.com/kfrlib/kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
audio audio-processing avx avx512 clang cplusplus cplusplus-14 cplusplus-17 cpp14 cpp17 cxx dft digital-signal-processing discrete-fourier-transform dsp fast-fourier-transform fft header-only simd
Last synced: 14 May 2025
https://github.com/letoram/arcan
Arcan - [Display Server, Multimedia Framework, Game Engine] -> "Desktop Engine"
audio-processing c desktop-environment display-server freebsd game-engine linux lua multimedia-graphic-library openbsd video-processing virtual-reality visualization wayland
Last synced: 14 May 2025
https://github.com/audiamus/AaxAudioConverter
Convert Audible aax files to mp3 and m4a/m4b
aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3
Last synced: 10 May 2025
https://github.com/audiamus/aaxaudioconverter
Convert Audible aax files to mp3 and m4a/m4b
aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3
Last synced: 01 Apr 2025
https://github.com/mltframework/mlt
MLT Multimedia Framework
audio audio-processing c c-plus-plus ffmpeg framework frei0r ladspa multimedia opengl qt sdl2 video video-processing
Last synced: 14 May 2025
https://github.com/ledfx/ledfx
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled
Last synced: 13 May 2025
https://github.com/LedFx/LedFx
LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.
audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled
Last synced: 09 Apr 2025
https://github.com/cycfi/Q
C++ Library for Audio Digital Signal Processing
audio audio-processing c-plus-plus cpp cpp-library cpp20 dsp dsp-library effects frequency function-composition guitar-processor modern-cpp music pitch-detection pitch-tracking synth
Last synced: 11 May 2025
https://github.com/guitarml/smartguitaramp
Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.
audio-processing guitar juce machinelearning neuralnetworks
Last synced: 16 May 2025
https://github.com/cycfi/q
C++ Library for Audio Digital Signal Processing
audio audio-processing c-plus-plus cpp cpp-library cpp20 dsp dsp-library effects frequency function-composition guitar-processor modern-cpp music pitch-detection pitch-tracking synth
Last synced: 14 May 2025
https://github.com/tracktion/tracktion_engine
Tracktion Engine module
audio audio-processing c-plus-plus cpp daw framework juce
Last synced: 13 Apr 2025
https://github.com/flutydeer/audio-slicer
A simple GUI application that slices audio with silence detection
audio-processing gui pyside6 qt6
Last synced: 03 Oct 2025
https://github.com/GuitarML/SmartGuitarAmp
Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.
audio-processing guitar juce machinelearning neuralnetworks
Last synced: 30 Apr 2025
https://github.com/mikeroyal/pipewire-guide
PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst
Last synced: 16 May 2025
https://github.com/unosquare/ffmediaelement
FFME: The Advanced WPF MediaElement (based on FFmpeg)
audio audio-processing codec dotnet dotnet-framework ffmpeg ffmpeg-binaries ffplay h264 macos media-playback mediaelement mp3 mp4 mpeg video volume wpf xamarin
Last synced: 14 May 2025
https://github.com/r3gm/sonitranslate
Synchronized Translation for Videos. Video dubbing
asr audio-processing automatic-dubbing diarization document-translator dubbing speech-to-text stt subtitle-to-speech text-to-speech translate-audio translate-video translation tts video-dubbing
Last synced: 12 Oct 2025
https://github.com/Tracktion/tracktion_engine
Tracktion Engine module
audio audio-processing c-plus-plus cpp daw framework juce
Last synced: 08 Apr 2025
https://github.com/bytedance/SALMONN
SALMONN: Speech Audio Language Music Open Neural Network
audio audio-processing bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university
Last synced: 14 Apr 2025
https://github.com/bytedance/salmonn
SALMONN: Speech Audio Language Music Open Neural Network
audio audio-processing bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university
Last synced: 13 Apr 2025
https://github.com/mikeroyal/PipeWire-Guide
PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.
alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst
Last synced: 09 May 2025
https://github.com/mravanelli/sincnet
SincNet is a neural architecture for efficiently processing raw audio samples.
artificial-intelligence asr audio audio-processing cnn convolutional-neural-networks deep-learning digital-signal-processing filtering neural-networks python pytorch signal-processing speaker-identification speaker-recognition speaker-verification speech-processing speech-recognition timit waveform
Last synced: 16 May 2025
https://github.com/timschneeb/rootlessjamesdsp
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
android audio audio-processing convolution dsp effects equalizer non-root rootless
Last synced: 14 May 2025
https://github.com/mravanelli/SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
artificial-intelligence asr audio audio-processing cnn convolutional-neural-networks deep-learning digital-signal-processing filtering neural-networks python pytorch signal-processing speaker-identification speaker-recognition speaker-verification speech-processing speech-recognition timit waveform
Last synced: 26 Apr 2025
https://github.com/midas-research/audino
Open source audio annotation tool for humans
annotation-tool audio-annotation audio-processing datasets machine-learning python speech-processing
Last synced: 13 Apr 2025
https://github.com/funaudiollm/inspiremusic
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
audio-generation audio-processing music-generation pytorch
Last synced: 15 May 2025
https://github.com/KinWaiCheuk/nnAudio
Audio processing by using pytorch 1D convolution network
1d-convolution audio-processing cqt-spectrogram melspectrogram neural-network preprocessing pytorch spectrogram spectrogram-conversion-toolbox stft
Last synced: 14 Jul 2025
https://github.com/ictnlp/streamspeech
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
all-in-one asr audio-processing machine-translation non-autoregressive seamless simultaneous-translation speech speech-enhancement speech-processing speech-recognition speech-synthesis speech-to-text speech-translation streaming-audio text-to-audio text-to-speech translation tts voice
Last synced: 16 May 2025
https://github.com/kinwaicheuk/nnaudio
Audio processing by using pytorch 1D convolution network
1d-convolution audio-processing cqt-spectrogram melspectrogram neural-network preprocessing pytorch spectrogram spectrogram-conversion-toolbox stft
Last synced: 15 May 2025
https://github.com/dbraun/dawdreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors
ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host
Last synced: 14 May 2025
https://github.com/DBraun/DawDreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors
ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host
Last synced: 16 Mar 2025
https://github.com/addictedcs/soundfingerprinting
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam
Last synced: 23 Oct 2025
https://github.com/AddictedCS/soundfingerprinting
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam
Last synced: 05 Apr 2025
https://github.com/acoustid/chromaprint
C library for generating audio fingerprints used by AcoustID
acoustid audio audio-analysis audio-fingerprinting audio-processing chromaprint
Last synced: 15 Mar 2025
https://github.com/timschneeb/RootlessJamesDSP
An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices
android audio audio-processing convolution dsp effects equalizer non-root rootless
Last synced: 08 Apr 2025
https://github.com/spotify/klio
Smarter data pipelines for audio.
audio-processing data-pipeline media-processing signal-processing
Last synced: 15 May 2025
https://github.com/vadymmarkov/beethoven
:guitar: A maestro of pitch detection.
audio audio-processing ios pitch-detection pitch-engine pitch-estimation swift tuner
Last synced: 16 May 2025
https://github.com/vadymmarkov/Beethoven
:guitar: A maestro of pitch detection.
audio audio-processing ios pitch-detection pitch-engine pitch-estimation swift tuner
Last synced: 06 Aug 2025
https://github.com/f90/Wave-U-Net
Implementation of the Wave-U-Net for audio source separation
audio-processing deep-learning mit-license waveform-analysis
Last synced: 14 Jul 2025
https://github.com/x-lance/slam-llm
Speech, Language, Audio, Music Processing with Large Language Model
audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing
Last synced: 15 May 2025
https://github.com/goxr3plus/xr3player
🎧 🎼 The MOST ADVANCED JavaFX Media Player
audio-formats audio-player audio-processing audio-recorder audio-visualizer dropbox-client java-speech java-stream-player javafx mp3 spectrum-analyzer speech stream-player web-browser
Last synced: 14 Apr 2025
https://github.com/Blaizzy/mlx-audio
A text-to-speech (TTS) and Speech-to-Speech (STS) library built on Apple's MLX framework, providing efficient speech synthesis on Apple Silicon.
apple-silicon audio-processing mlx multimodal speech-recognition speech-synthesis speech-to-text text-to-speech transformers
Last synced: 04 May 2025
https://github.com/rnchg/APT
AI Productivity Tool - Free and open source, improve user productivity, and protect privacy and data security. Including but not limited to: built-in local exclusive ChatGPT, DeepSeek, Phi, Qwen and other models, one-click batch intelligent processing of pictures, videos, audio, etc.
ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing
Last synced: 14 Aug 2025
https://github.com/rnchg/apt
AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.
ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing
Last synced: 15 May 2025
https://github.com/rnchg/Apt
AI Productivity Tool - Free and open source, improve user productivity, protect privacy and data security. Provide efficient and convenient AI solutions, built-in local exclusive ChatGPT, Phi, DeepSeek, one-click batch intelligent processing of pictures, videos, audio, etc.
ai ai-framework aigc audio-processing chatgpt computer-vision deep-learning deepseek generative-ai image-processing inference llm machine-learning machinelearning neural-network onnx onnxruntime video-processing
Last synced: 24 Mar 2025
https://github.com/X-LANCE/SLAM-LLM
Speech, Language, Audio, Music Processing with Large Language Model
audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing
Last synced: 11 Sep 2025
https://github.com/omeryusufyagci/fast-music-remover
A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.
audio-cleaner audio-enhancement audio-extractor audio-processing cpp deepfilternet ffmpeg flask machine-learning media-editor media-processing music-remover noise-removal processing realtime speech-extractor vocal-extractor youtube yt-dlp
Last synced: 15 May 2025
https://github.com/open-mmlab/foleycrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio
Last synced: 04 Apr 2025
https://github.com/RelevanceAI/vectorhub
Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)
artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec
Last synced: 27 Apr 2025
https://github.com/relevanceai/vectorhub
Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc)
artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec
Last synced: 04 Apr 2025
https://github.com/lagmoellertim/unsilence
Console Interface and Library to remove silent parts of a media file 🔈
audio-processing contributions-welcome hacktoberfest media python silence-speedup silencedetect video-processing
Last synced: 12 Jul 2025
https://github.com/open-mmlab/FoleyCrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio
Last synced: 30 Oct 2025
https://github.com/josephernest/samplerbox
SamplerBox is a sampler musical instrument based on RaspberryPi.
audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer
Last synced: 13 Apr 2025
https://github.com/josephernest/SamplerBox
SamplerBox is a sampler musical instrument based on RaspberryPi.
audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer
Last synced: 29 Mar 2025
https://github.com/sfluor/musig
A shazam like tool to store songs fingerprints and retrieve them
audio audio-processing digital-signal-processing go golang microphone musig shazam song
Last synced: 11 Oct 2025
https://github.com/novoic/surfboard
Novoic's audio feature extraction library
alzheimers-disease audio audio-processing feature-extraction healthcare machine-learning parkinsons-disease python signal-processing speech-processing
Last synced: 03 Apr 2025
https://github.com/opencodewin/MediaEditor
A non-linear editing software that helps you to make nice video.
audio audio-mixing audio-processing filter imgui media-decode media-encode non-linear-editing subtitle-editing video video-editor video-effects video-processing vulkan-shader
Last synced: 05 Apr 2025
https://github.com/marcogdepinto/emotion-classification-from-audio-files
Understanding emotions from audio files using neural networks and multiple datasets.
audio audio-processing classification-report datascience deep-learning deep-neural-networks emotion emotion-classification-ravdess keras keras-neural-networks librosa livingstone machine-learning python python3 ravdess-dataset song songs speech tensorflow
Last synced: 05 Apr 2025
https://github.com/marcogdepinto/Emotion-Classification-Ravdess
Understanding emotions from audio files using neural networks and multiple datasets.
audio audio-processing classification-report datascience deep-learning deep-neural-networks emotion emotion-classification-ravdess keras keras-neural-networks librosa livingstone machine-learning python python3 ravdess-dataset song songs speech tensorflow
Last synced: 12 Mar 2025
https://github.com/wofwca/jumpcutter
⏩ Fast-forwards long pauses between sentences — watch lectures ~1.5x faster (browser extension)
agpl audio audio-processing browser-extension chrome-extension firefox-addon firefox-extension productivity video web-audio-api webextension youtube
Last synced: 16 May 2025
https://github.com/adobe-research/deepafx-st
DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/
adaptive-presets afx ai audio audio-processing audio-production compressor deeplearning drc effects eq music-production styletransfer
Last synced: 06 Apr 2025
https://github.com/justinsalamon/scaper
A library for soundscape synthesis and augmentation
audio audio-processing data-augmentation machine-learning machine-listening soundscape soundscape-synthesis sox synthesis
Last synced: 04 Apr 2025
https://github.com/Parisson/TimeSide
scalable audio processing framework and server written in Python
Last synced: 13 Mar 2025
https://github.com/scopeInfinity/Video2Description
Video to Text: Natural language description generator for some given video. [Video Captioning]
audio-processing cnn-keras deep-neural-networks image-captioning lstm-neural-networks video-captioning video-processing video-to-text
Last synced: 07 Apr 2025
https://github.com/Carleslc/AudioToText
Transcribe and translate audio to text using Whisper and DeepL.
audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api
Last synced: 13 Apr 2025
https://github.com/gkonovalov/android-vad
Android Voice Activity Detection (VAD) library. Supports WebRTC VAD GMM, Silero VAD DNN, Yamnet VAD DNN models.
android audio-processing deep-neural-networks dnn gmm neural-networks offline on-device-ai onnx-models real-time silero silero-vad speech-detection speech-recoginition vad voice-activity-detection voice-activity-detector voice-detection webrtc yamnet
Last synced: 16 May 2025
https://github.com/busyyang/python_sound_open
语音信号处理试验教程,Python代码
audio-processing blog matlab python
Last synced: 06 Apr 2025
https://github.com/carleslc/audiototext
Transcribe and translate audio to text using Whisper and DeepL.
audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api
Last synced: 06 Apr 2025
https://github.com/YuanGongND/whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
audio audio-classification audio-processing audio-tagging speech-recognition
Last synced: 01 Apr 2025
https://github.com/etienneab3d/whisperhallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
asr audio-processing noise-removal sound-processing text-to-speech vad vocals whisper
Last synced: 16 May 2025
https://github.com/fabiogra/moseca
A Streamilt web app for music source separation & karaoke
audio-processing demucs huggingface music-separation streamlit vocal-remover
Last synced: 06 Apr 2025
https://github.com/aetaric/checkrr
Checkrr Scans your library files for corrupt media and optionally replaces the files via sonarr and radarr
audio audio-processing ffprobe media radarr-api sonarr-api video video-processing
Last synced: 31 Mar 2025
https://github.com/Yuan-ManX/audio-development-tools
This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.
artificial-intelligence audio audio-generation audio-processing deep-learning dsp machine-learning music music-generation signal-processing speech speech-processing speech-synthesis
Last synced: 17 Mar 2025