Projects in Awesome Lists tagged with audio-processing

https://github.com/google-ai-edge/mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

android audio-processing c-plus-plus calculator computer-vision deep-learning framework graph-based graph-framework inference machine-learning mediapipe mobile-development perception pipeline-framework stream-processing video-processing

Last synced: 30 Dec 2024

https://github.com/google/mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

android audio-processing c-plus-plus calculator computer-vision deep-learning framework graph-based graph-framework inference machine-learning mediapipe mobile-development perception pipeline-framework stream-processing video-processing

Last synced: 13 Dec 2024

https://google.github.io/mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

android audio-processing c-plus-plus calculator computer-vision deep-learning framework graph-based graph-framework inference machine-learning mediapipe mobile-development perception pipeline-framework stream-processing video-processing

Last synced: 03 Nov 2024

https://github.com/deezer/spleeter

Deezer source separation library including pretrained models.

audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals

Last synced: 30 Dec 2024

https://github.com/Deezer/spleeter

Deezer source separation library including pretrained models.

audio-processing bass deep-learning deezer drums model pretrained-models python tensorflow vocals

Last synced: 13 Nov 2024

https://github.com/speechbrain/speechbrain

A PyTorch-based Speech Toolkit

asr audio audio-processing deep-learning huggingface language-model pytorch speaker-diarization speaker-recognition speaker-verification speech-enhancement speech-processing speech-recognition speech-separation speech-to-text speech-toolkit speechrecognition spoken-language-understanding transformers voice-recognition

Last synced: 30 Dec 2024

https://github.com/tenacityteam/tenacity-legacy

THIS REPO IS NOT MAINTAINED ANYMORE. Please see https://codeberg.org/tenacityteam/tenacity for Tenacity, which is maintained.

audacity audio audio-applications audio-processing floss hacktoberfest libre privacy-friendly privacy-preserving recorder recording-app

Last synced: 25 Sep 2024

https://github.com/bitgapp/eqmac

macOS System-wide Audio Equalizer & Volume Mixer 🎧

angular audio audio-applications audio-effect audio-processing avaudioengine coreaudio eq equalizer hal macos osx swift volume-control volume-mixer

Last synced: 24 Dec 2024

https://github.com/bitgapp/eqMac

macOS System-wide Audio Equalizer & Volume Mixer 🎧

angular audio audio-applications audio-effect audio-processing avaudioengine coreaudio eq equalizer hal macos osx swift volume-control volume-mixer

Last synced: 26 Oct 2024

https://github.com/spotify/pedalboard

🎛 🔊 A Python library for audio.

audio audio-processing audio-production audio-research audio-unit augmentation juce machine-learning pybind11 python vst3 vst3-host

Last synced: 30 Dec 2024

https://github.com/nvidia/dali

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch

Last synced: 30 Dec 2024

https://github.com/NVIDIA/DALI

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

audio-processing data-augmentation data-processing deep-learning fast-data-pipeline gpu gpu-tensorflow image-augmentation image-processing machine-learning mxnet neural-network paddle python pytorch

Last synced: 26 Oct 2024

https://github.com/wyattblue/auto-editor

Auto-Editor: Efficient media analysis and rendering

audio audio-editing audio-processing automatic python3 video video-editing video-processing

Last synced: 24 Dec 2024

https://github.com/WyattBlue/auto-editor

Auto-Editor: Efficient media analysis and rendering

audio audio-editing audio-processing automatic python3 video video-editing video-processing

Last synced: 31 Oct 2024

https://github.com/libaudioflux/audioflux

A library for audio and music analysis, feature extraction.

audio audio-analysis audio-features audio-processing deep-learning machine-learning mfcc mir music music-analysis music-information-retrieval pitch python signal-processing spectral-analysis spectrogram time-frequency-analysis wavelet-analysis wavelet-transform

Last synced: 24 Dec 2024

https://github.com/scottlawsonbc/audio-reactive-led-strip

🎵 🌈 Real-time LED strip music visualization using Python and the ESP8266 or Raspberry Pi

arduino audio-processing esp8266 music-visualizer python raspberry-pi signal-processing

Last synced: 27 Dec 2024

https://github.com/stemrollerapp/stemroller

Isolate vocals, drums, bass, and other instrumental stems from any song

audio-processing bass deep-learning demucs drums electron javascript machine-learning python source-separation vocals

Last synced: 26 Dec 2024

https://github.com/bitfieldaudio/otto

Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]

audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design

Last synced: 27 Dec 2024

https://github.com/bitfieldaudio/OTTO

Sampler, Sequencer, Multi-engine synth and effects - in a box! [WIP]

audio audio-processing music raspberry-pi sequencing synth synthesizer ui-design

Last synced: 10 Nov 2024

https://github.com/pytorch/audio

Data manipulation and transformation for audio signal processing, powered by PyTorch

audio audio-processing io machine-learning python pytorch speech

Last synced: 30 Dec 2024

https://github.com/faiface/beep

A little package that brings sound to any Go application. Suitable for playback and audio-processing.

audio audio-playback audio-processing go golang

Last synced: 25 Dec 2024

https://github.com/libAudioFlux/audioFlux

A library for audio and music analysis, feature extraction.

audio audio-analysis audio-features audio-processing deep-learning machine-learning mfcc mir music music-analysis music-information-retrieval pitch python signal-processing spectral-analysis spectrogram time-frequency-analysis wavelet-analysis wavelet-transform

Last synced: 25 Oct 2024

https://github.com/axinc-ai/ailia-models

The collection of pre-trained, state-of-the-art AI models for ailia SDK

action-recognition anomaly-detection audio-processing background-removal crowd-counting deep-learning embeddings face-detection face-recognition fashion-ai gan hand-detection image-classification image-segmentation llm neural-network object-detection object-recognition object-tracking pose-estimation

Last synced: 24 Dec 2024

https://github.com/julius-speech/julius

Open-Source Large Vocabulary Continuous Speech Recognition Engine

audio-processing recognition speech speech-recognition

Last synced: 25 Dec 2024

https://github.com/GauravBh1010tt/DeepLearn

Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow, and scikit-learn.

audio-processing computer-vision deep-learning nlp

Last synced: 25 Oct 2024

https://github.com/gauravbh1010tt/deeplearn

Implementation of research papers on Deep Learning + NLP + CV in Python using Keras, TensorFlow, and scikit-learn.

audio-processing computer-vision deep-learning nlp

Last synced: 28 Dec 2024

https://github.com/monocasual/giada

Your Hardcore Loop Machine.

audio audio-processing audio-production beatmaking cpp20 daw drum-machine giada giadaloopmachine hardcore-loopmachine juce linux loop-machine macos midi midi-device music music-composition vst3 windows

Last synced: 26 Dec 2024

https://github.com/kfrlib/kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

audio audio-processing avx avx512 clang cplusplus cplusplus-14 cplusplus-17 cpp14 cpp17 cxx dft digital-signal-processing discrete-fourier-transform dsp fast-fourier-transform fft header-only simd

Last synced: 26 Dec 2024

https://github.com/audiamus/aaxaudioconverter

Convert Audible aax files to mp3 and m4a/m4b

aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3

Last synced: 27 Dec 2024

https://github.com/letoram/arcan

Arcan - [Display Server, Multimedia Framework, Game Engine] -> "Desktop Engine"

audio-processing c desktop-environment display-server freebsd game-engine linux lua multimedia-graphic-library openbsd video-processing virtual-reality visualization wayland

Last synced: 26 Dec 2024

https://github.com/audiamus/AaxAudioConverter

Convert Audible aax files to mp3 and m4a/m4b

aa aax audible audio-processing audiobook ffmpeg m4a m4b mp3

Last synced: 16 Nov 2024

https://github.com/mltframework/mlt

MLT Multimedia Framework

audio audio-processing c c-plus-plus ffmpeg framework frei0r ladspa multimedia opengl qt sdl2 video video-processing

Last synced: 26 Dec 2024

https://github.com/ledfx/ledfx

LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.

audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled

Last synced: 24 Dec 2024

https://github.com/LedFx/LedFx

LedFx is a network based LED effect engine designed to deliver advanced real-time audio effects to a wide variety of devices.

audio-processing e131 led-strips microphone music-visualizer python qlc raspberry-pi react webinterface wled

Last synced: 06 Nov 2024

https://github.com/guitarml/smartguitaramp

Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.

audio-processing guitar juce machinelearning neuralnetworks

Last synced: 29 Dec 2024

https://github.com/GuitarML/SmartGuitarAmp

Guitar plugin made with JUCE that uses neural networks to emulate a tube amplifier.

audio-processing guitar juce machinelearning neuralnetworks

Last synced: 12 Nov 2024

https://github.com/cycfi/q

C++ Library for Audio Digital Signal Processing

audio audio-processing c-plus-plus cpp cpp-library cpp20 dsp dsp-library effects frequency function-composition guitar-processor modern-cpp music pitch-detection pitch-tracking synth

Last synced: 27 Dec 2024

https://github.com/flutydeer/audio-slicer

A simple GUI application that slices audio with silence detection

audio-processing gui pyside6 qt6

Last synced: 28 Sep 2024

https://github.com/tracktion/tracktion_engine

Tracktion Engine module

audio audio-processing c-plus-plus cpp daw framework juce

Last synced: 27 Dec 2024

https://github.com/Tracktion/tracktion_engine

Tracktion Engine module

audio audio-processing c-plus-plus cpp daw framework juce

Last synced: 06 Nov 2024

https://github.com/cycfi/Q

C++ Library for Audio Digital Signal Processing

audio audio-processing c-plus-plus cpp cpp-library cpp20 dsp dsp-library effects frequency function-composition guitar-processor modern-cpp music pitch-detection pitch-tracking synth

Last synced: 17 Nov 2024

https://github.com/unosquare/ffmediaelement

FFME: The Advanced WPF MediaElement (based on FFmpeg)

audio audio-processing codec dotnet dotnet-framework ffmpeg ffmpeg-binaries ffplay h264 macos media-playback mediaelement mp3 mp4 mpeg video volume wpf xamarin

Last synced: 26 Dec 2024

https://github.com/mravanelli/sincnet

SincNet is a neural architecture for efficiently processing raw audio samples.

artificial-intelligence asr audio audio-processing cnn convolutional-neural-networks deep-learning digital-signal-processing filtering neural-networks python pytorch signal-processing speaker-identification speaker-recognition speaker-verification speech-processing speech-recognition timit waveform

Last synced: 29 Dec 2024

https://github.com/mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

artificial-intelligence asr audio audio-processing cnn convolutional-neural-networks deep-learning digital-signal-processing filtering neural-networks python pytorch signal-processing speaker-identification speaker-recognition speaker-verification speech-processing speech-recognition timit waveform

Last synced: 11 Nov 2024

https://github.com/bytedance/salmonn

SALMONN: Speech Audio Language Music Open Neural Network

audio audio-processing bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university

Last synced: 27 Dec 2024

https://github.com/mikeroyal/pipewire-guide

PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.

alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst

Last synced: 27 Dec 2024

https://github.com/midas-research/audino

Open source audio annotation tool for humans

annotation-tool audio-annotation audio-processing datasets machine-learning python speech-processing

Last synced: 27 Dec 2024

https://github.com/mikeroyal/PipeWire-Guide

PipeWire Guide. Learn about how PipeWire gives your Linux system a Professional Audio/Video Processing workflow.

alsa audio audio-analysis audio-processing audio-production audio-streaming compressor daw gstreamer ladspa low-latency lv2 midi multimedia pipewire playback pulseaudio spatial-audio video-streaming vst

Last synced: 16 Nov 2024

https://github.com/bytedance/SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

audio audio-processing bytedance iclr2024 icml-2024 large-language-models multi-modal music research speech speech-recognition tsinghua-university

Last synced: 08 Nov 2024

https://github.com/kinwaicheuk/nnaudio

Audio processing by using pytorch 1D convolution network

1d-convolution audio-processing cqt-spectrogram melspectrogram neural-network preprocessing pytorch spectrogram spectrogram-conversion-toolbox stft

Last synced: 27 Dec 2024

https://github.com/KinWaiCheuk/nnAudio

Audio processing by using pytorch 1D convolution network

1d-convolution audio-processing cqt-spectrogram melspectrogram neural-network preprocessing pytorch spectrogram spectrogram-conversion-toolbox stft

Last synced: 22 Nov 2024

https://github.com/ictnlp/streamspeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

all-in-one asr audio-processing machine-translation non-autoregressive seamless simultaneous-translation speech speech-enhancement speech-processing speech-recognition speech-synthesis speech-to-text speech-translation streaming-audio text-to-audio text-to-speech translation tts voice

Last synced: 27 Dec 2024

https://github.com/addictedcs/soundfingerprinting

Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.

acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam

Last synced: 28 Dec 2024

https://github.com/cgzirim/seek-tune

An implementation of Shazam's song matching algorithm.

audio-fingerprinting audio-processing go golang not-shazam shazam song song-recognition-algorithm

Last synced: 24 Nov 2024

https://github.com/dbraun/dawdreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host

Last synced: 26 Dec 2024

https://github.com/AddictedCS/soundfingerprinting

Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.

acoustic-fingerprints algorithm audio audio-processing c-sharp fingerprints locality-sensitive-hashing nearest-neighbor-search recognition shazam

Last synced: 05 Nov 2024

https://github.com/DBraun/DawDreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

ableton audio audio-plugin audio-processing daw faust jax juce midi python synthesizer vst vst-host vst3 vst3-host

Last synced: 27 Oct 2024

https://github.com/timschneeb/RootlessJamesDSP

An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices

android audio audio-processing convolution dsp effects equalizer non-root rootless

Last synced: 06 Nov 2024

https://github.com/timschneeb/rootlessjamesdsp

An implementation of the system-wide JamesDSP audio processing engine for non-rooted Android devices

android audio audio-processing convolution dsp effects equalizer non-root rootless

Last synced: 30 Dec 2024

https://github.com/acoustid/chromaprint

C library for generating audio fingerprints used by AcoustID

acoustid audio audio-analysis audio-fingerprinting audio-processing chromaprint

Last synced: 26 Oct 2024

https://github.com/spotify/klio

Smarter data pipelines for audio.

audio-processing data-pipeline media-processing signal-processing

Last synced: 27 Dec 2024

https://github.com/vadymmarkov/Beethoven

🎸 A maestro of pitch detection.

audio audio-processing ios pitch-detection pitch-engine pitch-estimation swift tuner

Last synced: 09 Dec 2024

https://github.com/vadymmarkov/beethoven

🎸 A maestro of pitch detection.

audio audio-processing ios pitch-detection pitch-engine pitch-estimation swift tuner

Last synced: 30 Dec 2024

https://github.com/f90/Wave-U-Net

Implementation of the Wave-U-Net for audio source separation

audio-processing deep-learning mit-license waveform-analysis

Last synced: 22 Nov 2024

https://github.com/goxr3plus/xr3player

🎧 🎼 The MOST ADVANCED JavaFX Media Player

audio-formats audio-player audio-processing audio-recorder audio-visualizer dropbox-client java-speech java-stream-player javafx mp3 spectrum-analyzer speech stream-player web-browser

Last synced: 30 Dec 2024

https://github.com/x-lance/slam-llm

Speech, Language, Audio, Music Processing with Large Language Model

audio-processing large-language-model multimodal-large-language-models music-processing peft speech-processing

Last synced: 28 Dec 2024

https://github.com/lagmoellertim/unsilence

Console Interface and Library to remove silent parts of a media file 🔈

audio-processing contributions-welcome hacktoberfest media python silence-speedup silencedetect video-processing

Last synced: 22 Nov 2024

https://github.com/relevanceai/vectorhub

Vector Hub - Library for easy discovery and consumption of state-of-the-art models to turn data into vectors (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc.)

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 28 Dec 2024

https://github.com/RelevanceAI/vectorhub

Vector Hub - Library for easy discovery and consumption of state-of-the-art models to turn data into vectors (text2vec, image2vec, video2vec, graph2vec, bert, inception, etc.)

artificial-intelligence audio-processing deep-learning deeplearning embeddings encodings image2vec machine-learning neural-network python pytorch tensorflow tfhub transformers vector vector-similarity video-processing word2vec

Last synced: 11 Nov 2024

https://github.com/open-mmlab/foleycrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝

aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio

Last synced: 29 Dec 2024

https://github.com/josephernest/samplerbox

SamplerBox is a sampler musical instrument based on RaspberryPi.

audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer

Last synced: 25 Dec 2024

https://github.com/josephernest/SamplerBox

SamplerBox is a sampler musical instrument based on RaspberryPi.

audio audio-processing music piano python raspberry-pi raspberrypi raspios sampler samplerbox synthesizer

Last synced: 31 Oct 2024

https://github.com/novoic/surfboard

Novoic's audio feature extraction library

alzheimers-disease audio audio-processing feature-extraction healthcare machine-learning parkinsons-disease python signal-processing speech-processing

Last synced: 04 Nov 2024

https://github.com/opencodewin/MediaEditor

Non-linear editing software that helps you make nice videos.

audio audio-mixing audio-processing filter imgui media-decode media-encode non-linear-editing subtitle-editing video video-editor video-effects video-processing vulkan-shader

Last synced: 05 Nov 2024

https://github.com/omeryusufyagci/fast-music-remover

A C++ based, lightweight music and noise remover for YouTube and other internet media, using DeepFilterNet for audio enhancement.

audio-cleaner audio-enhancement audio-extractor audio-processing cpp deepfilternet ffmpeg flask machine-learning media-editor media-processing music-remover noise-removal processing realtime speech-extractor vocal-extractor youtube yt-dlp

Last synced: 28 Dec 2024

https://github.com/marcogdepinto/emotion-classification-from-audio-files

Understanding emotions from audio files using neural networks and multiple datasets.

audio audio-processing classification-report datascience deep-learning deep-neural-networks emotion emotion-classification-ravdess keras keras-neural-networks librosa livingstone machine-learning python python3 ravdess-dataset song songs speech tensorflow

Last synced: 29 Dec 2024

https://github.com/justinsalamon/scaper

A library for soundscape synthesis and augmentation

audio audio-processing data-augmentation machine-learning machine-listening soundscape soundscape-synthesis sox synthesis

Last synced: 25 Dec 2024

https://github.com/adobe-research/deepafx-st

DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/

adaptive-presets afx ai audio audio-processing audio-production compressor deeplearning drc effects eq music-production styletransfer

Last synced: 30 Dec 2024

https://github.com/Parisson/TimeSide

Scalable audio processing framework and server written in Python

audio-processing python web

Last synced: 25 Oct 2024

https://github.com/wofwca/jumpcutter

⏩ Fast-forwards long pauses between sentences — watch lectures ~1.5x faster (browser extension)

agpl audio audio-processing browser-extension chrome-extension firefox-addon firefox-extension productivity video web-audio-api webextension youtube

Last synced: 29 Dec 2024

https://github.com/scopeInfinity/Video2Description

Video to Text: Natural language description generator for some given video. [Video Captioning]

audio-processing cnn-keras deep-neural-networks image-captioning lstm-neural-networks video-captioning video-processing video-to-text

Last synced: 06 Nov 2024

https://github.com/YuanGongND/whisper-at

Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"

audio audio-classification audio-processing audio-tagging speech-recognition

Last synced: 02 Nov 2024

https://github.com/busyyang/python_sound_open

Speech signal processing experiment tutorial, with Python code

audio-processing blog matlab python

Last synced: 24 Dec 2024

https://github.com/open-mmlab/FoleyCrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝

aigc audio-processing diffusion-models foley-sound-synthesis video-to-audio

Last synced: 12 Oct 2024

https://github.com/etienneab3d/whisperhallu

Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts

asr audio-processing noise-removal sound-processing text-to-speech vad vocals whisper

Last synced: 25 Dec 2024

https://github.com/Yuan-ManX/audio-development-tools

This is a list of sound, audio and music development tools which contains machine learning, audio generation, audio signal processing, sound synthesis, spatial audio, music information retrieval, music generation, speech recognition, speech synthesis, singing voice synthesis and more.

artificial-intelligence audio audio-generation audio-processing deep-learning dsp machine-learning music music-generation signal-processing speech speech-processing speech-synthesis

Last synced: 27 Oct 2024

https://github.com/carleslc/audiototext

Transcribe and translate audio to text using Whisper and DeepL.

audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api

Last synced: 24 Dec 2024

https://github.com/fabiogra/moseca

A Streamlit web app for music source separation & karaoke

audio-processing demucs huggingface music-separation streamlit vocal-remover

Last synced: 30 Dec 2024

https://github.com/igorski/mwengine

Audio engine and DSP library for Android, written in C++ providing low latency performance within a musical context, while providing a Java/Kotlin API. Supports both OpenSL and AAudio.

aaudio android android-ndk audio audio-engine audio-library audio-processing c-plus-plus cplusplus cpp java low-latency ndk opensl

Last synced: 24 Dec 2024

https://github.com/Carleslc/AudioToText

Transcribe and translate audio to text using Whisper and DeepL.

audio audio-processing captions colab-notebook deepl ffmpeg google-colab jupyter-notebook language openai-whisper python speech-to-text subtitles text transcribe transcription translate translation whisper whisper-api

Last synced: 07 Nov 2024

https://github.com/gtreshchev/runtimespeechrecognizer

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

audio-processing openai speech-detection speech-processing speech-recognition speech-to-text ue4 ue4-plugin ue5 ue5-plugin unreal-engine unreal-engine-4 unreal-engine-5 voice-recognition whis whisper whisper-ai whisper-cpp

Last synced: 25 Dec 2024

https://github.com/LeviBorodenko/spectrographic

Turn an image into sound whose spectrogram looks like the image.

audio-processing audio-visualizer frequencies image-processing image-to-sound python sound sound-processing sound-synthesis spectrogram

Last synced: 31 Oct 2024

https://github.com/gtreshchev/RuntimeSpeechRecognizer

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

audio-processing openai speech-detection speech-processing speech-recognition speech-to-text ue4 ue4-plugin ue5 ue5-plugin unreal-engine unreal-engine-4 unreal-engine-5 voice-recognition whis whisper whisper-ai whisper-cpp

Last synced: 06 Nov 2024

https://github.com/openshot/libopenshot-audio

OpenShot Audio Library (libopenshot-audio) is a free, open-source project that enables high-quality editing and playback of audio, and is based on the amazing JUCE library.

audio audio-effects audio-library audio-processing c-plus-plus gplv3 juce juce-framework openshot

Last synced: 30 Dec 2024

https://github.com/jonnor/machinehearing

Machine Learning applied to sound

audio-analysis audio-classsification audio-processing machine-learning notes

Last synced: 24 Dec 2024

https://github.com/calebzulawski/fourier

Fast Fourier transforms (FFTs) in Rust

audio-processing digital-signal-processing dsp fft fourier-transform

Last synced: 29 Dec 2024

https://github.com/alnitak/flutter_soloud

Flutter low-level audio plugin using SoLoud C++ library and FFI

audio audio-player audio-processing audio-visualizer dart-ffi flutter flutter-plugin miniaudio soloud

Last synced: 25 Dec 2024

https://github.com/amishshah/prism-media

Easily transcode media using Node.js 🎶

audio audio-processing ffmpeg media transcoding

Last synced: 29 Dec 2024

https://github.com/s-a/sonic-sound-picture

Sonic Sound Picture (SSP) is a free, offline, and customizable music/audio visualizer software. With a range of templates to choose from, users can easily create stunning audio-visual experiences in just a few simple steps. SSP also allows users to create their own templates, giving them endless possibilities to bring their music to life.

audio audio-processing audio-signal-analysis audio-signal-processing audio-visualizer blender cross-platform digital-art music music-production-enhancement music-visualization music-visualizer templates user-friendly visual-effects visualisation visualization visuals

Last synced: 26 Dec 2024

https://github.com/aetaric/checkrr

Checkrr scans your library files for corrupt media and optionally replaces the files via Sonarr and Radarr.

audio audio-processing ffprobe media radarr-api sonarr-api video video-processing

Last synced: 02 Nov 2024