Projects in Awesome Lists tagged with audio-processing
A curated list of projects in awesome lists tagged with audio-processing .
https://github.com/fabio-sim/audiosep-jax
JAX implementation of AudioSep: Separate Anything You Describe
audio-processing audio-source-separation audiosep deep-learning jax sound-separation
Last synced: 25 Jan 2025
https://github.com/skulux/voicetral
This repository contains an amateur implementation of an interface between the Ollama model and Applio's TTS and voice conversion services. It serves as a basic example of integrating speech recognition, text generation, and audio processing for personal or experimental use.
ai applio audio-processing conversational-ai local natural-language ollama rvc speech-to-text stt text-generation text-to-speech tts voicetral
Last synced: 20 Jan 2025
https://github.com/cizodevahm/speech-recognition-web-application
This repository contains a Flask web application that allows users to upload audio files and convert them to text using Google’s Speech Recognition API.
audio-processing flask google-speech-recognition speech-recognition speech-to-text
Last synced: 17 Jan 2025
https://github.com/ankushrathour/splitaudio
Splitting audio files into chunks on silence using Python Pydub
audio-processing flask pydub python
Last synced: 19 Feb 2025
https://github.com/giovannibedetti/csoundunitypackage
Csound as a Unity Package
audio audio-processing csound dsp generative-music package unity unity-asset unity3d-plugin
Last synced: 21 Feb 2025
https://github.com/gituser12981u2/audio_visualizer
A janky, yet charming terminal-based audio visualizer
audio-processing audio-visualizer fft linux live-music-visualizer macosx music-visualization music-visualizer windows
Last synced: 24 Oct 2024
https://github.com/bkraad47/fat_llama_fftw
fat_llama_fftw is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques. It utilizes fftw-accelerated calculations to enhance audio quality by upsampling and adding missing frequencies through FFT, resulting in richer and more detailed audio.
audio audio-processing fftw hires hpc ist parallel upscaler
Last synced: 06 Nov 2024
https://github.com/zeloe/juce_cuda_convolution
Linear realtime convolution using CUDA
audio audio-processing convolution cuda dsp juce
Last synced: 17 Feb 2025
https://github.com/brandonmfong/audionoisefilter
Using Butterworth filter to cancel out the noise from an .wav file
audio-analysis audio-processing digital-signal-processing matlab
Last synced: 10 Feb 2025
https://github.com/atharv-naik/whale-call-recognition-flask-app
Web app that allows to upload audio files to recognize it as Blue whale A-call or not
audio-classification audio-processing flask machine-learning
Last synced: 17 Feb 2025
https://github.com/grohith327/artist_classification
Detect artist of a song from a 5 second snippet of song
audio-processing deep-learning neural-networks signal-processing spectrogram
Last synced: 19 Feb 2025
https://github.com/elhaban3ro/asegtool
AsegTool is a tool designed to generate a segmentation file that is usable within my other tool. 🌵
audio-processing audio-segmentation video-processing video-segmentation
Last synced: 30 Jan 2025
https://github.com/cbenoit/media-cutter
GTK based tool using ffmpeg and optionally SoX to easily process audio and video files.
audio-processing graphical-applications rust video-processing
Last synced: 28 Jan 2025
https://github.com/beingamanforever/audio-analysis
Here I have done various feature extractions upon audio samples, and this repository also contains all the work I have done in audio analysis. The aim of this repository is to host the methods need in other audio-analysis-projects that I would be doing hence forward.
audio-processing feature-extraction music-generation-deep-learning python
Last synced: 09 Jan 2025
https://github.com/ayushverma135/audfake-ai-audio-generator
This project demonstrates real-time audio processing using Python. It captures audio from a microphone, converts the speech to text, and then synthesizes the text back to speech using a different voice. This can be useful for applications such as voice changers, real-time translation, and more.
ai-audio-generation audio audio-processing python pyttsx3 speech-recognition speech-to-text
Last synced: 21 Feb 2025
https://github.com/scootpl/go-tensorflow-audio-example
An example of using a neural network model (LSTM) with the Tensorflow Go API.
audio audio-processing example golang guitar lstm-neural-networks machine-learning neural-network tensorflow tensorflow-go wav
Last synced: 12 Feb 2025
https://github.com/HasiruNavodya/audio-effects-web-api
A simple web API for adding audio effects (reverb, compression, delay) for audio files and video files
api audio-effects audio-processing firebase-admin-sdk flask python spotify-pedalboard
Last synced: 22 Nov 2024
https://github.com/zbo14/audiollusion
A toolkit for generating auditory illusions using SoX.
audio audio-processing bash illusions sox
Last synced: 31 Jan 2025
https://github.com/dynesshely/audiostudio
Your audio studio
audio audio-library audio-processing sound studio voice
Last synced: 26 Jan 2025
https://github.com/dantehemerson/audio-tag-editor
Editor of audio files tags, build with express and gatsbyjs.
api audio-processing editor express gatsby id3 mp3 tageditor
Last synced: 26 Jan 2025
https://github.com/alexadam/encode
convert cellular automata's output to audio spectrum
audio audio-effect audio-processing c cellular-automata music-composition music-visualizer spectrogram spectrum
Last synced: 01 Feb 2025
https://github.com/johannesklauss/syra
Syra is a web based DAW with collaboration in mind, powered by SOUL (Sound Language).
audio-processing music typescript webassembly webaudio-api
Last synced: 05 Feb 2025
https://github.com/sveinse/elns-release
Multi channel audio processing tool [RELEASES]
audio audio-processing dsp python3
Last synced: 01 Feb 2025
https://github.com/dinoosauro/ffmpeg-vlc-audio-metadata
Convert from an audio file to another using ffmpeg. Plus, parse all of the metadata from the input to the output.
audio audio-processing ffmpeg m4a m4a-tags metadata mp3 mp3-tags ogg ogg-files ogg-opus ogg-vorbis taglib-sharp taglibsharp
Last synced: 03 Jan 2025
https://github.com/mariona-ft/multimedia-networks-xamu
XARXES MULTIMÈDIA Curs 2023-24 EPSEVG
audio-processing image-processing iptv multimedia-systems qos telephony telephony-services video-processing
Last synced: 15 Feb 2025
https://github.com/maschlr/summaree_bot
AI assistant to transcribe, translate and summarize voice messages and audio files
ai ai-agents ai-assistant ai-assisted audio-processing python-telegram-bot speech-to-text speech-to-text-app telegram telegram-bot telegram-bots voice-assistant voice-chat voice-recognition
Last synced: 28 Jan 2025
https://github.com/risaddex/spotify-radio-jsexpert-06
Repositório referente ao evento ministrado pelo @ErickWendel
audio-processing docker e2e-tests jest multiplex node-streams nodejs streams-api streams2 testing
Last synced: 05 Feb 2025
https://github.com/nannigalaxy/audio-preprocessing-tool
Audio preprocessing tool for signal processing and machine learning applications.
audio-processing augmentation machine-learning mfcc signal-processing
Last synced: 12 Feb 2025
https://github.com/brlin-tw/snap-packaging-for-mp3splt
Snap Packaging for Mp3splt
application audio-processing mp3 mp3splt ogg snap snap-packaging snappy splitter
Last synced: 22 Jan 2025
https://github.com/anastasiya-masalava/audio_recorder
A sample Audio Recording application in Swift.
Last synced: 28 Jan 2025
https://github.com/bhojpur/audio
The Bhojpur Audio is a software-as-a-service product used as an Audio Processing Engine based on Bhojpur.NET Platform for application delivery.
Last synced: 24 Jan 2025
https://github.com/mohammed-majid/speech-emotion-recognition
Multi-Class Deep Audio Classification - Mel-frequency Cepstral Coefficients (MFCC)
audio-processing deep-learning lstm neural-network
Last synced: 17 Jan 2025
https://github.com/cbuschka/bpm-tools
Fork from http://www.pogo.org.uk/~mark/bpm-tools.git
Last synced: 16 Jan 2025
https://github.com/rdhillbb/ozz-wiz-realtimego
Example of Real Time Audio by OpenAI written in GO; Clone of Python/javascript example
audio-processing generative-ai golang-application golang-examples openai openai-realtime-api realtime
Last synced: 31 Jan 2025
https://github.com/nagababumo/open-source-models-with-hugging-face
asr audio-detection audio-processing automatic-speech-recognition blip clip huggingface huggingface-spaces huggingface-transformers image-captioning image-classification image-retrieval multi-modality object-detection open-source segementation sentence-embeddings transformers visual-question-answering zero-shot-learning
Last synced: 14 Jan 2025
https://github.com/sriharikapu/languageaudioconverter
The objective for me to create this repository (LAC) Language Audio Converter is to help people convert the audio from on language to another language
ai audio audio-processing language-model ml
Last synced: 22 Jan 2025
https://github.com/iffyloop/easyairwindows
Easily integrate Airwindows effects into any application, without VST or AU frameworks, just like any other external library
airwindows audio audio-effect audio-library audio-processing cpp
Last synced: 21 Jan 2025
https://github.com/michalspano/waveform-audio-enhancer
Enhance your .wav files programmatically.
audio audio-processing enhancement waveform
Last synced: 18 Jan 2025
https://github.com/ynsrc/kotlin-javafx-canvas-audio
Transforms microphone input to graphics on canvas
audio audio-analysis audio-processing audio-visualizer javafx kotlin
Last synced: 08 Jan 2025
https://github.com/silasberger/wave
A simple tone generator.
audio audio-processing exercise fun music toy-project
Last synced: 15 Jan 2025
https://github.com/skitsanos/react-tts
Using microphone in react, audio prefillers and Speech Synthesis
audio audio-player audio-processing microphone react reactjs
Last synced: 15 Jan 2025
https://github.com/arslanex/whisperdemo
A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.
audio-processing openai openai-whisper python whisper
Last synced: 23 Nov 2024
https://github.com/mdbecker/whisper_cpp_macos_utils
Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.
audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp
Last synced: 29 Jan 2025
https://github.com/1dagord/chord-creator
Allows users to create chords and melodies through a sheet music inspired GUI
audio-processing gui music music-composition music-player python python3
Last synced: 28 Jan 2025
https://github.com/eye-wave/wavetable-to-image
This is a command-line tool that converts WAV files into images.
audio audio-analyser audio-processing audio-visualizer image image-processing music wavetable wavetable-synthesizer wavetable-visualizer
Last synced: 08 Jan 2025
https://github.com/aswajith7077/tictactoe-v2
An upgraded Tic Tac Toe game featuring a vibrant loading screen, a game menu with music and sound control, and engaging animations.
audio-processing kotlin-android svg-animations tic-tac-toe
Last synced: 29 Jan 2025
https://github.com/sourceduty/audio_analyzer
🎵 Analyze music and audio files.
ai ai-music analyst analyzer artificial-intelligence audio audio-analysis audio-analyzer audio-files audio-processing chatgpt custom-gpt gpt gpt-bot gpts music music-ai
Last synced: 28 Jan 2025
https://github.com/drscotthawley/fastproaudio-old
End-to-end audio with fast.ai
audio audio-processing deep-learning fastai
Last synced: 07 Jan 2025
https://github.com/jaketurner616/discordmusicdownloaderbot
Ultimate search, acquisition and organization system for .mp3 music files directly in discord.
asynchronous-programming audio-processing configparser discord-bot discord-py mp3-conversion pydub python pytube youtube-api youtube-downloader youtube-search
Last synced: 15 Jan 2025
https://github.com/jkarppinen/audio-split-helper
Tool for utilizing bash + FFmpeg to clip audio with proper timestamps.
audio-processing ffmpeg python
Last synced: 12 Feb 2025
https://github.com/avicted/hip_fm_synthesis
This project demonstrates FM Synthesis (Frequency Modulation) using HIP (Heterogeneous Compute Interface), enabling high-performance sound generation on both AMD and NVIDIA GPUs.
amd audio-processing cuda fm-synthesis hip nvidia rocm
Last synced: 23 Jan 2025
https://github.com/otonomee/against-the-clock-transcript-analysis
This repository contains code and analysis for exploring the transcripts of the various "Against The Clock" videos featured on the FACT Magazine YouTube channel. The goal is to uncover insights, patterns, and trends across the different artists and their creative process under time constraints.
against-the-clock ai-analysis audio-processing creative-ai creative-process data-analysis fact-magazine machine-learning music-production natural-language-processing nlp text-mining yt-dlp
Last synced: 07 Jan 2025
https://github.com/wyy511511/chinese-phonetic-dictionary-dataset
Chinese Phonetic Dataset with Homophone Clustering
audio audio-classification audio-processing audio-visualizer chinese python speech
Last synced: 05 Feb 2025
https://github.com/olbrichattila/audionorm
Audio Normalization Tool for MP3 to WAV Conversion
audio-normalization audio-processing golang-cli normalization
Last synced: 08 Feb 2025
https://github.com/wahidpanda/audio-watermarking-matlab-project
Watermark Audio and Image GUI is a MATLAB-based graphical user interface that allows users to apply watermarks to audio files using image data.
audio-processing audio-steganography audio-watermarking cse-project cyber-security cyber-security-project eee-project matlab-gui matlab-project signal-processing
Last synced: 21 Jan 2025
https://github.com/ricci2511/audio_transcode
Script for transcoding audio in video files to AC3 format with customizable settings
Last synced: 29 Jan 2025
https://github.com/loglux/flexaudioprint
FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a script for programmatic use. With FFmpeg for audio conversion, it supports multiple formats like MP3 and WAV. Ideal for transcribing meetings, lectures, and podcasts, with options to save results as text file
ai artificial-intelligence audio-processing gradio openai-whisper transcribe transcribe-audio-files
Last synced: 14 Jan 2025
https://github.com/arthurfdlr/swainsonsthrush-detector
🎶🐦 Swainson's Thrush's pit call detection using the Generalized Likelihood Ratio Test
audio-processing birdsong data-science glrt signal-processing statistics
Last synced: 14 Jan 2025
https://github.com/ashenoy95/music-info-processing
Some takeaways from a Music Info Processing course I took at IU in Fall '16
audio-processing music-information-retrieval r
Last synced: 07 Jan 2025
https://github.com/io7m-com/aradine
Modular programmable synthesis.
audio-processing audio-synthesis realtime
Last synced: 27 Jan 2025
https://github.com/anand-ma/kootta-suruki
This helps summarize meeting calls in text (STT / ASR)
ai asr audio audio-processing python streamlit streamlit-webapp summary tts
Last synced: 11 Feb 2025
https://github.com/richard-hartmann/tuner
piano tuner (general pitch detection) in Python
audio-processing frequency-analysis piano-utils pitch-detection pitch-estimation python3 tuner
Last synced: 16 Feb 2025
https://github.com/chloelavrat/speech-to-text-app
Speech to text web app based on Streamlit and whisper that extract script for audio or youtube video.
audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai
Last synced: 02 Jan 2025
https://github.com/unclechu/lv2-channel-delay
LV2 plugin. Signal delay by specific channel.
audio audio-processing c lv2 lv2-plugin signal-delay sound sound-processing
Last synced: 18 Feb 2025
https://github.com/chatrli/audio-wave
audio visualisation with waves sound
audio audio-processing audio-visualizer css html javascript
Last synced: 21 Jan 2025
https://github.com/fitzwilliammuseum/deathonthenileanalysis
Analysis of audio transcription of Death on the Nile audio guides
audio-processing cambridge-university museum pybossa r rstats
Last synced: 14 Jan 2025
https://github.com/realsba/esp32-mod-player
ESP32 MOD Player: A lightweight ESP-IDF component for playing MOD files on ESP32 devices, leveraging the ModPlayer library as a submodule. Includes an example project for easy integration and setup.
audio-player audio-processing cpp cpp23 embedded-audio esp-idf esp-idf-component esp32 mod mod-player sound-processing tracker
Last synced: 13 Feb 2025
https://github.com/fitzwilliammuseum/fitzdeathonthenileaudio
A MicroPasts Pybossa template for transcribing audio files
audio-processing citizen-science egyptology fitzwilliam-museum pybossa
Last synced: 14 Jan 2025
https://github.com/ximaz/audio-pitcher
Vanillia Javascript Client-side application enabling to upload an MP3 file, change it's pitch, play the newer version and export it, if good enough, to a WAV file format.
audio audio-effect audio-player audio-processing javascript js pitch-control
Last synced: 05 Feb 2025
https://github.com/whatuhh/on-dac-18
Cheap customizable digital mixing console!
audio audio-processing mixing mixing-audio raspberry-pi
Last synced: 06 Feb 2025
https://github.com/wa-lead/audio2md
Summarizes audio using openai Whisper-1 model and GPT-Turbo3.5
audio-processing gpt-3 openai python whisper
Last synced: 26 Jan 2025
https://github.com/tanvirongh/resonance-mapper
Analyzes audio signals and categorizes them into predefined frequency ranges, providing valuable insights into the sonic makeup of the audio
audio-analysis audio-classification audio-processing
Last synced: 13 Feb 2025
https://github.com/fx2y/densim
Densim is a library for efficient similarity search and clustering of dense vectors, which are numerical representations of data such as images, text, or audio.
audio-processing clustering data-science dense-vectors image-processing large-scale-dataset machine-learning numerical-representation parameter-tuning performance-optimization similarity-search text-analysis
Last synced: 26 Jan 2025
https://github.com/blargian/sound_localization
STM32 based sound localization for STM32F746G-DISCO board.
audio-processing sound-localization sound-processing stm32 stm32f746g-discovery
Last synced: 03 Feb 2025
https://github.com/steviecurran/wav3mp3
Script to convert .wav files to MP3 via command line
audio-processing command-line-tool compression mp3 powerpoint wav
Last synced: 14 Jan 2025
https://github.com/prinuvinod/subtitle-generator-from-audio-input
This code generates subtitles from the given audio input.
audio-processing googlespeechapi python3 speech-recognition subtitles
Last synced: 06 Jan 2025
https://github.com/hiway-media/soundwave-go
SoundWave-go is a tool
audio audio-processing ffmpeg waveform
Last synced: 27 Jan 2025
https://github.com/ml13571/audio-classifier
Classification model to detect water, alarm and other sounds, including training, inference and dataset
ai audio-classification audio-processing classification ml
Last synced: 11 Jan 2025
https://github.com/dimitrisstyl7/speech-and-audio-processing-project-2024
University Project
audio-processing python speech-and-audio-processing speech-recognition
Last synced: 03 Nov 2024
https://github.com/rameshovyas/mp4-to-mp3
A python tool that converts mp4 to mp3
audio-processing convert-video-files ffmpeg ffmpeg-script python python3 video-to-audio
Last synced: 14 Jan 2025
https://github.com/sudeepacharjee/dictionary
Unlock the world of words with our dynamic dictionary web app. Powered by advanced algorithms, it provides comprehensive definitions, synonyms, antonyms, and examples for a vast range of words. Enhance your vocabulary, improve language skills, and explore the richness of language through this intuitive and informative online resource
api audio-processing css3 dictionary html5 javascript
Last synced: 14 Jan 2025
https://github.com/sagartr/deep-audio-classifier-using-machine-learning
Languages Used: Python Developed and implemented a deep audio classifier using CNNs and LSTMs to accurately categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and TensorFlow for model development and training, incorporating data augmentation techniques to enhance performance
audio-processing capuchin librosa python tensorflow tensorflow-models
Last synced: 13 Feb 2025
https://github.com/nmrr/signal-processing-in-100-lines
A 100 lines example of signal processing written in C++
audio-processing signal-processing
Last synced: 23 Jan 2025
https://github.com/jdsherbert/audio-haas-effect
Simple C++ implementation of the haas technique, with brief explanation.
audio audio-effect audio-effects audio-processing cpp delay haas
Last synced: 13 Feb 2025
https://github.com/skippi/osumix
A command-line tool for converting beatmaps into audio files.
audio audio-processing command-line osu osugame pydub python
Last synced: 21 Jan 2025
https://github.com/ahmed-ai-01/multimodal-rag
An AI-powered chat application using text, audio, and images for context-aware responses. It integrates language models and vector databases to enhance retrieval-augmented generation (RAG) capabilities, making it a versatile tool for intelligent conversations.
ai audio-processing chatbot image-processing language-model multimodal pdf-processing pinecone rag streamlit
Last synced: 03 Feb 2025
https://github.com/flokapi/easymix
Simple live and track audio mixer in python
Last synced: 04 Feb 2025
https://github.com/jdsherbert/audio-delay
Simple C++ implementation of a basic Delay technique. Includes an example usage case.
audio audio-effect audio-effects audio-processing cpp delay haas
Last synced: 13 Feb 2025
https://github.com/codybloemhard/lv2-host-minimal
A minimal lv2 host
audio audio-processing dsp linux lv2 lv2-host lv2-plugins
Last synced: 18 Feb 2025
https://github.com/dhartisangani/melomint-processor-service
A Unique music platform empowering artists with AI-driven nested royalties and NFT members and Transparent analytics. Seamless access to a vast high-quality song library for users. Support artists with exclusive NFTs with Flow blockchain.
ai audio-processing cosine-similarity fastapi ml python tenser
Last synced: 23 Jan 2025
https://github.com/akiomik/precountify
A tool for adding pre-count (count-off) click to audio file
audio-processing bpm-detection metronome music-practice
Last synced: 14 Dec 2024
https://github.com/healscodes/ogg-swift
Thin wrapper around libogg for Swift 5+
audio audio-processing ogg swift-wrapper swift5
Last synced: 07 Feb 2025
https://github.com/ernanej/audio-spectrum-visualizer
Visualization of the spectral content of an audio file using python.
audio-processing pds python spectrum-analyzer
Last synced: 05 Feb 2025
https://github.com/nightey3s/speech-emotion-recognition-using-wav2vec2
A Speech Emotion Recognition (SER) system using Facebook's Wav2Vec2 model that classifies speech into four emotions (Neutral, Happy, Sad, Angry). Achieves 69.02% accuracy on IEMOCAP dataset using modern transformer architecture and comprehensive data augmentation techniques.
audio-processing deep-learning emotion-recognition machine-learning pytorch speech-recognition wav2vec2
Last synced: 14 Feb 2025
https://github.com/fabe/ph
📮 Sharable podcast highlights.
audio-processing ffmpeg podcast
Last synced: 14 Feb 2025
https://github.com/healscodes/vorbis-swift
Thin wrapper around libvorbis for Swift5+
audio audio-codec audio-processing swift-wrapper swift5 vorbis
Last synced: 07 Feb 2025
https://github.com/1dagord/chord-guessr
Machine learning model chooses which chord would fit best over a chord progression
audio-processing feature-engineering jupyter-notebook machine-learning python python3 webscraping
Last synced: 12 Feb 2025
https://github.com/troy-lamerton/discordbotjvm
Voice bot for Discord. Pipes audio into the active Discord voice channel.
audio-processing discord discord-bot vivox
Last synced: 10 Feb 2025