Projects in Awesome Lists tagged with audio-processing

https://github.com/fabio-sim/audiosep-jax

JAX implementation of AudioSep: Separate Anything You Describe

audio-processing audio-source-separation audiosep deep-learning jax sound-separation

Last synced: 25 Jan 2025

https://github.com/skulux/voicetral

This repository contains an amateur implementation of an interface between the Ollama model and Applio's TTS and voice conversion services. It serves as a basic example of integrating speech recognition, text generation, and audio processing for personal or experimental use.

ai applio audio-processing conversational-ai local natural-language ollama rvc speech-to-text stt text-generation text-to-speech tts voicetral

Last synced: 20 Jan 2025

https://github.com/cizodevahm/speech-recognition-web-application

This repository contains a Flask web application that allows users to upload audio files and convert them to text using Google’s Speech Recognition API.

audio-processing flask google-speech-recognition speech-recognition speech-to-text

Last synced: 17 Jan 2025

https://github.com/ankushrathour/splitaudio

Splitting audio files into chunks on silence using Python Pydub

audio-processing flask pydub python

Last synced: 19 Feb 2025

https://github.com/giovannibedetti/csoundunitypackage

Csound as a Unity Package

audio audio-processing csound dsp generative-music package unity unity-asset unity3d-plugin

Last synced: 21 Feb 2025

https://github.com/gituser12981u2/audio_visualizer

A janky, yet charming terminal-based audio visualizer

audio-processing audio-visualizer fft linux live-music-visualizer macosx music-visualization music-visualizer windows

Last synced: 24 Oct 2024

https://github.com/bkraad47/fat_llama_fftw

fat_llama_fftw is a Python package for upscaling audio files to FLAC or WAV formats using advanced audio processing techniques. It utilizes fftw-accelerated calculations to enhance audio quality by upsampling and adding missing frequencies through FFT, resulting in richer and more detailed audio.

audio audio-processing fftw hires hpc ist parallel upscaler

Last synced: 06 Nov 2024

https://github.com/zeloe/juce_cuda_convolution

Linear realtime convolution using CUDA

audio audio-processing convolution cuda dsp juce

Last synced: 17 Feb 2025

https://github.com/brandonmfong/audionoisefilter

Uses a Butterworth filter to remove noise from a .wav file

audio-analysis audio-processing digital-signal-processing matlab

Last synced: 10 Feb 2025

https://github.com/atharv-naik/whale-call-recognition-flask-app

Web app that lets users upload audio files and classifies each one as a blue whale A-call or not

audio-classification audio-processing flask machine-learning

Last synced: 17 Feb 2025

https://github.com/grohith327/artist_classification

Detects the artist of a song from a 5-second snippet

audio-processing deep-learning neural-networks signal-processing spectrogram

Last synced: 19 Feb 2025

https://github.com/elhaban3ro/asegtool

AsegTool is a tool designed to generate a segmentation file that is usable within my other tool. 🌵

audio-processing audio-segmentation video-processing video-segmentation

Last synced: 30 Jan 2025

https://github.com/cbenoit/media-cutter

GTK-based tool using ffmpeg and optionally SoX to easily process audio and video files.

audio-processing graphical-applications rust video-processing

Last synced: 28 Jan 2025

https://github.com/beingamanforever/audio-analysis

Here I have done various feature extractions on audio samples; this repository also contains all the work I have done in audio analysis. Its aim is to host the methods needed in the other audio-analysis projects I will be doing from here on.

audio-processing feature-extraction music-generation-deep-learning python

Last synced: 09 Jan 2025

https://github.com/ayushverma135/audfake-ai-audio-generator

This project demonstrates real-time audio processing using Python. It captures audio from a microphone, converts the speech to text, and then synthesizes the text back to speech using a different voice. This can be useful for applications such as voice changers, real-time translation, and more.

ai-audio-generation audio audio-processing python pyttsx3 speech-recognition speech-to-text

Last synced: 21 Feb 2025

https://github.com/scootpl/go-tensorflow-audio-example

An example of using a neural network model (LSTM) with the TensorFlow Go API.

audio audio-processing example golang guitar lstm-neural-networks machine-learning neural-network tensorflow tensorflow-go wav

Last synced: 12 Feb 2025

https://github.com/HasiruNavodya/audio-effects-web-api

A simple web API for adding audio effects (reverb, compression, delay) to audio and video files

api audio-effects audio-processing firebase-admin-sdk flask python spotify-pedalboard

Last synced: 22 Nov 2024

https://github.com/zbo14/audiollusion

A toolkit for generating auditory illusions using SoX.

audio audio-processing bash illusions sox

Last synced: 31 Jan 2025

https://github.com/dynesshely/audiostudio

Your audio studio

audio audio-library audio-processing sound studio voice

Last synced: 26 Jan 2025

https://github.com/dantehemerson/audio-tag-editor

Editor for audio file tags, built with Express and Gatsby.

api audio-processing editor express gatsby id3 mp3 tageditor

Last synced: 26 Jan 2025

https://github.com/alexadam/encode

Converts cellular automata output to an audio spectrum

audio audio-effect audio-processing c cellular-automata music-composition music-visualizer spectrogram spectrum

Last synced: 01 Feb 2025

https://github.com/yanivhaliwa/chatwithaimodels

ai anthropic anthropic-claude audio-processing chatbot gpt hume machine-learning nlp python voice-analysis voice-recognition

Last synced: 07 Jan 2025

https://github.com/johannesklauss/syra

Syra is a web-based DAW with collaboration in mind, powered by SOUL (Sound Language).

audio-processing music typescript webassembly webaudio-api

Last synced: 05 Feb 2025

https://github.com/sveinse/elns-release

Multi channel audio processing tool [RELEASES]

audio audio-processing dsp python3

Last synced: 01 Feb 2025

https://github.com/dinoosauro/ffmpeg-vlc-audio-metadata

Convert an audio file from one format to another using ffmpeg, carrying all of the metadata from the input over to the output.

audio audio-processing ffmpeg m4a m4a-tags metadata mp3 mp3-tags ogg ogg-files ogg-opus ogg-vorbis taglib-sharp taglibsharp

Last synced: 03 Jan 2025

https://github.com/mariona-ft/multimedia-networks-xamu

Multimedia Networks (Xarxes Multimèdia), 2023-24 course, EPSEVG

audio-processing image-processing iptv multimedia-systems qos telephony telephony-services video-processing

Last synced: 15 Feb 2025

https://github.com/maschlr/summaree_bot

AI assistant to transcribe, translate and summarize voice messages and audio files

ai ai-agents ai-assistant ai-assisted audio-processing python-telegram-bot speech-to-text speech-to-text-app telegram telegram-bot telegram-bots voice-assistant voice-chat voice-recognition

Last synced: 28 Jan 2025

https://github.com/risaddex/spotify-radio-jsexpert-06

Repository for the event taught by @ErickWendel

audio-processing docker e2e-tests jest multiplex node-streams nodejs streams-api streams2 testing

Last synced: 05 Feb 2025

https://github.com/nannigalaxy/audio-preprocessing-tool

Audio preprocessing tool for signal processing and machine learning applications.

audio-processing augmentation machine-learning mfcc signal-processing

Last synced: 12 Feb 2025

https://github.com/brlin-tw/snap-packaging-for-mp3splt

Snap Packaging for Mp3splt

application audio-processing mp3 mp3splt ogg snap snap-packaging snappy splitter

Last synced: 22 Jan 2025

https://github.com/anastasiya-masalava/audio_recorder

A sample Audio Recording application in Swift.

audio-processing swift

Last synced: 28 Jan 2025

https://github.com/bhojpur/audio

Bhojpur Audio is a software-as-a-service product used as an audio processing engine, based on the Bhojpur.NET Platform for application delivery.

audio-processing

Last synced: 24 Jan 2025

https://github.com/mohammed-majid/speech-emotion-recognition

Multi-Class Deep Audio Classification - Mel-frequency Cepstral Coefficients (MFCC)

audio-processing deep-learning lstm neural-network

Last synced: 17 Jan 2025

https://github.com/cbuschka/bpm-tools

Fork from http://www.pogo.org.uk/~mark/bpm-tools.git

audio-processing bpm

Last synced: 16 Jan 2025

https://github.com/rdhillbb/ozz-wiz-realtimego

Example of real-time audio with OpenAI, written in Go; a clone of the Python/JavaScript example

audio-processing generative-ai golang-application golang-examples openai openai-realtime-api realtime

Last synced: 31 Jan 2025

https://github.com/nagababumo/open-source-models-with-hugging-face

asr audio-detection audio-processing automatic-speech-recognition blip clip huggingface huggingface-spaces huggingface-transformers image-captioning image-classification image-retrieval multi-modality object-detection open-source segementation sentence-embeddings transformers visual-question-answering zero-shot-learning

Last synced: 14 Jan 2025

https://github.com/sriharikapu/languageaudioconverter

The objective of this repository, Language Audio Converter (LAC), is to help people convert audio from one language to another

ai audio audio-processing language-model ml

Last synced: 22 Jan 2025

https://github.com/iffyloop/easyairwindows

Easily integrate Airwindows effects into any application, without VST or AU frameworks, just like any other external library

airwindows audio audio-effect audio-library audio-processing cpp

Last synced: 21 Jan 2025

https://github.com/michalspano/waveform-audio-enhancer

Enhance your .wav files programmatically.

audio audio-processing enhancement waveform

Last synced: 18 Jan 2025

https://github.com/ynsrc/kotlin-javafx-canvas-audio

Transforms microphone input to graphics on canvas

audio audio-analysis audio-processing audio-visualizer javafx kotlin

Last synced: 08 Jan 2025

https://github.com/silasberger/wave

A simple tone generator.

audio audio-processing exercise fun music toy-project

Last synced: 15 Jan 2025

https://github.com/skitsanos/react-tts

Using the microphone in React, audio prefillers, and Speech Synthesis

audio audio-player audio-processing microphone react reactjs

Last synced: 15 Jan 2025

https://github.com/arslanex/whisperdemo

A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.

audio-processing openai openai-whisper python whisper

Last synced: 23 Nov 2024

https://github.com/mdbecker/whisper_cpp_macos_utils

Automated transcription workflow for macOS: Shell scripts to streamline audio recording, conversion, and transcription using whisper.cpp with macOS utilities like QuickTime Player and BlackHole-2ch.

audio-processing openai shell-scripts speech-to-text transcription whisper whisper-cpp

Last synced: 29 Jan 2025

https://github.com/1dagord/chord-creator

Allows users to create chords and melodies through a sheet-music-inspired GUI

audio-processing gui music music-composition music-player python python3

Last synced: 28 Jan 2025

https://github.com/eye-wave/wavetable-to-image

This is a command-line tool that converts WAV files into images.

audio audio-analyser audio-processing audio-visualizer image image-processing music wavetable wavetable-synthesizer wavetable-visualizer

Last synced: 08 Jan 2025

https://github.com/aswajith7077/tictactoe-v2

An upgraded Tic Tac Toe game featuring a vibrant loading screen, a game menu with music and sound control, and engaging animations.

audio-processing kotlin-android svg-animations tic-tac-toe

Last synced: 29 Jan 2025

https://github.com/sourceduty/audio_analyzer

🎵 Analyze music and audio files.

ai ai-music analyst analyzer artificial-intelligence audio audio-analysis audio-analyzer audio-files audio-processing chatgpt custom-gpt gpt gpt-bot gpts music music-ai

Last synced: 28 Jan 2025

https://github.com/drscotthawley/fastproaudio-old

End-to-end audio with fast.ai

audio audio-processing deep-learning fastai

Last synced: 07 Jan 2025

https://github.com/jaketurner616/discordmusicdownloaderbot

Ultimate search, acquisition, and organization system for .mp3 music files directly in Discord.

asynchronous-programming audio-processing configparser discord-bot discord-py mp3-conversion pydub python pytube youtube-api youtube-downloader youtube-search

Last synced: 15 Jan 2025

https://github.com/jkarppinen/audio-split-helper

Tool for utilizing bash + FFmpeg to clip audio with proper timestamps.

audio-processing ffmpeg python

Last synced: 12 Feb 2025

https://github.com/avicted/hip_fm_synthesis

This project demonstrates FM (frequency modulation) synthesis using HIP (Heterogeneous-computing Interface for Portability), enabling high-performance sound generation on both AMD and NVIDIA GPUs.

amd audio-processing cuda fm-synthesis hip nvidia rocm

Last synced: 23 Jan 2025

https://github.com/otonomee/against-the-clock-transcript-analysis

This repository contains code and analysis for exploring the transcripts of the various "Against The Clock" videos featured on the FACT Magazine YouTube channel. The goal is to uncover insights, patterns, and trends across the different artists and their creative process under time constraints.

against-the-clock ai-analysis audio-processing creative-ai creative-process data-analysis fact-magazine machine-learning music-production natural-language-processing nlp text-mining yt-dlp

Last synced: 07 Jan 2025

https://github.com/wyy511511/chinese-phonetic-dictionary-dataset

Chinese Phonetic Dataset with Homophone Clustering

audio audio-classification audio-processing audio-visualizer chinese python speech

Last synced: 05 Feb 2025

https://github.com/olbrichattila/audionorm

Audio Normalization Tool for MP3 to WAV Conversion

audio-normalization audio-processing golang-cli normalization

Last synced: 08 Feb 2025

https://github.com/wahidpanda/audio-watermarking-matlab-project

Watermark Audio and Image GUI is a MATLAB-based graphical user interface that allows users to apply watermarks to audio files using image data.

audio-processing audio-steganography audio-watermarking cse-project cyber-security cyber-security-project eee-project matlab-gui matlab-project signal-processing

Last synced: 21 Jan 2025

https://github.com/ricci2511/audio_transcode

Script for transcoding audio in video files to AC3 format with customizable settings

audio-processing shell

Last synced: 29 Jan 2025

https://github.com/loglux/flexaudioprint

FlexAudioPrint is a Python-based app for transcribing audio to text using OpenAI's Whisper model. It offers a Gradio web interface and a script for programmatic use. With FFmpeg for audio conversion, it supports multiple formats like MP3 and WAV. Ideal for transcribing meetings, lectures, and podcasts, with options to save results as text file

ai artificial-intelligence audio-processing gradio openai-whisper transcribe transcribe-audio-files

Last synced: 14 Jan 2025

https://github.com/arthurfdlr/swainsonsthrush-detector

🎶🐦 Swainson's Thrush's pit call detection using the Generalized Likelihood Ratio Test

audio-processing birdsong data-science glrt signal-processing statistics

Last synced: 14 Jan 2025

https://github.com/ashenoy95/music-info-processing

Some takeaways from a Music Info Processing course I took at IU in Fall '16

audio-processing music-information-retrieval r

Last synced: 07 Jan 2025

https://github.com/io7m-com/aradine

Modular programmable synthesis.

audio-processing audio-synthesis realtime

Last synced: 27 Jan 2025

https://github.com/anand-ma/kootta-suruki

This helps summarize meeting calls in text (STT / ASR)

ai asr audio audio-processing python streamlit streamlit-webapp summary tts

Last synced: 11 Feb 2025

https://github.com/richard-hartmann/tuner

piano tuner (general pitch detection) in Python

audio-processing frequency-analysis piano-utils pitch-detection pitch-estimation python3 tuner

Last synced: 16 Feb 2025

https://github.com/chloelavrat/speech-to-text-app

Speech-to-text web app based on Streamlit and Whisper that extracts the script from audio or a YouTube video.

audio-processing machine-learning machinelearning speech-to-text streamlit streamlit-webapp stt whisper whisper-ai

Last synced: 02 Jan 2025

https://github.com/unclechu/lv2-channel-delay

LV2 plugin. Signal delay by specific channel.

audio audio-processing c lv2 lv2-plugin signal-delay sound sound-processing

Last synced: 18 Feb 2025

https://github.com/chatrli/audio-wave

Audio visualisation with sound waves

audio audio-processing audio-visualizer css html javascript

Last synced: 21 Jan 2025

https://github.com/fitzwilliammuseum/deathonthenileanalysis

Analysis of audio transcription of Death on the Nile audio guides

audio-processing cambridge-university museum pybossa r rstats

Last synced: 14 Jan 2025

https://github.com/realsba/esp32-mod-player

ESP32 MOD Player: A lightweight ESP-IDF component for playing MOD files on ESP32 devices, leveraging the ModPlayer library as a submodule. Includes an example project for easy integration and setup.

audio-player audio-processing cpp cpp23 embedded-audio esp-idf esp-idf-component esp32 mod mod-player sound-processing tracker

Last synced: 13 Feb 2025

https://github.com/fitzwilliammuseum/fitzdeathonthenileaudio

A MicroPasts Pybossa template for transcribing audio files

audio-processing citizen-science egyptology fitzwilliam-museum pybossa

Last synced: 14 Jan 2025

https://github.com/ximaz/audio-pitcher

Vanilla JavaScript client-side application for uploading an MP3 file, changing its pitch, playing the new version, and exporting it, if good enough, to WAV format.

audio audio-effect audio-player audio-processing javascript js pitch-control

Last synced: 05 Feb 2025

https://github.com/junxian428/audiodatawithfft

audio-processing fft

Last synced: 21 Jan 2025

https://github.com/whatuhh/on-dac-18

Cheap customizable digital mixing console!

audio audio-processing mixing mixing-audio raspberry-pi

Last synced: 06 Feb 2025

https://github.com/wa-lead/audio2md

Summarizes audio using OpenAI's Whisper-1 model and GPT-3.5 Turbo

audio-processing gpt-3 openai python whisper

Last synced: 26 Jan 2025

https://github.com/tanvirongh/resonance-mapper

Analyzes audio signals and categorizes them into predefined frequency ranges, providing valuable insights into the sonic makeup of the audio

audio-analysis audio-classification audio-processing

Last synced: 13 Feb 2025

https://github.com/fx2y/densim

Densim is a library for efficient similarity search and clustering of dense vectors, which are numerical representations of data such as images, text, or audio.

audio-processing clustering data-science dense-vectors image-processing large-scale-dataset machine-learning numerical-representation parameter-tuning performance-optimization similarity-search text-analysis

Last synced: 26 Jan 2025

https://github.com/blargian/sound_localization

STM32-based sound localization for the STM32F746G-DISCO board.

audio-processing sound-localization sound-processing stm32 stm32f746g-discovery

Last synced: 03 Feb 2025

https://github.com/steviecurran/wav3mp3

Script to convert .wav files to MP3 via command line

audio-processing command-line-tool compression mp3 powerpoint wav

Last synced: 14 Jan 2025

https://github.com/prinuvinod/subtitle-generator-from-audio-input

This code generates subtitles from the given audio input.

audio-processing googlespeechapi python3 speech-recognition subtitles

Last synced: 06 Jan 2025

https://github.com/hiway-media/soundwave-go

SoundWave-go is a tool

audio audio-processing ffmpeg waveform

Last synced: 27 Jan 2025

https://github.com/ml13571/audio-classifier

Classification model to detect water, alarm and other sounds, including training, inference and dataset

ai audio-classification audio-processing classification ml

Last synced: 11 Jan 2025

https://github.com/dimitrisstyl7/speech-and-audio-processing-project-2024

University Project

audio-processing python speech-and-audio-processing speech-recognition

Last synced: 03 Nov 2024

https://github.com/rameshovyas/mp4-to-mp3

A Python tool that converts MP4 to MP3

audio-processing convert-video-files ffmpeg ffmpeg-script python python3 video-to-audio

Last synced: 14 Jan 2025

https://github.com/sudeepacharjee/dictionary

Unlock the world of words with our dynamic dictionary web app. Powered by advanced algorithms, it provides comprehensive definitions, synonyms, antonyms, and examples for a vast range of words. Enhance your vocabulary, improve language skills, and explore the richness of language through this intuitive and informative online resource

api audio-processing css3 dictionary html5 javascript

Last synced: 14 Jan 2025

https://github.com/sagartr/deep-audio-classifier-using-machine-learning

Languages used: Python. Developed and implemented a deep audio classifier using CNNs and LSTMs to categorize diverse audio signals, achieving high accuracy and robustness. Utilized Python and TensorFlow for model development and training, incorporating data augmentation techniques to enhance performance.

audio-processing capuchin librosa python tensorflow tensorflow-models

Last synced: 13 Feb 2025

https://github.com/nmrr/signal-processing-in-100-lines

A 100-line example of signal processing written in C++

audio-processing signal-processing

Last synced: 23 Jan 2025

https://github.com/jdsherbert/audio-haas-effect

Simple C++ implementation of the Haas technique, with a brief explanation.

audio audio-effect audio-effects audio-processing cpp delay haas

Last synced: 13 Feb 2025

https://github.com/skippi/osumix

A command-line tool for converting beatmaps into audio files.

audio audio-processing command-line osu osugame pydub python

Last synced: 21 Jan 2025

https://github.com/ahmed-ai-01/multimodal-rag

An AI-powered chat application using text, audio, and images for context-aware responses. It integrates language models and vector databases to enhance retrieval-augmented generation (RAG) capabilities, making it a versatile tool for intelligent conversations.

ai audio-processing chatbot image-processing language-model multimodal pdf-processing pinecone rag streamlit

Last synced: 03 Feb 2025

https://github.com/flokapi/easymix

Simple live and track audio mixer in Python

audio-processing mixer

Last synced: 04 Feb 2025

https://github.com/jdsherbert/audio-delay

Simple C++ implementation of a basic Delay technique. Includes an example usage case.

audio audio-effect audio-effects audio-processing cpp delay haas

Last synced: 13 Feb 2025

https://github.com/codybloemhard/lv2-host-minimal

A minimal LV2 host

audio audio-processing dsp linux lv2 lv2-host lv2-plugins

Last synced: 18 Feb 2025

https://github.com/dhartisangani/melomint-processor-service

A unique music platform empowering artists with AI-driven nested royalties, NFT members, and transparent analytics. Seamless access to a vast high-quality song library for users. Support artists with exclusive NFTs on the Flow blockchain.

ai audio-processing cosine-similarity fastapi ml python tenser

Last synced: 23 Jan 2025

https://github.com/akiomik/precountify

A tool for adding a pre-count (count-off) click to an audio file

audio-processing bpm-detection metronome music-practice

Last synced: 14 Dec 2024

https://github.com/healscodes/ogg-swift

Thin wrapper around libogg for Swift 5+

audio audio-processing ogg swift-wrapper swift5

Last synced: 07 Feb 2025

https://github.com/ernanej/audio-spectrum-visualizer

Visualization of the spectral content of an audio file using Python.

audio-processing pds python spectrum-analyzer

Last synced: 05 Feb 2025

https://github.com/nightey3s/speech-emotion-recognition-using-wav2vec2

A Speech Emotion Recognition (SER) system using Facebook's Wav2Vec2 model that classifies speech into four emotions (Neutral, Happy, Sad, Angry). Achieves 69.02% accuracy on IEMOCAP dataset using modern transformer architecture and comprehensive data augmentation techniques.

audio-processing deep-learning emotion-recognition machine-learning pytorch speech-recognition wav2vec2