Projects in Awesome Lists tagged with pyaudio
A curated list of projects in awesome lists tagged with pyaudio .
https://github.com/aiXander/Realtime_PyAudio_FFT
Realtime audio analysis in Python, using PyAudio and Numpy to extract and visualize FFT features from streaming audio.
audio-visualizer fft pyaudio realtime-audio spectral-analysis
Last synced: 03 Apr 2025
https://github.com/mattmoony/figaro
Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵
audio cli discord figaro microphone pyaudio python roadmap sound sound-effects soundboard teamspeak virtual voice voice-changer voice-chat voice-filters
Last synced: 04 Apr 2025
https://github.com/markjay4k/audio-spectrum-analyzer-in-python
A series of Jupyter notebooks and python files which stream audio from a microphone using pyaudio, then processes it.
fft jupyter-notebook matplotlib notebook pyaudio python python3 scipy signal-processing spectrum-analyzer stream-audio
Last synced: 13 Apr 2025
https://github.com/oliverguhr/wav2vec2-live
A live speech recognition using Facebooks wav2vec 2.0 model.
asr pyaudio speech speech-recognition speech-to-text wav2vec wav2vec2
Last synced: 22 Jan 2026
https://github.com/s0d3s/pyaudiowpatch
🐍 PyAudio | PortAudio fork with WASAPI loopback support 🔊 Record audio from speakers on Windows
audio loopback pyaudio python record-speaker-output record-what-you-hear speaker-recording wasapi windows
Last synced: 16 May 2025
https://github.com/TomSchimansky/GuitarTuner
Guitar tuner program made with Python, Tkinter and PyAudio.
audio-analysis chromatic-tuner dark-mode gui-application guitar guitar-tuner macos macos-app music numpy py2app pyaudio python python3 tkinter tkinter-gui tkinter-python tuner ui-design
Last synced: 08 May 2025
https://github.com/tomschimansky/guitartuner
Guitar tuner program made with Python, Tkinter and PyAudio.
audio-analysis chromatic-tuner dark-mode gui-application guitar guitar-tuner macos macos-app music numpy py2app pyaudio python python3 tkinter tkinter-gui tkinter-python tuner ui-design
Last synced: 07 May 2025
https://github.com/lihanghang/casr-demo
基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。
baidu-aip casr-demo ctc flask-application gmm pyaudio speaker-recognition speech-to-text
Last synced: 20 Aug 2025
https://github.com/zthxxx/python-speech_recognition
A simple example for use speech recognition baidu api with python.
pyaudio python scipy speech speech-recognition
Last synced: 12 Apr 2025
https://github.com/nikhiljohn10/pi-clap
A python package for clap detection
gpio linux macos package pi-clap pyaudio python3 raspberrypi raspbian-os
Last synced: 20 Jun 2025
https://github.com/hacky1997/voice-based-email-for-blind
Emailing System for visually impaired persons
avbin blind pyaudio python3 speech voice voice-commands
Last synced: 01 Apr 2026
https://github.com/Soumya-Kushwaha/SoundScape
Real Time Sound Visualizer
audio-visualizer pyaudio pysimplegui real-time sound-visualizer
Last synced: 11 Aug 2025
https://github.com/soumya-kushwaha/soundscape
Real Time Sound Visualizer
audio-visualizer pyaudio pysimplegui real-time sound-visualizer
Last synced: 06 Apr 2025
https://github.com/ttop32/wav2vec2-live-japanese-translator
real time japanese speech recognition translator using wav2vec2
asr audio automatic-speech-recognition fine-tuning huggingface japanese live pyaudio pyqt5 pytorch real-time speaker-recognition speech-to-text spoken-language-understanding stt translation translator voice voice-recognition wav2vec2
Last synced: 03 Sep 2025
https://github.com/irahorecka/visuaudio
GUI application to visualize audio spectrum
audio audio-visualizer gui-application pyaudio pyqt5 pyqtgraph
Last synced: 14 Oct 2025
https://github.com/mantiereid/mta-metrocard-reader
MTA metrocard reader
card-reader metrocard-reader mta perl pyaudio python shell
Last synced: 13 May 2025
https://github.com/mgonzs13/audio_common
A PortAudio based audio_common with text to speech for ROS 2
audio espeak pyaudio ros2 text-to-speech tts
Last synced: 17 Sep 2025
https://github.com/mramshaw/speech-recognition
Speech recognition with Python
microprocessor monotonic nlp pocketsphinx pyaudio python raspberry-pi raspberry-pi-3 speech-recognition
Last synced: 11 Apr 2025
https://github.com/denczo/pyblaster
Monophonic synthesizer with Midi, ADSR, Reverb and LFO
digital-signal-processing lowlevel pyaudio python tkinter
Last synced: 19 Jun 2025
https://github.com/koushikphy/telespy
Take Photo/Audio/Video from webcam by remotely controlling it using a Telegram bot.
audio opencv opencv-python pyaudio pytelegrambotapi python python-telegram-bot screenshot telegram telegram-bot video webcam
Last synced: 21 Mar 2025
https://github.com/omonimus1/personal_assistant
🎙️ A vocal assistant that performs other tasks than simply talk, using Text-to-Speech and Speech-To-text.
google-api personal-assistant personal-assistants pyaudio python pyttsx3 speech speech-to-text text-to-speech text-to-speech-python3
Last synced: 10 Mar 2026
https://github.com/kazuhito00/draw-audio-spectrum-using-opencv
オーディオスペクトラムや波形をOpenCVで描画するサンプル
audio-spectrum fft opencv pyaudio
Last synced: 24 Jun 2025
https://github.com/Koushikphy/TeleSpy
Take Photo/Audio/Video from webcam by remotely controlling it using a Telegram bot.
audio opencv opencv-python pyaudio pytelegrambotapi python python-telegram-bot screenshot telegram telegram-bot video webcam
Last synced: 09 Jul 2025
https://github.com/mycroftai/pylisten
A simple pyaudio microphone interface
library microphone pyaudio recording
Last synced: 11 Jul 2025
https://github.com/vistaran/speech-to-type
Speech to type text. Basic python script that continuously listens to your voice and transforms it to keyboard typing events.
pyaudio pyinput python speech-recognition speech-to-text speech-to-type
Last synced: 15 Apr 2025
https://github.com/f33rni/webinar-hacker
Automatic lectures recording and transcription on the webinar.ru platform
ai hacking lecture opencv pyaudio python recording screenshots subtitles transcriber webinar
Last synced: 03 Oct 2025
https://github.com/jharrilim/rasadocker
Docker image with Rasa + Anaconda + Tensorflow + portaudio + PyAudio + SpeechRecognition
conda docker portaudio pyaudio rasa rasa-x speech-recognition tensorflow
Last synced: 04 Sep 2025
https://github.com/mwoss/sound-stream
Real-time sound monitor app with data visualization
audio audio-visualizer data-visualization fft hacktoberfest phase-vocoder pyaudio python raspberry-pi
Last synced: 11 Apr 2025
https://github.com/bhattsameer/eyeshield
Data Transmission Between two devices using Sound
datasharing filesharing librosa matplotlib pied-piper pyaudio pydub python python3 sound soundcompare sounddatatransmission soundjoin soundsplit textsharing wave
Last synced: 19 Apr 2025
https://github.com/ys-sudo/image2sound
Image to Sound Converter Project - python desktop GUI app.
numpy pil pyaudio pyqt-gui pyqt5 pyqt5-desktop-application pyqtgraph python3 wave
Last synced: 15 Apr 2025
https://github.com/bacdong/virtual-assistant-v1
Learning build virtual assistant with python and python library support.
ai library pyaudio python python3 pyttsx3 speech-recognition virtual-assistant
Last synced: 25 Jun 2025
https://github.com/castella1313/speech-emotion-recognizer
This is a speech emotion recogniser. This will tell you the emotion in the speech.
kivy kivy-framework machine-learning mlp-classifier pyaudio python
Last synced: 12 Apr 2025
https://github.com/leionion/voice-to-trade-binance-whisper
Hands-free crypto trading — speak a command, execute on Binance. Powered by OpenAI Whisper for real-time speech-to-order execution.
algorithmic-trading binance binance-api crypto fintech hands-free-trading openai order-execution pyaudio python speech-to-text trading-bot voice-commands voice-trading whisper
Last synced: 31 May 2026
https://github.com/ritwik880/virtual-assistant
By using modules of python and some concepts of AI/ML, I have developed a virtual assistant.
ai npm-package pyaudio python-library python3
Last synced: 12 May 2025
https://github.com/andy671/audioretranslator
Raw audio sender for cases like streaming computer audio from your laptop to the PC.
audio-streaming pyaudio soundflower waveform
Last synced: 06 Jul 2025
https://github.com/icereed/openai-whisper-voice-transcriber
Record voice -> OpenAI API -> Get text
Last synced: 12 Apr 2025
https://github.com/rcdalj/speech2speech
Full speech-to-speech workflow (can be customized to user's requirements)
chatgpt machine-translation pyaudio pydub python-3 speech-recognition whisper-ai
Last synced: 27 Jun 2025
https://github.com/daveshap/maragi_sensor_audio
Audio sensor microservice for robots and AI
artificial-intelligence maragi maragi-sensor pyaudio rest-api
Last synced: 16 Mar 2026
https://github.com/skulltech/prattle
Prattle away! Made using Python3.
chat opencv peer-to-peer pyaudio python3 sockets video-chat
Last synced: 30 Apr 2026
https://github.com/hxndev/human-voice-to-automated-voice-text
This project converts your human voice input to its text transcript and to an automated voice too.
code gtts human-to-robo-voice human-voice pyaudio python speech-recognition speech-to-text text text-to-speech
Last synced: 31 Mar 2025
https://github.com/snlionel90/tts-pyquendo
Convert your text in speech using TTS Loquendo libraries
locuendo pyaudio pyqt5 python3 text-to-speech tkinter-gui tts tts-engines visual-studio-code
Last synced: 23 Mar 2025
https://github.com/abdullahashfaqvirk/Speech-Translation-Agent
The Speech Translation Agent is a real time application with a Streamlit interface that allows users to select languages, speak, view the translation and hear the agent vocalize the translated text.
googletrans gtts playsound pyaudio python speech-recognition streamlit
Last synced: 27 Sep 2025
https://github.com/arbazkhan4712/speech-to-text
A program that can convert Speech into Text using python
pyaudio python pyttsx3 speech-recognition speech-to-text speechrecognition speechrecognition-python
Last synced: 10 Apr 2025
https://github.com/akash-rajak/volume-suggester
Python Script to suggest the volume at which the music audio file needs to be played for better experience and feeling.
audio-feature-extraction audio-loudness ffmpeg librosa matplotlib mutagen numpy os path pyaudio pydub pynput python3 subprocess tkinter volume-suggester wave
Last synced: 18 Feb 2026
https://github.com/konradlinkowski/voicemanager
Brainless voice assisstant.
chromedriver google pipenv pyaudio voice-assistant
Last synced: 22 Mar 2025
https://github.com/notsooshariff/noura-ai
An AI Voice Assistant that can read emails, WhatsApp messages, clipboard data, and captures webcam images and screenshots for contextual understanding.
customtkinter gemini groq openai-tts pyaudio
Last synced: 26 Jan 2026
https://github.com/vpanjeta/audio-based-train-debugging
Debugging or analyzing neural network training by generating audio samples wrt model gradients
neural-network pyaudio pytorch
Last synced: 26 Apr 2026
https://github.com/inforkgodara/python-speech-to-text
A few lines of code which convert speech to text.
inforkgodara pyaudio python python-script python-speech python-speech-to-text speech-recognition speech-to-text speech-to-text-script speechtotext
Last synced: 30 Jul 2025
https://github.com/soumyapro/speech_to_text
Code to convert speech into text and save it in a text file.
Last synced: 01 Mar 2025
https://github.com/otamajakusi/opencv_video_with_audio
opencv video with audio play
Last synced: 08 May 2026
https://github.com/nomadsdev/pulse-detect
PulseDetect is a Python tool that detects audio frequencies in real-time. It captures sound from the microphone and identifies the dominant frequency using pyaudio and numpy
numpy pulse-detect pyaudio python scipy
Last synced: 07 Jan 2026
https://github.com/suvashsumon/speechrecognitiondemo
A python program to create text to audio and audio to a text file.
pyaudio python3 speechrecognition-python
Last synced: 04 Apr 2025
https://github.com/fardinhash/speech-recognition
This is just a simple speech recognition experiment. Based on PyAudio, pyttsx3. You can customize it for advance level AI bot or Assistant etc.
pyaudio python pyttsx3 speech-recognition
Last synced: 15 May 2026
https://github.com/thedvlprs/alexis-speech-assistant
alexis-speech-assistant
alexis-speech-assistant gtts playsound pyaudio pyobjc python-3 speech-recognition
Last synced: 18 Oct 2025
https://github.com/boudhayan-dev/rasppa
A personal assistant that transcribes notes , using Raspberry Pi 2.
dropbox pyaudio pydub python3 raspberry-pi-2 raspbian speech-recognition
Last synced: 14 Feb 2026
https://github.com/mukeshlilawat1/voice-assistant-using-python
A smart voice assistant built with Python that listens to your voice commands, responds using speech, opens websites, and answers questions using OpenAI (or local AI models). Designed like your own Jarvis from Iron Man!
Last synced: 04 Oct 2025
https://github.com/nomadsdev/frequency-insight
SoundFreqAnalyzer: A Python tool to record audio, analyze frequencies, and save results.
audio-analysis audio-recording audio-tool data-visualization fft frequency-analysis keyboard-python numpy pyaudio python scipy signal-processing sound-engineering
Last synced: 21 Feb 2026
https://github.com/psa-jforestier/pypymorse
Morse decoder program. Use audio input from soundcard. In Python. Work under Windows. From an old code fftmorse.c . Use PyAudio and NCurses
cw ham-radio morse-code pyaudio python python-curses-library
Last synced: 16 Jun 2025
https://github.com/bolisettysujith/screenrecorder
It is a screen recorder program which can record both voice from the mic and Screen
cv2 ffmpeg pyaudio python screenrecorder voicerecorder
Last synced: 10 May 2026
https://github.com/damp11113/idrb
IDRB (Internet Digital Radio Broadcasting)
damp11113 encryption epg epgdata imgui internet internet-radio internet-radio-player internet-radio-server internet-radio-station opus opus-codec pyaudio python python-project radio tcp tcp-client tcp-server zeromq
Last synced: 04 May 2025
https://github.com/djleamen/renamer
Utility to rename mp3 files based on speech content
ffmpeg google-speech-recognition googlespeechapi mp3 openai pyaudio pydub python speech-recognition speech-to-text torch util utility wav whisper whisper-ai
Last synced: 12 Apr 2026
https://github.com/arbazkhan4712/text-to-speech
A program that can convert Text into Speech using python
pyaudio python python3 pyttsx3 text-to-speech texttospeech
Last synced: 16 May 2025
https://github.com/subuhana2303/vaanirakshak_offline-emergency-voice-assistant
VaaniRakshak is an offline voice assistant built for disaster scenarios, enabling hands-free emergency support without internet connectivity. It assists users in locating shelters, requesting help, and accessing life-saving information through voice interaction.
audio-input json pyaudio pyttsx3 speech-recognition text-to-speech tkinter-gui vosk
Last synced: 23 Jul 2025
https://github.com/fikriaf/ai
🤖 API-Based AI Chatbot from OpenAI
chatbot openai pyaudio speech-recognition
Last synced: 30 Apr 2026
https://github.com/lasithaamarasinghe/real-time-speech-recognition
This project includes a system that can record live speech using your microphone and then transcribe it using speech recognition.
ipywidgets jupyter-notebook machine-learning pyaudio pydub python3 pytorch realtime-speech-recognition speech-to-text transformers vosk
Last synced: 06 May 2026
https://github.com/ajitashwath/voice-recorder-gui
A user-friendly GUI-based voice recorder in Python for seamless audio recording.
Last synced: 27 Jun 2025
https://github.com/saeed-dev2/link_app-multifunctionl-app
Developed a cutting-edge application that integrates real-time text chatting, voice calling, and file sharing functionalities into a user-friendly interface. This project leverages socket programming to deliver seamless communication.
data-structures gui-application network-programming os pickle pil pillow pyaudio python3 socket-programming struct threading-and-concurrency tkinter-python
Last synced: 26 Feb 2025
https://github.com/jonasmarquesdev/voice-recognition-automation
Automação com reconhecimento de voz em Python utilizando as bibliotecas pyttsx3, subprocess, pyaudio e speech_recognition envolve a criação de um programa que permite a interação com o computador por meio de comandos de voz.
automation pyaudio python pyttsx3 speech-recognition
Last synced: 10 Apr 2025
https://github.com/metawake/speech_to_orders_cli
deep-learning keras neural-networks pyaudio python speech-recognition tensorflow
Last synced: 11 May 2026
https://github.com/tristan-mcinnis/simultaneous-interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 10 Apr 2026
https://github.com/kientech/speech-recognition-study
The study involves various aspects of speech recognition, including audio preprocessing, model training, and real-time speech-to-text conversion
assembly-language pyaudio python speech-recognition
Last synced: 15 Mar 2025
https://github.com/ghsaboias/jarvis-voice-assistant
This project is a voice assistant named Jarvis, designed for macOS. It uses speech recognition and text-to-speech to interact with your computer.
pyaudio pyttsx3 speech-recognition voice-assistant
Last synced: 14 Mar 2025
https://github.com/minhhieu3012/online-meeting-room-app
Ứng dụng phòng họp trực tuyến
chat-application file-transfer multiclient-server opencv pyaudio python socket-programming tcp-udp voice-chat
Last synced: 11 Apr 2026
https://github.com/mohamedsaidsallam/stream-audio-over-network
A simple python project to stream audio over the network. Mainly made so I can stream my laptop's audio to my PC with minimum quality loss.
batch pyaudio python socket socketpython tkinter venv
Last synced: 24 Mar 2025
https://github.com/ankushrathour/audiomaker
AudioMaker is a Python package for generating seamless, long-form audio from massive text inputs. Unlike traditional TTS tools, AudioMaker can handle book-length content (even 4+ hours) by splitting text into chunks, synthesizing each chunk, and merging them into a single audio file.
edge-tts pyaudio python text-to-audio tqdm
Last synced: 13 Aug 2025
https://github.com/src3453/fluentscope
An oscilloscope that makes listening waveform more fun
audio-visualizer correlation mic-audio numpy oscilloscope pyaudio pygame pygame-application python python3 realtime trigger
Last synced: 30 Apr 2026
https://github.com/d1ogocs/afinador-de-instrumentos
Desenvolvimento de um afinador que se ajusta automaticamente ao instrumento musical escolhido pelo utilizador
butterworth-filter instrument-tuner matplotlib numpy pyaudio python scipy threading tkinter
Last synced: 07 Jan 2026
https://github.com/minjii1079/pytune
Building PyTune, a Python guitar tuner that uses PyAudio for recording, NumPy for math operations, and SciPy for FFT (Fast Fourier Transform) and signal processing.
fft guitar-tuner music numpy pyaudio python3 scipy
Last synced: 07 May 2026
https://github.com/engageintellect/text-to-speech
A text-to-speech engine using microsoft/speecht5_tts and OpenAI.
huggingface openai pyaudio python pytorch tensorflow
Last synced: 13 Apr 2026
https://github.com/dantasl/vocal
Simple application for controlling leds by voice and sockets, simulating lights on a house.
beaglebone-black house-automation pyaudio python3 sockets
Last synced: 31 Mar 2025
https://github.com/deshwalmahesh/whisper-fastapi-realtime
It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription
audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large
Last synced: 13 Mar 2025
https://github.com/davidy22/offkey
Control your computer with the tone of your voice
kivy pyaudio python voice-control
Last synced: 20 May 2026
https://github.com/vit0r/poc-audio-reco
audio to text
audio pyaudio python qt5 qt5-gui speechrecognition-python
Last synced: 21 Apr 2026
https://github.com/tristan-mcinnis/Simultaneous-Interpretation
Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.
agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper
Last synced: 23 Oct 2025
https://github.com/afia45/python-birthday-cake-blowing-candle
Simple Birthday Cake with Candles that Blow Out in Python! 🍰🕯️
Last synced: 25 Oct 2025
https://github.com/ahsouza/timenow
API Laravel & SPA Vue.JS containerized
docker docker-compose dockerfile laravel mysql-database nginx nodejs npm numpy php7 pyaudio python3 security-tools shell-script vagrant vuejs vuesax vuex vuex-store zenity
Last synced: 04 Apr 2026