An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with pyaudio

A curated list of projects in awesome lists tagged with pyaudio .

https://github.com/aiXander/Realtime_PyAudio_FFT

Realtime audio analysis in Python, using PyAudio and Numpy to extract and visualize FFT features from streaming audio.

audio-visualizer fft pyaudio realtime-audio spectral-analysis

Last synced: 03 Apr 2025

https://github.com/mattmoony/figaro

Real-time voice-changer for voice-chat, etc. Will support many different voice-filters and features in the future. 🎵

audio cli discord figaro microphone pyaudio python roadmap sound sound-effects soundboard teamspeak virtual voice voice-changer voice-chat voice-filters

Last synced: 04 Apr 2025

https://github.com/markjay4k/audio-spectrum-analyzer-in-python

A series of Jupyter notebooks and python files which stream audio from a microphone using pyaudio, then processes it.

fft jupyter-notebook matplotlib notebook pyaudio python python3 scipy signal-processing spectrum-analyzer stream-audio

Last synced: 13 Apr 2025

https://github.com/oliverguhr/wav2vec2-live

A live speech recognition using Facebooks wav2vec 2.0 model.

asr pyaudio speech speech-recognition speech-to-text wav2vec wav2vec2

Last synced: 22 Jan 2026

https://github.com/s0d3s/pyaudiowpatch

🐍 PyAudio | PortAudio fork with WASAPI loopback support 🔊 Record audio from speakers on Windows

audio loopback pyaudio python record-speaker-output record-what-you-hear speaker-recording wasapi windows

Last synced: 16 May 2025

https://github.com/lihanghang/casr-demo

基于Flask Web的中文自动语音识别演示系统,包含语音识别、语音合成、声纹识别之说话人识别。

baidu-aip casr-demo ctc flask-application gmm pyaudio speaker-recognition speech-to-text

Last synced: 20 Aug 2025

https://github.com/zthxxx/python-speech_recognition

A simple example for use speech recognition baidu api with python.

pyaudio python scipy speech speech-recognition

Last synced: 12 Apr 2025

https://github.com/nikhiljohn10/pi-clap

A python package for clap detection

gpio linux macos package pi-clap pyaudio python3 raspberrypi raspbian-os

Last synced: 20 Jun 2025

https://github.com/maqifrnswa/scimpy

Scimpy Speaker Design Tool

pyaudio speaker

Last synced: 07 Apr 2026

https://github.com/hacky1997/voice-based-email-for-blind

Emailing System for visually impaired persons

avbin blind pyaudio python3 speech voice voice-commands

Last synced: 01 Apr 2026

https://github.com/irahorecka/visuaudio

GUI application to visualize audio spectrum

audio audio-visualizer gui-application pyaudio pyqt5 pyqtgraph

Last synced: 14 Oct 2025

https://github.com/mgonzs13/audio_common

A PortAudio based audio_common with text to speech for ROS 2

audio espeak pyaudio ros2 text-to-speech tts

Last synced: 17 Sep 2025

https://github.com/denczo/pyblaster

Monophonic synthesizer with Midi, ADSR, Reverb and LFO

digital-signal-processing lowlevel pyaudio python tkinter

Last synced: 19 Jun 2025

https://github.com/koushikphy/telespy

Take Photo/Audio/Video from webcam by remotely controlling it using a Telegram bot.

audio opencv opencv-python pyaudio pytelegrambotapi python python-telegram-bot screenshot telegram telegram-bot video webcam

Last synced: 21 Mar 2025

https://github.com/omonimus1/personal_assistant

🎙️ A vocal assistant that performs other tasks than simply talk, using Text-to-Speech and Speech-To-text.

google-api personal-assistant personal-assistants pyaudio python pyttsx3 speech speech-to-text text-to-speech text-to-speech-python3

Last synced: 10 Mar 2026

https://github.com/kazuhito00/draw-audio-spectrum-using-opencv

オーディオスペクトラムや波形をOpenCVで描画するサンプル

audio-spectrum fft opencv pyaudio

Last synced: 24 Jun 2025

https://github.com/Koushikphy/TeleSpy

Take Photo/Audio/Video from webcam by remotely controlling it using a Telegram bot.

audio opencv opencv-python pyaudio pytelegrambotapi python python-telegram-bot screenshot telegram telegram-bot video webcam

Last synced: 09 Jul 2025

https://github.com/fujiwarachoki/wds

War Drone Simulation System in Python

pyaudio python scipy ucav

Last synced: 30 Apr 2026

https://github.com/mycroftai/pylisten

A simple pyaudio microphone interface

library microphone pyaudio recording

Last synced: 11 Jul 2025

https://github.com/vistaran/speech-to-type

Speech to type text. Basic python script that continuously listens to your voice and transforms it to keyboard typing events.

pyaudio pyinput python speech-recognition speech-to-text speech-to-type

Last synced: 15 Apr 2025

https://github.com/f33rni/webinar-hacker

Automatic lectures recording and transcription on the webinar.ru platform

ai hacking lecture opencv pyaudio python recording screenshots subtitles transcriber webinar

Last synced: 03 Oct 2025

https://github.com/jharrilim/rasadocker

Docker image with Rasa + Anaconda + Tensorflow + portaudio + PyAudio + SpeechRecognition

conda docker portaudio pyaudio rasa rasa-x speech-recognition tensorflow

Last synced: 04 Sep 2025

https://github.com/ys-sudo/image2sound

Image to Sound Converter Project - python desktop GUI app.

numpy pil pyaudio pyqt-gui pyqt5 pyqt5-desktop-application pyqtgraph python3 wave

Last synced: 15 Apr 2025

https://github.com/bacdong/virtual-assistant-v1

Learning build virtual assistant with python and python library support.

ai library pyaudio python python3 pyttsx3 speech-recognition virtual-assistant

Last synced: 25 Jun 2025

https://github.com/vinniem-3/guitar-fretboard-trainer

Python program to help you learn the guitar fretboard

fretboard guitar numpy pyaudio python

Last synced: 29 Jan 2026

https://github.com/castella1313/speech-emotion-recognizer

This is a speech emotion recogniser. This will tell you the emotion in the speech.

kivy kivy-framework machine-learning mlp-classifier pyaudio python

Last synced: 12 Apr 2025

https://github.com/leionion/voice-to-trade-binance-whisper

Hands-free crypto trading — speak a command, execute on Binance. Powered by OpenAI Whisper for real-time speech-to-order execution.

algorithmic-trading binance binance-api crypto fintech hands-free-trading openai order-execution pyaudio python speech-to-text trading-bot voice-commands voice-trading whisper

Last synced: 31 May 2026

https://github.com/ritwik880/virtual-assistant

By using modules of python and some concepts of AI/ML, I have developed a virtual assistant.

ai npm-package pyaudio python-library python3

Last synced: 12 May 2025

https://github.com/andy671/audioretranslator

Raw audio sender for cases like streaming computer audio from your laptop to the PC.

audio-streaming pyaudio soundflower waveform

Last synced: 06 Jul 2025

https://github.com/icereed/openai-whisper-voice-transcriber

Record voice -> OpenAI API -> Get text

openai pyaudio whisper-ai

Last synced: 12 Apr 2025

https://github.com/rcdalj/speech2speech

Full speech-to-speech workflow (can be customized to user's requirements)

chatgpt machine-translation pyaudio pydub python-3 speech-recognition whisper-ai

Last synced: 27 Jun 2025

https://github.com/xasopheno/whoyouare

A Deep Learning Toolkit for Composition and Improvisation

audio keras pyaudio python real-time

Last synced: 05 Sep 2025

https://github.com/daveshap/maragi_sensor_audio

Audio sensor microservice for robots and AI

artificial-intelligence maragi maragi-sensor pyaudio rest-api

Last synced: 16 Mar 2026

https://github.com/skulltech/prattle

Prattle away! Made using Python3.

chat opencv peer-to-peer pyaudio python3 sockets video-chat

Last synced: 30 Apr 2026

https://github.com/hxndev/human-voice-to-automated-voice-text

This project converts your human voice input to its text transcript and to an automated voice too.

code gtts human-to-robo-voice human-voice pyaudio python speech-recognition speech-to-text text text-to-speech

Last synced: 31 Mar 2025

https://github.com/donno2048/canon

Play canon in d but not in d

canon music pyaudio

Last synced: 09 Nov 2025

https://github.com/snlionel90/tts-pyquendo

Convert your text in speech using TTS Loquendo libraries

locuendo pyaudio pyqt5 python3 text-to-speech tkinter-gui tts tts-engines visual-studio-code

Last synced: 23 Mar 2025

https://github.com/abdullahashfaqvirk/Speech-Translation-Agent

The Speech Translation Agent is a real time application with a Streamlit interface that allows users to select languages, speak, view the translation and hear the agent vocalize the translated text.

googletrans gtts playsound pyaudio python speech-recognition streamlit

Last synced: 27 Sep 2025

https://github.com/akash-rajak/volume-suggester

Python Script to suggest the volume at which the music audio file needs to be played for better experience and feeling.

audio-feature-extraction audio-loudness ffmpeg librosa matplotlib mutagen numpy os path pyaudio pydub pynput python3 subprocess tkinter volume-suggester wave

Last synced: 18 Feb 2026

https://github.com/notsooshariff/noura-ai

An AI Voice Assistant that can read emails, WhatsApp messages, clipboard data, and captures webcam images and screenshots for contextual understanding.

customtkinter gemini groq openai-tts pyaudio

Last synced: 26 Jan 2026

https://github.com/vpanjeta/audio-based-train-debugging

Debugging or analyzing neural network training by generating audio samples wrt model gradients

neural-network pyaudio pytorch

Last synced: 26 Apr 2026

https://github.com/soumyapro/speech_to_text

Code to convert speech into text and save it in a text file.

pyaudio speech-recognition

Last synced: 01 Mar 2025

https://github.com/otamajakusi/opencv_video_with_audio

opencv video with audio play

ffmpeg opencv pyaudio

Last synced: 08 May 2026

https://github.com/nomadsdev/pulse-detect

PulseDetect is a Python tool that detects audio frequencies in real-time. It captures sound from the microphone and identifies the dominant frequency using pyaudio and numpy

numpy pulse-detect pyaudio python scipy

Last synced: 07 Jan 2026

https://github.com/suvashsumon/speechrecognitiondemo

A python program to create text to audio and audio to a text file.

pyaudio python3 speechrecognition-python

Last synced: 04 Apr 2025

https://github.com/fardinhash/speech-recognition

This is just a simple speech recognition experiment. Based on PyAudio, pyttsx3. You can customize it for advance level AI bot or Assistant etc.

pyaudio python pyttsx3 speech-recognition

Last synced: 15 May 2026

https://github.com/boudhayan-dev/rasppa

A personal assistant that transcribes notes , using Raspberry Pi 2.

dropbox pyaudio pydub python3 raspberry-pi-2 raspbian speech-recognition

Last synced: 14 Feb 2026

https://github.com/mukeshlilawat1/voice-assistant-using-python

A smart voice assistant built with Python that listens to your voice commands, responds using speech, opens websites, and answers questions using OpenAI (or local AI models). Designed like your own Jarvis from Iron Man!

pip pyaudio python

Last synced: 04 Oct 2025

https://github.com/psa-jforestier/pypymorse

Morse decoder program. Use audio input from soundcard. In Python. Work under Windows. From an old code fftmorse.c . Use PyAudio and NCurses

cw ham-radio morse-code pyaudio python python-curses-library

Last synced: 16 Jun 2025

https://github.com/bolisettysujith/screenrecorder

It is a screen recorder program which can record both voice from the mic and Screen

cv2 ffmpeg pyaudio python screenrecorder voicerecorder

Last synced: 10 May 2026

https://github.com/arbazkhan4712/text-to-speech

A program that can convert Text into Speech using python

pyaudio python python3 pyttsx3 text-to-speech texttospeech

Last synced: 16 May 2025

https://github.com/subuhana2303/vaanirakshak_offline-emergency-voice-assistant

VaaniRakshak is an offline voice assistant built for disaster scenarios, enabling hands-free emergency support without internet connectivity. It assists users in locating shelters, requesting help, and accessing life-saving information through voice interaction.

audio-input json pyaudio pyttsx3 speech-recognition text-to-speech tkinter-gui vosk

Last synced: 23 Jul 2025

https://github.com/fikriaf/ai

🤖 API-Based AI Chatbot from OpenAI

chatbot openai pyaudio speech-recognition

Last synced: 30 Apr 2026

https://github.com/bharatkalluri/rewinder

Travel back in time and get audio from the recent past. On demand.

audio pyaudio python3 recorder typer-cli

Last synced: 12 Mar 2025

https://github.com/lasithaamarasinghe/real-time-speech-recognition

This project includes a system that can record live speech using your microphone and then transcribe it using speech recognition.

ipywidgets jupyter-notebook machine-learning pyaudio pydub python3 pytorch realtime-speech-recognition speech-to-text transformers vosk

Last synced: 06 May 2026

https://github.com/ajitashwath/voice-recorder-gui

A user-friendly GUI-based voice recorder in Python for seamless audio recording.

pyaudio python3

Last synced: 27 Jun 2025

https://github.com/saeed-dev2/link_app-multifunctionl-app

Developed a cutting-edge application that integrates real-time text chatting, voice calling, and file sharing functionalities into a user-friendly interface. This project leverages socket programming to deliver seamless communication.

data-structures gui-application network-programming os pickle pil pillow pyaudio python3 socket-programming struct threading-and-concurrency tkinter-python

Last synced: 26 Feb 2025

https://github.com/jonasmarquesdev/voice-recognition-automation

Automação com reconhecimento de voz em Python utilizando as bibliotecas pyttsx3, subprocess, pyaudio e speech_recognition envolve a criação de um programa que permite a interação com o computador por meio de comandos de voz.

automation pyaudio python pyttsx3 speech-recognition

Last synced: 10 Apr 2025

https://github.com/tristan-mcinnis/simultaneous-interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 10 Apr 2026

https://github.com/kientech/speech-recognition-study

The study involves various aspects of speech recognition, including audio preprocessing, model training, and real-time speech-to-text conversion

assembly-language pyaudio python speech-recognition

Last synced: 15 Mar 2025

https://github.com/ghsaboias/jarvis-voice-assistant

This project is a voice assistant named Jarvis, designed for macOS. It uses speech recognition and text-to-speech to interact with your computer.

pyaudio pyttsx3 speech-recognition voice-assistant

Last synced: 14 Mar 2025

https://github.com/priyanshscpp/raone_ai

This is is a desktop AI Interface for Windows developed in Python Language

newsapi openai pyaudio pycharm python

Last synced: 18 Apr 2026

https://github.com/priyanshscpp/RaOne_AI

This is is a desktop AI Interface for Windows developed in Python Language

newsapi openai pyaudio pycharm python

Last synced: 20 Aug 2025

https://github.com/wdbm/tonescale

sound utilities and sounds

aplay pyaudio sound sounds stream

Last synced: 28 Mar 2025

https://github.com/mohamedsaidsallam/stream-audio-over-network

A simple python project to stream audio over the network. Mainly made so I can stream my laptop's audio to my PC with minimum quality loss.

batch pyaudio python socket socketpython tkinter venv

Last synced: 24 Mar 2025

https://github.com/ankushrathour/audiomaker

AudioMaker is a Python package for generating seamless, long-form audio from massive text inputs. Unlike traditional TTS tools, AudioMaker can handle book-length content (even 4+ hours) by splitting text into chunks, synthesizing each chunk, and merging them into a single audio file.

edge-tts pyaudio python text-to-audio tqdm

Last synced: 13 Aug 2025

https://github.com/d1ogocs/afinador-de-instrumentos

Desenvolvimento de um afinador que se ajusta automaticamente ao instrumento musical escolhido pelo utilizador

butterworth-filter instrument-tuner matplotlib numpy pyaudio python scipy threading tkinter

Last synced: 07 Jan 2026

https://github.com/minjii1079/pytune

Building PyTune, a Python guitar tuner that uses PyAudio for recording, NumPy for math operations, and SciPy for FFT (Fast Fourier Transform) and signal processing.

fft guitar-tuner music numpy pyaudio python3 scipy

Last synced: 07 May 2026

https://github.com/engageintellect/text-to-speech

A text-to-speech engine using microsoft/speecht5_tts and OpenAI.

huggingface openai pyaudio python pytorch tensorflow

Last synced: 13 Apr 2026

https://github.com/dantasl/vocal

Simple application for controlling leds by voice and sockets, simulating lights on a house.

beaglebone-black house-automation pyaudio python3 sockets

Last synced: 31 Mar 2025

https://github.com/deshwalmahesh/whisper-fastapi-realtime

It is Front + Backend app that uses openai/whisper-large-v3-turbo in your consumer grade system to provide real live audio transcription

audio-transcription fastapi huggingface live pyaudio realtime transcription transformers whisper whisper-large

Last synced: 13 Mar 2025

https://github.com/davidy22/offkey

Control your computer with the tone of your voice

kivy pyaudio python voice-control

Last synced: 20 May 2026

https://github.com/tristan-mcinnis/Simultaneous-Interpretation

Simultaneous-Interpretation is an advanced tool for real-time simultaneous interpretation. It transcribes and translates spoken language from a microphone input instantaneously, continually refining translations for accuracy. Ideal for business meetings, educational settings, and live events, it enhances multilingual communication effortlessly.

agents asr faster-whisper openai pyaudio simultaneous-intepreting simultaneous-translation speech-recognition speech-to-text transcription translation whisper

Last synced: 23 Oct 2025

https://github.com/afia45/python-birthday-cake-blowing-candle

Simple Birthday Cake with Candles that Blow Out in Python! 🍰🕯️

pyaudio pygame python

Last synced: 25 Oct 2025

https://github.com/jesse-stewart/octotrackpy

Octophonic 8 Track Player for DAC8x

dac8x dub hifiberry pyaudio python reggae

Last synced: 18 Mar 2026