An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with deepgram

A curated list of projects in awesome lists tagged with deepgram .

https://github.com/alexandresajus/jarvis

Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface

deepgram elevenlabs llm openai python taipy tts voice-assistant

Last synced: 10 Apr 2025

https://github.com/AlexandreSajus/JARVIS

Your own personal voice assistant: Voice to Text to LLM to Speech, displayed in a web interface

deepgram elevenlabs llm openai python taipy tts voice-assistant

Last synced: 05 Apr 2025

https://github.com/deepgram-starters/nextjs-live-transcription

Get started using Deepgram's Live Transcription with this Next.js demo app

deepgram live real-time speech-to-text stt transcription websocket

Last synced: 21 Jan 2026

https://github.com/deepgram/deepgram-js-sdk

Official JavaScript SDK for Deepgram's automated speech recognition APIs.

ai asr automated-speech-recognition deepgram hacktoberfest javascript speech-recognition speech-to-text typescript

Last synced: 29 Dec 2025

https://github.com/deepgram/deepgram-node-sdk

Official JavaScript SDK for Deepgram's automated speech recognition APIs.

ai asr automated-speech-recognition deepgram hacktoberfest javascript speech-recognition speech-to-text typescript

Last synced: 03 Mar 2025

https://github.com/Dhravya/DeepSubtitles

A Python script that generates subtitles and renders them onto the video.

accessibility deepgram devto moviepy python subtitles video-processing

Last synced: 09 Jul 2025

https://github.com/dhravya/deepsubtitles

A Python script that generates subtitles and renders them onto the video.

accessibility deepgram devto moviepy python subtitles video-processing

Last synced: 06 Aug 2025

https://github.com/deepgram/deepgram-rust-sdk

Rust SDK for Deepgram's automated speech recognition APIs.

deepgram hacktoberfest rust speech-recognition speech-to-text

Last synced: 13 Apr 2025

https://github.com/sbis04/decifer

Generate your audio transcripts with ease.

deepgram dev firebase flutter

Last synced: 17 Mar 2025

https://github.com/danieladdisonorg/ai-agent-for-telephony-voice-bot

AI-powered telephony solution that enables businesses to deploy intelligent voice agents for various use cases such as customer support, appointment scheduling, lead qualification, and information collection.

deepgram elevenlabs fastapi openai redis telephony twillio

Last synced: 20 Oct 2025

https://github.com/craigsdennis/genai-phone-call

WIP exploration using Twilio Media Streams and Generative AI

deepgram elevenlabs genai media-streams-api phone twilio

Last synced: 28 Jun 2025

https://github.com/dhravya/discord-voice-transcript-for-teams

A simple discord bot that listens to voice channel and generates a transcript, then assigns tasks and summarises the conversation

anyscale deepgram discord mistral pycord python

Last synced: 07 May 2025

https://github.com/theaifutureguy/livekit-voice-agent

A production-ready voice agent implementation using LiveKit and Python, featuring advanced conversational AI capabilities and optional telephony integration. It provides intelligent turn detection, function calling, comprehensive logging, telephony integration, and audio enhancement.

ai-agent deepgram elevenlabs livekit openai pytho twillio

Last synced: 18 May 2026

https://github.com/xriddin/real-time-ai-voice-assistant

Real Time AI Voice Assistant using nodejs

ai assistant deepgram groq llama3 nodejs playht voice

Last synced: 25 Oct 2025

https://github.com/deepgram-starters/node-transcription

Get started using Deepgram's Transcription with this Node demo app

asr deepgram speech-to-text stt transcription

Last synced: 21 Jan 2026

https://github.com/kaloprojects/kalo-esp32-voice-assistant

Code snippets showing how to record I2S audio and store as .wav file on ESP32 with SD card, how to transcribe pre-recorded audio via Deepgram SpeechToText (STT) API, how to generate audio from text via TextToSpeech (TTS) API from OpenAI a/o SpeechGen a/o Google TTS. Triggering ESP32 actions via Voice.

audio deepgram deepgram-stt esp32 google-tts i2s i2s-audio i2s-microphone inmp441 is2-audio max98357 openai-tts recording sd-card speechgen speechgen-io speechtotext stt texttospeech tts

Last synced: 14 Apr 2025

https://github.com/agentic-insights/voice-bot

AI Agent for Telephony voice bot - based on vocode, twilio, deepgram, and elevenlabs. Just add your own keys and prompt.

deepgram docker docker-compose elevenlabs helm helm-charts kubernetes poetry-python python twilio vocode

Last synced: 27 Feb 2026

https://github.com/yixin0829/push-to-talk

Ultra-fast, customizable AI voice dictation in any active app on Windows (MacOS and Linux coming soon)

cerebras deepgram dictation dictation-tool speech-to-text voice-dictation voice-input whisper

Last synced: 01 May 2026

https://github.com/sinanuozdemir/oreilly-multimodal-ai

Learn how multimodal AI merges text, image, and audio for smarter models

dalle-3 deepgram diffusion dreambooth generative-ai livekit llama3 llava multimodal multimodal-ai openai stable-diffusion

Last synced: 24 Apr 2025

https://github.com/kaloprojects/kalo-esp32-voice-chat-ai-friends

ESP32-based voice device for chatting with multiple custom AI bots. Recording questions with I2S microphone, transcribing via ElevenLabs or Deepgram STT, creating response with Groq or Open AI LLM. TTS audio output with custom AI voices via I2S & speaker. Supporting ongoing dialogues, calling bots ‘by name’, real-time web search via keyword.

audio deepgram deepgram-stt elevenlabs elevenlabs-stt esp32 groq groq-api i2s i2s-audio inmp441 max98357 openai-chatgpt openai-tts recording sd-card speechtotext stt texttospeech tts

Last synced: 19 Aug 2025

https://github.com/deepgram-starters/flask-transcription

Get started using Deepgram's Pre-Recorded Transcription with this Flask demo app

asr deepgram speech-to-text stt transcription

Last synced: 21 Jan 2026

https://github.com/deepgram-starters/nextjs-text-to-speech

Get started using Deepgram's Text-to-Speech with this Next.js demo app

deepgram text-to-speech tts

Last synced: 21 Jan 2026

https://github.com/tover0314-w/opentypeless

Talkmore with Opentypeless. Type with your voice. Anywhere. Talk - Recoding - Polish - Done!

ai byok cross-platform deepgram desktop-app llm open-source rust speech-to-text stt tauri typescript voice-input voice-typing whisper

Last synced: 02 Apr 2026

https://github.com/dotaadarsh/youtxt

App that convert any YouTube video to text. Created for Learn Build Teach Hackathon 2022

deepgram openai streamlit youtube

Last synced: 04 Mar 2026

https://github.com/kaymen99/ai-voice-assistant

AI Voice Assistant: talk to an AI agent that handles event scheduling, managing contacts, accessing your knowledge base and web searching through simple voice commands.

ai-agent ai-speech ai-voice-assistant deepgram gemini-pro-vision gmail google-calendar google-contacts groq litellm llama3 voice-assistant

Last synced: 13 Jul 2025

https://github.com/deepgram-starters/node-text-to-speech

Get started using Deepgram's Text-to-Speech with this Node demo app.

deepgram text-to-speech tts

Last synced: 21 Jan 2026

https://github.com/deepgram/deepgram-js-captions

This package is the JavaScript implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.

asr audio closed-captions deepgram ffmpeg javascript sdk speech speech-to-text srt stt subtitles transcription typescript webvtt youtube

Last synced: 15 Jul 2025

https://github.com/mtwn105/podtext

View Text Versions of your favorite podcasts!

deepgram javascript music next nextjs nextui nodejs podcast react reactjs

Last synced: 12 Apr 2025

https://github.com/arndom/artsy

Artsy is a fun app that allows you to record audio and use the transcribed data to generate art.

ai art deepgram hackathon hackathon-project react transcription voice

Last synced: 09 May 2025

https://github.com/cycle-sync-ai/livekit-voice-ai-agent-setup

This is the guide to show the method to build your own AI-Powered voice agent with LiveKit and Twillio

agent ai assistant deepgram elevenlabs livekit openai phone pstn python realtime-chat realtime-messaging sip stt tts twilio voice websocket

Last synced: 09 Apr 2025

https://github.com/ebowwa/llm_telecenter

A fastapi wrapper of babca / python-gsmmodem for a waveshare sim7600x. Not an exact copy of the 'python-gsmmodem' so be sure to uninstall that lib or venv to run | Open-source Twilio with LLM batteries

agentgpt deepgram elevenlabs elevenlabs-api gsm gsm-modem gsm-module langchain langchain-python llama2 llamacpp mistral-7b mistralai oai openai openai-api pyserial raspberry-pi salesgpt whisper

Last synced: 23 Apr 2025

https://github.com/chai-dev682/talking-ai

Talking Avatar Website

d-id deepgram javascript react

Last synced: 15 Apr 2025

https://github.com/spac5y/vocal-agent

A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.

calendar deepgram email groq knowledgebase kokoro llama speech-to-speech speech-to-text text-to-speech vocal whisper

Last synced: 02 Sep 2025

https://github.com/dotaadarsh/formatsai

Transform the way you analyze data with our AI-powered chatbot

ai21 chatbot deepgram streamlit

Last synced: 14 Apr 2025

https://github.com/kaloprojects/kalo-esp32-voice-chatgpt

ESP32-based Open AI Voice chat device (similar ChatGPT). Recording questions with a microphone, transcribing via Deepgram STT, then sent to Open AI. Response is played with AI voices on speaker. Supporting ongoing dialogues with saved history for follow-up questions. User defined "system prompts" for own "personalities" and dedicated use cases.

audio deepgram deepgram-stt esp32 i2s i2s-audio i2s-microphone inmp441 max98357 open-ai-4 openai-api-chatbot openai-chatgpt openai-tts recording sd-card speechtotext stt texttospeech touch-pins tts

Last synced: 01 Aug 2025

https://github.com/deepgram-starters/django-transcription

Get started using Deepgram's PreRecorded Transcription with this Django demo app

asr deepgram speech-to-text stt transcription

Last synced: 21 Jan 2026

https://github.com/yousufkalim/deepgram-transcription-react

Deepgram - Automated Speech Recognition (ASR) - A simple user interface to test deepgram integration on server side.

asr deepgram transcription

Last synced: 26 Jun 2025

https://github.com/deepgram-starters/django-voice-agent

Get started using Deepgram's Voice Agent with this Django demo app

agent-api deepgram live real-time speech-to-speech voice-agent websocket

Last synced: 21 Jan 2026

https://github.com/jemisgoti/simli-flutter-client

Simli Client is a Flutter package for integrating with the Simli API, enabling live, low-latency avatars using WebRTC, ideal for virtual assistants and bots.

agents ai artificia assitant avatar deepgram simli video-streaming virtual voice-assistant

Last synced: 19 Feb 2026

https://github.com/spandan114/ai-realtime-voice-agent

A Python-based real-time voice-to-voice conversation system that lets you have natural conversations with very low latency, plug & play multiple llm based on your requirement.

conversational-ai deepgram groq llm openai realtime-voce-agent speach-to-speach streaming stt tts voiceagent

Last synced: 12 Jun 2025

https://github.com/oolunar/harmonyinsilence

Harmony in Silence: A Speech-to-Text Empowerment Initiative for the Hard of Hearing Community.

accessibility deepgram discord speech-to-text

Last synced: 21 Mar 2025

https://github.com/navjotdhanawat/py-ai-voice-agent

PipeCat Voice Agent is an AI-powered voice communication system that enables intelligent, real-time phone conversations through WebSocket connections. It combines multiple technologies including speech recognition (Deepgram), natural language processing (GPT-4), tts (Cartesia), and Telephony (Plivo) to create seamless voice inte

agent ai deepgram openai pipecat-ai plivo voice

Last synced: 15 Mar 2026

https://github.com/deepgram/cli

Official Deepgram CLI — speech-to-text, text-to-speech, and audio intelligence from your terminal

audio-intelligence cli deepgram developer-tools mcp python speech-to-text stt text-to-speech transcription tts voice-ai

Last synced: 08 May 2026

https://github.com/3choff/dictate

An Electron-based desktop dictation app for Windows with multiple speech-to-text providers, voice commands, and grammar correction

deepgram dictation electron-app groq transcription whisper

Last synced: 05 Oct 2025

https://github.com/trustlelab/siteware-backend-v2

Siteware Backend - German Voice AI Agent provider - Deepgram + Twilio + Elevenlabs + OpenAI + Pinecone

deepgram elevenlabs embedding-models function-calling javascript openai pinecone prompt-engineering twilio-api typescript vector-database

Last synced: 10 Apr 2025

https://github.com/nickytonline/deepgram-speech-to-text-stream

Bekah Hawrot Weigel joins Nick to show how you can transcribe text using Deepgram's Node.js SDK. They go through the demo code all the way to building out an app with Express that allows you to submit a URL for transcription.

deepgram nodejs speech-to-text

Last synced: 17 May 2026

https://github.com/hipcall/hipcall_deepgram

Unofficial Deepgram API Wrapper written in Elixir.

deepgram

Last synced: 22 Feb 2026

https://github.com/deepgram-starters/node-live-text-to-speech

Get started using Deepgram's Live Text-to-Speech with this Node demo app

deepgram live real-time text-to-speech tts websockets

Last synced: 21 Jan 2026

https://github.com/0xichikawa/youtube-video-analyzer

A sophisticated Node.js application that analyzes YouTube videos for legal compliance. It transcribes the audio content of the videos using the Deepgram API and then compares it against predefined legal rules using the GPT-4 language model.

deepgram gpt4 nodejs openai youtube

Last synced: 13 May 2025

https://github.com/lukeocodes/supported-formats

A package to test if your media file is supported for transcription.

ai audio deepgram javascript node srt subtitles transcribe transcript typescript video webvtt

Last synced: 09 Apr 2026

https://github.com/hosuaby/transcriptionist

Tool to transcribe videos using AI.

ai captions deepgram transcribe video-processing

Last synced: 16 Feb 2026

https://github.com/michaelheckmann/media-processor

Transform, transcribe and anonymize your audio and video files.

deepgram electron ffmpeg

Last synced: 03 Feb 2026

https://github.com/dotaadarsh/streamlit-projects

Building projects with Streamlit

api deepgram openai projects python streamlit

Last synced: 14 Apr 2025

https://github.com/gateremark/nanaai

(In Development) An AI-powered SaaS web application that enables users to interact with PDFs, research papers and online blogs using voice commands enhancing user’s understanding and engagement with the content.

aws-s3 deepgram elevens-lab langchain neondb nextjs openai tailwindcss

Last synced: 12 Apr 2026

https://github.com/deepgram-starters/go-live-transcription

Get started using Deepgram's Live Transcription with this Go demo app

asr deepgram live real-time speech-to-text stt transcription websocket

Last synced: 21 Jan 2026

https://github.com/smartdev00/touch-base

Record the voice call and extract the name, summary, and follow-up date. Then, save this information to Firebase.

airtable deepgram firebase googleapi googleauth googlecloud googlesheets nextjs openai speech-recognition speech-to-text tailwindcss text-processing voicerecognition

Last synced: 04 Apr 2026

https://github.com/spark-engine-ai/alice

A voice AI named ALICE (Audio Language Interface and Communication Engine) which uses Deepgram, Groq and Neets APIs

ai audio deepgram gpt groq javascript llm node react voice voice-ai

Last synced: 30 Mar 2026

https://github.com/osnux/nio-voice-agent-sdk

🚀 Build production-ready voice agents easily with the Nio Voice Agent SDK, your self-hosted alternative to costly enterprise solutions.

agpl ai-agent anthropic claude deepgram llm openai sdk self-hosted speech-to-text text-to-speech typescript voice-agent voice-ai

Last synced: 17 May 2026

https://github.com/chai-dev682/plivo-interview-phone-agent

This project is for managing and conducting automated phone interviews with candidates

deepgram elevenlabs openai plivo python webhook

Last synced: 18 Apr 2026

https://github.com/deepgram-starters/fastapi-transcription

Get started using Deepgram's Speech-to-Text with this FastAPI demo app

asr deepgram demo fastapi pre-recorded python quickstart speech-to-text stt transcription

Last synced: 11 Feb 2026

https://github.com/deepgram-starters/django-text-intelligence

Get started using Deepgram's Text Intelligence with this Django demo app

deepgram demo django natural-language-processing nlp python quickstart text-analysis text-intelligence

Last synced: 11 Feb 2026

https://github.com/deepgram-starters/fastapi-text-to-speech

Get started using Deepgram's Text-to-Speech with this FastAPI demo app

deepgram demo fastapi python quickstart speech-synthesis text-to-speech tts

Last synced: 11 Feb 2026

https://github.com/saarthshah/youtube-stock-analyzer

Instantly analyze any youtube video for stock tips! 📈💎🙌🦍🤝💪

deepgram finance openai stocks youtube

Last synced: 14 Apr 2026

https://github.com/deepgram-starters/django-text-to-speech

Get started using Deepgram's Transcription with this Django demo app

deepgram demo django python quickstart speech-synthesis text-to-speech tts

Last synced: 11 Feb 2026

https://github.com/ark018/multi-voice-sdk

A universal Text-to-Speech (TTS) SDK . Easily generate and manage audio content with a unified API.

deepgram gemini npm-package openai tts tts-api

Last synced: 11 May 2026

https://github.com/deepgram-starters/php-transcription

Get started using Deepgram's Transcription with this PHP demo app

asr deepgram speech-to-text stt transcription

Last synced: 21 Jan 2026

https://github.com/arnu515/speakcaptcha

SpeakCaptcha makes users speak out the captcha to complete it.

deepgram nodejs

Last synced: 09 May 2026

https://github.com/abdnh/anki-asr

Anki add-on for speech recognition

anki anki-addon deepgram speech-recognition whisper

Last synced: 27 Jan 2026

https://github.com/deepgram-starters/flask-live-text-to-speech

Get started using Deepgram's Live Text-to-Speech with this Flask demo app

deepgram live real-time text-to-speech tts websockets

Last synced: 21 Jan 2026

https://github.com/deepgram-starters/csharp-transcription

Get started using Deepgram's Transcription with this C# demo app

asr deepgram speech-to-text stt transcription

Last synced: 21 Jan 2026

https://github.com/gabrielelanzafamee/assistly

Modern customer service platform combining AI capabilities with multi-channel communication. Built with NestJS, Angular, OpenAI, and Twilio.

ai angular assistant-chat-bots deepgram elevenlabs nestjs openai twilio typescript

Last synced: 27 Apr 2026

https://github.com/deepgram/examples

Deepgram SDK integrations with popular platforms, frameworks, and ecosystems maintained by the DX team

ai audio-intelligence deepgram examples python sdk speech-to-text text-to-speech typescript voice voice-agents

Last synced: 24 Apr 2026

https://github.com/mrkkvnsndvl/kalma-copilot-extension

Kalma Copilot is a Chrome extension that provides real-time AI-powered assistance during online job interviews on platforms like Google Meet, Zoom, and Microsoft Teams. It offers features such as real-time audio capture, interview setup, and a draggable/minimizable interface to help users navigate their virtual interviews with confidence.

deepgram openrouter react-typescript shadcn-ui tailwindcss vite wxt zustand

Last synced: 03 May 2026

https://github.com/digispect-intel/business_voice_agent_backend

A voice-enabled AI assistant backend for a business website. This backend powers business_voice_agent_frontend, providing real-time voice interaction capabilities using Restack AI.

deepgram elevenlabs fasthtml livekit openai restack

Last synced: 21 Apr 2026

https://github.com/aixerum/ai-voice-assistant

Transform your digital interactions with your own AI Voice Assistant🎙️🤖 This project is an advanced AI Voice Assistant that integrates Text-to-Speech (TTS) and Speech-to-Text (STT) capabilities, allowing users to communicate directly with the agent and receive vocal responses.

calendar deepgram email groq knowledgebase speech-to-text text-to-speech vocal

Last synced: 25 Feb 2025

https://github.com/deepgram-starters/flask-voice-agent

Get started using Deepgram's Voice Agent with this Flask demo app

agent-api deepgram live real-time speech-to-speech voice-agent websocket

Last synced: 21 Jan 2026

https://github.com/harsha-yuvaraj/iris-voice-ai

A voice-to-voice conversational AI built with Django, Deepgram, OpenAI, and Twilio—designed with smart time-wasting capabilities. Live now! Call & Chat at +1 956 952 7270!

ai-voice-assistant amazon-web-services asynchronous-programming deepgram django django-channels docker javascript openai python redis speech-to-text text-to-speech twilio-voice voice-chat websockets

Last synced: 10 Apr 2026

https://github.com/ryanlevee/medication-reminder-system

Voice-driven, Node.js-based medication reminder system utilizing real-time communication technologies, along with Text-to-Speech (TTS), Speech-to-Text (STT), and a Large Language Model (LLM).

conversational-ai deepgram elevenlabs expressjs firebase firebase-realtime-db google-gemini javascript jest llm ngrok nodejs real-time-communication rest-api speech-recognition speech-to-text text-to-speech twilio voice-applications websockets

Last synced: 09 Apr 2026

https://github.com/sumit03guha/speech-to-speech-realtime-translation

This project provides a real-time speech-to-speech translation system using Deepgram for speech recognition, LangChain for language translation, and ElevenLabs for voice synthesis. All without using Openai's Realtime model.

deepgram elevenlabs gpt-4o langchain langchain-python llm openai realtime-streaming speech-to-speech speech-to-text

Last synced: 06 May 2026

https://github.com/403errors/echotasks

Manage your tasks entirely through voice commands. Fast, intuitive, and powered by AI

ai deepgram firebase gpt-4o-mini to-do

Last synced: 15 May 2026

https://github.com/prakharbhardwaj/twilio-deepgram-voice-assistant

Twilio Media Streams Integration with Deepgram’s Voice Agent API

ai claude deepgram nodejs openai twilio voice-assistant

Last synced: 06 Apr 2026

https://github.com/mohameddmansurr/voice-agent-rag-demo

Real-time Voice AI Agent featuring a modular RAG pipeline. Built with LiveKit, Deepgram (STT), Groq Llama 3.1 (LLM), and Cartesia (TTS). Features sub-second latency, local vector search (FAISS), and interruptibility.

cartesia deepgram groq livekit llama-3 nextjs python rag real-time-ai voice-agent

Last synced: 15 May 2026

https://github.com/brandonroberts/deepgram-appwrite-transcribe

Media Transcript Archive Application with Deepgram and Appwrite

appwrite deepgram opensource

Last synced: 28 Apr 2026

https://github.com/huzaifa-fullstack/eduvox-ai

EduVox AI is an AI-powered educational voice companion that delivers real-time tutoring across subjects with GPT-4, voice synthesis, speech recognition, secure auth, and a modern Next.js UI.

clerk deepgram education elevenlabs javascript lottie nextjs openai radix-ui react sentry superbase svix tailwind-css typescript vapi vercel voice-ai webhooks zod

Last synced: 07 Apr 2026