https://github.com/olney1/chatgpt-openai-smart-speaker
This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice and vision-driven conversations, with additional web search capabilities via OpenAI and Langchain agents.
https://github.com/olney1/chatgpt-openai-smart-speaker
agents ai artificial-intelligence chatgpt gpt-4 langchain langsmith openai smarthome smartspeaker speech-recognition speech-to-text tavily text-to-speech vision vision-and-language webscraping
Last synced: 3 months ago
JSON representation
This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice and vision-driven conversations, with additional web search capabilities via OpenAI and Langchain agents.
- Host: GitHub
- URL: https://github.com/olney1/chatgpt-openai-smart-speaker
- Owner: Olney1
- License: mit
- Created: 2023-01-07T12:21:18.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-11-13T20:14:12.000Z (5 months ago)
- Last Synced: 2025-01-22T05:46:50.558Z (3 months ago)
- Topics: agents, ai, artificial-intelligence, chatgpt, gpt-4, langchain, langsmith, openai, smarthome, smartspeaker, speech-recognition, speech-to-text, tavily, text-to-speech, vision, vision-and-language, webscraping
- Language: Python
- Homepage:
- Size: 145 MB
- Stars: 265
- Watchers: 14
- Forks: 28
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
- awesome-ChatGPT-repositories - ChatGPT-OpenAI-Smart-Speaker - This AI Smart Speaker uses speech recognition and text-to-speech to enable voice-driven conversations and vision capabilities with OpenAI and Agents. The user speaks a prompt into the microphone, and the program sends the prompt to OpenAI to generate a response. The response is then converted to an audio file and played back to the user. (Prompts)