Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/fschuindt/simple_chatgpt_voice_prompt
Speak and listen to the ChatGPT.
https://github.com/fschuindt/simple_chatgpt_voice_prompt
Last synced: about 1 month ago
JSON representation
Speak and listen to the ChatGPT.
- Host: GitHub
- URL: https://github.com/fschuindt/simple_chatgpt_voice_prompt
- Owner: fschuindt
- Created: 2023-08-05T21:45:21.000Z (over 1 year ago)
- Default Branch: master
- Last Pushed: 2023-08-05T21:45:41.000Z (over 1 year ago)
- Last Synced: 2024-11-21T19:07:10.831Z (about 2 months ago)
- Language: Shell
- Size: 1.95 KB
- Stars: 2
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Simple ChatGPT Voice Prompt
A simple example on how to use 3rd party APIs to prompt ChatGPT with voice. Responses will also be given in generative AI voice by Elevenlabs.
Currently, context is being dropped every new prompt.
It uses the following web APIs:
- OpenAI Whisper
- OpenAI ChatGPT
- [Elevenlabs.io](https://elevenlabs.io/)## Installation
Install the following dependencies.
- [parecord](https://command-not-found.com/parecord)
- [FFmpeg](https://ffmpeg.org/)
- [Whisper CLI](https://github.com/vatsalaggarwal/whisper-cli)
- [SGPT](https://github.com/tbckr/sgpt)
- [curl](https://curl.se/)For Whisper CLI and SGPT, make sure you have also configured their respective API keys.
For the Elevenlabs API key, just export it like so:
```
export ELEVENLABS_API_KEY=your_key_here
```
## Usage```
./simple_chatgpt_voice_prompt
```Start speaking, then enter `Ctrl + C` when done recording your message. Wait for the voice response.
All audio files are kept within the `./recordings` folder. `.wav` files are the original prompt, `.mp3` files are the original prompt converted into MP3 for lower file sizes and, `.mpeg` files are the generative voice responses.