https://github.com/smolsoftboi/jack-o-gpt
An interactive Jack-O'-Lantern assistant that captures voice input, processes it through OpenAI's GPT-4, and responds with synthesized voice output. Built with Node.js, OpenAI API, and Eleven Labs Text-to-Speech.
- Host: GitHub
- URL: https://github.com/smolsoftboi/jack-o-gpt
- Owner: SmolSoftBoi
- Created: 2023-10-26T20:49:26.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2024-09-06T21:25:40.000Z (2 months ago)
- Last Synced: 2024-09-07T00:18:45.796Z (2 months ago)
- Language: TypeScript
- Size: 308 KB
- Stars: 0
- Watchers: 1
- Forks: 1
- Open Issues: 1
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
# Jack-O'-Lantern: A Voice-Activated Assistant for Halloween
## Overview
This project is a voice-activated assistant designed to entertain and engage audiences during Halloween. It uses OpenAI's GPT-4 model to generate responses and Eleven Labs' Text-to-Speech API for voice output. The assistant can sing songs, tell jokes, and narrate classic Halloween tales.
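
One conversational turn, as described here and itemized under Functionality, can be pictured as a chain of asynchronous stages. The sketch below is only an illustration of that flow: `Stages`, `runTurn`, and the stage names are hypothetical and not taken from the repository, and the real stages would wrap the OpenAI and Eleven Labs APIs.

```typescript
// Hypothetical stage signatures; the real project may structure these differently.
interface Stages {
  transcribe: (audioPath: string) => Promise<string>; // recorded speech -> text
  respond: (prompt: string) => Promise<string>;       // text -> assistant reply
  synthesize: (text: string) => Promise<string>;      // reply -> path of spoken audio
  play: (audioPath: string) => Promise<void>;         // play the spoken audio
}

// Runs one turn: recorded audio in, spoken reply out.
// Returns the reply text so callers can log or display it.
async function runTurn(audioPath: string, stages: Stages): Promise<string> {
  const prompt = await stages.transcribe(audioPath);
  const reply = await stages.respond(prompt);
  const speechPath = await stages.synthesize(reply);
  await stages.play(speechPath);
  return reply;
}
```

Passing the stages in as functions keeps the flow easy to exercise with stubs, without API keys or audio hardware.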
## Table of Contents
- [Overview](#overview)
- [Installation](#installation)
- [Usage](#usage)
- [Environment Variables](#environment-variables)
- [Functionality](#functionality)
- [Error Handling](#error-handling)
- [Contributing](#contributing)
## Installation
1. Clone the repository.
2. Run `yarn install` to install the required packages.
## Usage
1. Set up your environment variables (see below).
2. Run `yarn compile` to compile the application.
3. Run `yarn start` to start the application.
## Environment Variables
- `OPENAI_API_KEY`: Your OpenAI API key.
- `ELEVENLABS_API_KEY`: Your Eleven Labs API key.
- `MAX_RETRIES`: Maximum number of retries for API calls (optional).
## Functionality
- **Voice Capture**: Captures voice input and saves it as an audio file.
- **Transcription**: Transcribes the voice input to text using OpenAI's Whisper model.
- **Assistant Interaction**: Processes the transcribed text and generates a response.
- **Voice Output**: Converts the generated text to voice using Eleven Labs' Text-to-Speech API.
- **Audio Playback**: Plays the generated voice output.
## Error Handling
The application handles API failures, missing environment variables, and other runtime errors. Failed API calls are retried, up to the limit set by `MAX_RETRIES`.
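
As a rough illustration of these two concerns, the sketch below validates the variables listed under Environment Variables and wraps an API call in a bounded retry loop. It is not the project's actual code: `AppConfig`, `loadConfig`, `withRetries`, and the default of 3 retries are all assumptions made for the example.

```typescript
// Hypothetical config shape built from the documented environment variables.
interface AppConfig {
  openAiApiKey: string;
  elevenLabsApiKey: string;
  maxRetries: number;
}

// Assumed default; the README does not state the real one.
const DEFAULT_MAX_RETRIES = 3;

// Reads and validates configuration, failing fast on missing or invalid values.
function loadConfig(env: Record<string, string | undefined>): AppConfig {
  const required = (name: string): string => {
    const value = env[name];
    if (!value) throw new Error(`Missing environment variable: ${name}`);
    return value;
  };
  const openAiApiKey = required("OPENAI_API_KEY");
  const elevenLabsApiKey = required("ELEVENLABS_API_KEY");

  const raw = env["MAX_RETRIES"];
  const maxRetries = raw === undefined ? DEFAULT_MAX_RETRIES : Number.parseInt(raw, 10);
  if (!Number.isInteger(maxRetries) || maxRetries < 0) {
    throw new Error(`MAX_RETRIES must be a non-negative integer, got: ${raw}`);
  }
  return { openAiApiKey, elevenLabsApiKey, maxRetries };
}

// Calls `fn` once, then retries up to `maxRetries` more times if it rejects.
// Rethrows the last error when every attempt fails.
async function withRetries<T>(fn: () => Promise<T>, maxRetries: number): Promise<T> {
  let lastError: unknown;
  for (let attempt = 0; attempt <= maxRetries; attempt++) {
    try {
      return await fn();
    } catch (error) {
      lastError = error;
    }
  }
  throw lastError;
}
```

In a Node.js process this would be called as `loadConfig(process.env)`, and each API request would be passed through `withRetries` with `config.maxRetries`. A production version would likely add a delay between attempts.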
## Contributing
Feel free to fork the repository and submit pull requests for any improvements or features you'd like to add.