Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/smolsoftboi/jack-o-gpt

An interactive Jack-O'-Lantern assistant that captures voice input, processes it through OpenAI's GPT-4, and responds with synthesized voice output. Built with Node.js, OpenAI API, and Eleven Labs Text-to-Speech.
https://github.com/smolsoftboi/jack-o-gpt

Last synced: about 15 hours ago
JSON representation

Host: GitHub
URL: https://github.com/smolsoftboi/jack-o-gpt
Owner: SmolSoftBoi
Created: 2023-10-26T20:49:26.000Z (about 1 year ago)
Default Branch: main
Last Pushed: 2024-09-06T21:25:40.000Z (2 months ago)
Last Synced: 2024-09-07T00:18:45.796Z (2 months ago)
Language: TypeScript
Size: 308 KB
Stars: 0
Watchers: 1
Forks: 1
Open Issues: 1
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml

Awesome Lists containing this project

README

# Jack-O'-Lantern: A Voice-Activated Assistant for Halloween

## Overview

This project is a voice-activated assistant designed to entertain and engage audiences during Halloween. It uses OpenAI's GPT-4 model to generate responses and Eleven Labs' Text-to-Speech API for voice output. The assistant can sing songs, tell jokes, and narrate classic Halloween tales.

## Table of Contents

- [Overview](#overview)
- [Installation](#installation)
- [Usage](#usage)
- [Environment Variables](#environment-variables)
- [Functionality](#functionality)
- [Error Handling](#error-handling)
- [Contributing](#contributing)

## Installation

1. Clone the repository.
2. Run `yarn install` to install the required packages.

## Usage

1. Set up your environment variables (see below).
2. Run `yarn compile` to compile the application.
3. Run `yarn start` to start the application.

## Environment Variables

- `OPENAI_API_KEY`: Your OpenAI API key.
- `ELEVENLABS_API_KEY`: Your Eleven Labs API key.
- `MAX_RETRIES`: Maximum number of retries for API calls (optional).

## Functionality

- **Voice Capture**: Captures voice input and saves it as an audio file.
- **Transcription**: Transcribes the voice input to text using OpenAI's Whisper model.
- **Assistant Interaction**: Processes the transcribed text and generates a response.
- **Voice Output**: Converts the generated text to voice using Eleven Labs' Text-to-Speech API.
- **Audio Playback**: Plays the generated voice output.

## Error Handling

The application has robust error handling to deal with API failures, missing environment variables, and other issues.

## Contributing

Feel free to fork the repository and submit pull requests for any improvements or features you'd like to add.