https://github.com/thorwebdev/webcam-audio-description-ai

Capture webcam via browser, generate text descriptions with Google Gemini, generate speech with ElevenLabs

# Webcam Audio Description Generator

Generate audio descriptions for your webcam video: Google Gemini writes the text description and ElevenLabs converts it to speech.

## Setup

- `cp supabase/functions/.env.example supabase/functions/.env`
- Set your [Gemini API key](https://ai.google.dev/gemini-api/docs/api-key) in `supabase/functions/.env`
- Set your [ElevenLabs API key](https://elevenlabs.io/?from=partnersmith6824) in `supabase/functions/.env`
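
Before starting the functions, it can help to confirm that both keys actually made it into the `.env` file. The helper below is a minimal sketch, not part of this repo: `check_env_keys` is a hypothetical function, and the variable names `GEMINI_API_KEY` and `ELEVENLABS_API_KEY` in the usage comment are assumptions. Check `supabase/functions/.env.example` for the names the functions really read.

```bash
#!/usr/bin/env bash
# Hypothetical helper: report keys that are missing or empty in a dotenv-style file.
# Usage: check_env_keys FILE KEY [KEY...]
# Returns 0 if every key has a value, 1 if any is unset, 2 if the file is missing.
check_env_keys() {
  local env_file="$1"
  shift
  local missing=0 key value

  if [ ! -f "$env_file" ]; then
    echo "missing file: $env_file" >&2
    return 2
  fi

  for key in "$@"; do
    # Take the last KEY=... line, keep everything after the first "=",
    # then drop whitespace and surrounding quotes.
    value="$(grep -E "^${key}=" "$env_file" | tail -n 1 | cut -d= -f2- \
      | tr -d '[:space:]' | tr -d "\"'" || true)"
    if [ -z "$value" ]; then
      echo "not set: $key" >&2
      missing=1
    fi
  done
  return "$missing"
}

# Example call (key names are assumptions, see .env.example for the real ones):
#   check_env_keys supabase/functions/.env GEMINI_API_KEY ELEVENLABS_API_KEY
```

The function only inspects the file and never prints the key values, so it is safe to run in a shared terminal.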

## Run locally

```bash
supabase start
supabase functions serve --no-verify-jwt
# In another terminal
python3 -m http.server
```

Open http://localhost:8000/