https://github.com/gregsadetsky/sagittarius

A GPT-4/Gemini Voice/Video Exploration Tool
https://github.com/gregsadetsky/sagittarius

Last synced: 8 months ago
JSON representation

A GPT-4/Gemini Voice/Video Exploration Tool

Host: GitHub
URL: https://github.com/gregsadetsky/sagittarius
Owner: gregsadetsky
Created: 2023-12-11T02:01:02.000Z (almost 2 years ago)
Default Branch: main
Last Pushed: 2024-01-26T16:54:26.000Z (almost 2 years ago)
Last Synced: 2025-03-28T07:07:45.634Z (8 months ago)
Language: TypeScript
Homepage: http://sagittarius.greg.technology/
Size: 649 KB
Stars: 687
Watchers: 7
Forks: 94
Open Issues: 11
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml

Awesome Lists containing this project

awesome-ChatGPT-repositories - sagittarius - A GPT-4/Gemini Voice/Video Exploration Tool (Chatbots)

README

          # Sagittarius

What is this? A GPT-4/Gemini Voice/Video Exploration Tool!

Do you have an API key from either OpenAI or Gemini? [You can use the tool online](https://sagittarius.greg.technology/)! No need to install anything.

See below for more context:

- [Original "A Remake of the Google Gemini Fake Demo, Except Using GPT-4 and It's Real" video](https://www.youtube.com/watch?v=__nL7Vc0OCg)

- [Heads-to-heads comparison of Gemini Pro and GPT-4](https://www.youtube.com/watch?v=1RrkRA7wuoE)

- [Sagittarius supports multiple voices, can be used by anyone on the internet, and is much faster! Yay](https://www.youtube.com/watch?v=4i0Kc8Za5WI)

## how to build

- clone this repo, cd into it

- duplicate `.env.example` and name the copy `.env`

- fill out the `VITE_OPENAI_KEY=` value with your OpenAI api key. you must have access to the `gpt-4-vision-preview` model

  - you can also try out the Gemini API if you have a key -- fill out `VITE_GEMINI_KEY` in the same `.env`

- then, run:

- `npm install`

- `npm run dev`

- the demo will be running at [http://localhost:5173](http://localhost:5173)

note: the in-browser speech recognition works best in Google Chrome

## TODO

- [x] allow input of API keys as `` on the page

- [x] deploy frontend to site i.e. sagittarius.greg.technology via vite+github actions

- [x] enable streaming output..!

- [x] make new video with 3) streaming output / comparison

- [x] enable selection of dictation language

- [ ] make new video with 1) uses of repo in the wild / forks 2) UI improvements

- [ ] add allcontributors bot

- [ ] add dependabot

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/gregsadetsky/sagittarius

Awesome Lists containing this project

README