Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/imrany/insight

A voice artificial intelligence assistant
https://github.com/imrany/insight

artificial-intelligence gemini-api webspeech-api

Last synced: about 5 hours ago
JSON representation

A voice artificial intelligence assistant

Awesome Lists containing this project

README

        

### Insight
Insight is an AI chatbot built in **Nextjs** that allows users to enter prompts using speech rather than text.
It uses **Gemini** under the hood to give back responses with the user can listen to or download a an audio file fo later.

### Why did we chose voice input only rather than both text and voice input
We've noticed a trend in AI chatbot like **ChatGPT** where users interacts with it mainly through text input, this can
limit people living with low or lack of vision and the elderly. Therefore we chose voice input for **insight**.
It can be used for assistance with general questions like *What's the recipe of chocolate cake...?* and other genral questions.

We've combine both speech and generative ai to create this program.

### Challenge statetement
We choose the open category of the hackathon.
#### Open category 🌟
Feeling adventurous? Build anything that combines two or more modalitiesβ€”go wild with your ideas! Whether it's voice-controlled art generators or AI-powered virtual assistants, the only limit is your imagination.

### Features πŸš€
- Speech recognition or input: Using `Web SpeechRecognition`
- Speech output:
Using `SpeechSynthesisUtterance` we convert text response to speech (Text-to-speech).
Using `MediaRecorder` we record it and store it for user to download.

Why this approach is not compute intensive and cost effective since some processes like text-to-speech is handled on the client side rathen than on the server-side (our side)

- Download audio files: Users would able to store and download audio files of their prompts and responses to later.

- Intelligent: We've integrate insight with **Gemini**, with this integration it can response fast and with nearly
accuracy to the subject matter.

### Demo video
You can view our demo on youtube, [https://youtu.be/zw7Cu6W9YCY](https://youtu.be/zw7Cu6W9YCY)
- Sign in page
sign in page

- Home page
home page

- How to download a chat as audio
How to download

- Short demo video


Your browser does not support the video tag.

### How to setup and run it locally
1. Ensure you have **Nodejs**, if not download here [https://nodejs.org/en/download/package-manager](https://nodejs.org/en/download/package-manager)

2. Clone this repository
```bash
git clone https://github.com/imrany/insight
```

3. Get into the project and open it with any IDE or code editor, is this we've used **vscode**
```bash
cd insight
code .
```

4. Install all project dependencies
```bash
npm install
```

5. Running the project
```bash
npm run dev
```

6. Open browser to [http://localhost:3000](http://localhost:3000)

7. You have to **Create an account** by clicking the `Get started` button
or you can sign in using my credentials
```
email: [email protected]
password: 12345678
```

8. Start by entering a prompt by pressing the `Microphone` button.

9. Once your voice prompt is recorded, you will receive a response which you can play or read by `double tapping` or `expanding` the card.

10. If you need any help, reach out here [[email protected]](mailto:[email protected])

### Live demo 🌏
Visit [https://insight-ai-ecru.vercel.app](https://insight-ai-ecru.vercel.app)