An open API service indexing awesome lists of open source software.

https://github.com/shaheennabi/multi-lingual-ai-assistant-with-gtts-and-gemini-pro

๐ŸŽ‡ Multi-lingual AI Assistant with gTTS & Gemini Pro ๐ŸŽ† An end-to-end AI assistant using gTTS for multi-lingual text-to-speech and Gemini Pro API for smart responses. ๐ŸŒ๐Ÿ’ฌ Experience seamless voice interaction in various languages with continuous updates and improvements! ๐Ÿš€โœจ
https://github.com/shaheennabi/multi-lingual-ai-assistant-with-gtts-and-gemini-pro

ai assistant end-to-end-project google-generative-ai gtts multilingual speech-recognition streamlit

Last synced: about 1 month ago
JSON representation

๐ŸŽ‡ Multi-lingual AI Assistant with gTTS & Gemini Pro ๐ŸŽ† An end-to-end AI assistant using gTTS for multi-lingual text-to-speech and Gemini Pro API for smart responses. ๐ŸŒ๐Ÿ’ฌ Experience seamless voice interaction in various languages with continuous updates and improvements! ๐Ÿš€โœจ

Awesome Lists containing this project

README

        

# ๐Ÿš€ **Multi-lingual AI Assistant with gTTS and Gemini Pro** ๐Ÿค–๐ŸŒ
* Caution: This is my mini_project here

Welcome to the **Multi-lingual AI Assistant**โ€”the future of voice-driven interaction powered by **Gemini Pro** and **gTTS**! This AI assistant brings the power of Googleโ€™s cutting-edge models to your fingertips, enabling **seamless, real-time voice interactions** across multiple languages. Speak your mind, and let the AI do the rest! ๐ŸŒŸ

Whether you want to ask a question, get a recommendation, or just chat, this assistant is ready to assist you in **multiple languages**. It takes **voice input**, processes it using **Gemini Pro**, and responds with **text-to-speech** using **gTTS**. ๐ŸŽงโœจ Plus, you can **download the speech output** for offline access and share it anytime!

This isn't just a simple assistantโ€”it's an experience! ๐Ÿ’ฅ๐Ÿ’ฅ

---

## ๐Ÿšจ **Key Features** ๐Ÿšจ

- ๐ŸŒ **Multi-Language Support**: Communicate in **multiple languages** with Gemini Proโ€™s robust capabilitiesโ€”whether you're in English, Spanish, French, or many others! The assistant speaks your language. ๐Ÿ’ฌ๐ŸŒ
- ๐ŸŽค **Voice Input**: No typing needed! Use the microphone to speak to your assistant, and it will convert your speech into text using **Speech Recognition**. ๐Ÿ—ฃ๏ธ๐ŸŽ™๏ธ
- ๐Ÿ”„ **Text-to-Speech with gTTS**: The assistant converts its generated responses back into speech using the **Google Text-to-Speech** (gTTS) API. Hear the assistantโ€™s voice in your preferred language. ๐ŸŽง๐Ÿ”Š
- ๐Ÿ”ฅ **Downloadable Speech Output**: After interacting with the assistant, get your generated speech as an **audio file** for offline use! ๐Ÿ’พ๐Ÿ“ฒ
- โœจ **Streamlit UI**: A stunning, **easy-to-use web interface** built with **Streamlit** to bring everything together in a beautiful package. Interact with the assistant effortlessly. ๐ŸŽจ๐Ÿ–ฅ๏ธ

---

## ๐Ÿ› ๏ธ **Installation Guide** ๐Ÿ› ๏ธ

### Step 1: Create Your Conda Environment

Let's get your environment set up and ready to go! Open your terminal and run:

```bash
conda create --name multilingual-assistant python=3.9
```
Activate the env:
```bash
conda activate multilingual-assistant
```

### Step 2: Install Dependencies

Now, install all the required dependencies using the following command:

```bash
pip install -r requirements.txt
```

Make sure youโ€™ve got everything you need to make the magic happen!

**Dependencies**:
- **gTTS** (Google Text-to-Speech): Converts the assistantโ€™s responses into speech.
- **Gemini Pro**: The language model behind all the intelligence.
- **Streamlit**: For building the stunning web interface.
- **Speech Recognition**: To convert your voice into text.

---

## ๐Ÿš€ **How to Use** ๐Ÿš€

### Step 1: Set Up API Keys for Gemini Pro

To interact with **Gemini Pro**, you'll need to set up API access. Head to **Google Cloud**, create a project, and enable **Gemini Pro**. Store your API key securely and configure it in your environment.

### Step 2: Launch the Streamlit Application

Now, it's time to see the magic in action. Run the following command:

```bash
streamlit run app.py
```

This will start the Streamlit app and open the web interface in your browser.

### Step 3: Interact with the Assistant

1. **Record Your Voice**: Click the **Record** button to start speaking.
2. **AI Processing**: The assistant will listen to your speech, convert it to text, and send it to **Gemini Pro** for processing.
3. **Listen to the Response**: The assistant will convert the AI-generated text back into speech using **gTTS** and play it back to you.
4. **Download the Speech**: After hearing the assistantโ€™s response, click the download button to save the speech for offline use.

---

## ๐Ÿ“‚ **Project Structure**

Hereโ€™s a look at the project structure:

```
Multi-lingual-AI-Assistant-with-gTTS-and-Gemini-Pro/
โ”‚
โ”œโ”€โ”€ app.py # Streamlit UI for interaction
โ”œโ”€โ”€ requirements.txt # All the necessary dependencies
โ”œโ”€โ”€ src
|-----helper.py
โ””โ”€โ”€ README.md # Project documentation (Youโ€™re looking at it right now!)
```

---

## ๐Ÿ’ก **Technologies Used** ๐Ÿ’ก

- **Gemini Pro**: Googleโ€™s state-of-the-art language model for intelligent AI responses.
- **gTTS (Google Text-to-Speech)**: Converting text to natural-sounding speech using Googleโ€™s powerful TTS engine.
- **Streamlit**: A super-fast, easy-to-use library for creating web apps with a focus on machine learning.
- **Speech Recognition**: Capturing voice input and converting it to text.
- **Python 3.9**: The Python version keeping everything running smoothly.

---

## ๐Ÿ“œ **License** ๐Ÿ“œ

This project is licensed under the **MIT License**. Check the [LICENSE](LICENSE) file for more details.

---

## ๐Ÿ™ **Acknowledgments** ๐Ÿ™

A big thank you to the following technologies that made this project possible:

- **Google Gemini Pro**
- **gTTS**
- **Streamlit**
- **Speech Recognition**

---

## ๐ŸŽ‰ **Letโ€™s Talk!** ๐ŸŽ‰

Ready to try it out? Clone the repository, install the dependencies, and fire up your assistant! ๐Ÿš€๐Ÿ’ฌ Letโ€™s create something amazing together. โœจ

---

## ๐ŸŒŸ **Stars are Always Welcome!** ๐ŸŒŸ

If you love the project, โญ **star** โญ it and show some love! Also, feel free to contribute and make this assistant even smarter. ๐Ÿ’ก