Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mohahmadraza/voicetovoicechatbot
https://github.com/mohahmadraza/voicetovoicechatbot
Last synced: 5 days ago
JSON representation
- Host: GitHub
- URL: https://github.com/mohahmadraza/voicetovoicechatbot
- Owner: mohAhmadRaza
- Created: 2024-08-20T14:42:12.000Z (3 months ago)
- Default Branch: main
- Last Pushed: 2024-08-23T06:20:19.000Z (3 months ago)
- Last Synced: 2024-08-23T07:33:31.408Z (3 months ago)
- Language: Python
- Size: 10.7 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
```markdown
# Whisper and Groq Audio Processing ApplicationThis project is an audio processing application that leverages OpenAI's Whisper for transcription and Groq's AI for generating responses. The application converts the response into speech and allows users to download the audio file.
## Features
- **Transcription**: Converts audio files into text using OpenAI's Whisper model.
- **AI Response**: Generates AI-based responses using Groq's API.
- **Text-to-Speech**: Converts the AI response into speech using Google's Text-to-Speech (gTTS).
- **Downloadable Audio**: Provides a download option for the generated audio response.## Prerequisites
Ensure you have the following installed:
- Python 3.8 or higher
- pip (Python package installer)## Installation
1. **Clone the Repository**:
```bash
git clone https://github.com/your-username/your-repo-name.git
cd your-repo-name
```2. **Install Dependencies**:
Install the required Python packages by running:
```bash
pip install -r requirements.txt
```3. **Set Up Environment Variables**:
Create a `.env` file in the root of the project directory and add your Groq API key:
```plaintext
GROQ_API_KEY=your_groq_api_key
```**Note**: The `.env` file should not be committed to version control for security reasons.
## Usage
1. **Running the Application**:
Start the application by running the following command:
```bash
python app.py
```2. **Using the Interface**:
- Upload an audio file using the interface.
- The application will transcribe the audio, generate a response, convert the response to speech, and provide a downloadable audio file.## Example Workflow
1. Upload an audio file in the interface.
2. The transcription of the audio is displayed as text.
3. A response is generated based on the transcribed text.
4. The response is converted to audio, which can be played directly or downloaded.## Contributing
If you'd like to contribute to this project, please follow these steps:
1. Fork the repository.
2. Create a new branch (`git checkout -b feature-branch`).
3. Make your changes and commit them (`git commit -m 'Add new feature'`).
4. Push to the branch (`git push origin feature-branch`).
5. Open a pull request.## License
This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
## Contact
For any questions or suggestions, feel free to reach out to:
- **GitHub**: [your-username](https://github.com/mohAhmadRaza)
```