Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/marwan2232004/esma3nyapi
This project focuses on converting spoken Egyptian Arabic into written text and translating English text into Arabic. The architecture is inspired by OpenAI's Whisper model and utilizes a custom Transformer-based implementation.
https://github.com/marwan2232004/esma3nyapi
api-gateway asr-model fastaapi jupyter-notebook machine-learning python3 transformers translation
Last synced: about 18 hours ago
JSON representation
This project focuses on converting spoken Egyptian Arabic into written text and translating English text into Arabic. The architecture is inspired by OpenAI's Whisper model and utilizes a custom Transformer-based implementation.
- Host: GitHub
- URL: https://github.com/marwan2232004/esma3nyapi
- Owner: marwan2232004
- Created: 2024-10-11T18:04:17.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2025-01-20T13:09:58.000Z (10 days ago)
- Last Synced: 2025-01-20T14:24:10.676Z (10 days ago)
- Topics: api-gateway, asr-model, fastaapi, jupyter-notebook, machine-learning, python3, transformers, translation
- Language: Python
- Homepage:
- Size: 85 KB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# 🌟 Esma3ny API 🌟
Welcome to the ASR and Translation API! This project focuses on converting spoken Egyptian Arabic into written text and translating English text into Arabic. The architecture is inspired by OpenAI's Whisper model and utilizes a custom Transformer-based implementation.
Frontend repository: [Esma3ny Frontend Repository](https://github.com/marwan2232004/Esma3ny)
Kaggle notebook: [Esma3ny Kaggle Notebook](https://www.kaggle.com/code/marwan2232004/speech2text-transformer)
## 🚀 Features
- **Automatic Speech Recognition (ASR)**: Converts spoken Egyptian Arabic into written text.
- **Translation**: Translates English text into Arabic using the OpenL Translation API.
- **Frontend**: Built with React Vite.
- **Backend**: Powered by FastAPI.
- **Deployment**: Hosted on Azure App Services and a Virtual Machine.## 📚 Architecture
- **ASR Component**: Inspired by OpenAI's Whisper model, leveraging a custom Transformer-based implementation.
- **Translation Component**: Integrates the OpenL Translation API for automatic language detection and translation from any language into Arabic.## 🛠️ Setup
1. **Clone the repository**:
```bash
git clone https://github.com/marwan2232004/Esma3nyAPI.git
cd Esma3nyAPI
2. **Install dependencies**:
```bash
pip install -r requirements.txt
3. **Run the server**:
```bash
uvicorn main:app --reload
## 📚 Deployment- The API is deployed on Azure App Services and a Virtual Machine. However, deployment is currently suspended due to costs.
## 📄 API Endpoints- **POST /audio2text**: Converts spoken Egyptian Arabic to text.
- **Request**: Audio file
- **Response**: JSON with transcribed text- **POST /translate/en-ar**: Translates English text to Arabic.
- **Request**: JSON with English text
- **Response**: JSON with translated Arabic text
- **POST /translate/auto**: Translates from any language to Arabic.
- **Request**: JSON with text
- **Response**: JSON with translated Arabic text
## ASR Transformer Architecture![Transformer_Architecture_complete drawio (1)](https://github.com/user-attachments/assets/139ad3f1-9ba0-491d-8dba-b749e8fc4e32)