https://github.com/axshatind/xethscribe

XethScribe is an AI-driven web application designed for real-time audio transcription and translation, leveraging advanced models like OpenAI Whisper for speech recognition. It seamlessly processes audio inputs to deliver accurate, timestamped text outputs for various use cases.
https://github.com/axshatind/xethscribe

css html javascript machine-learning openai-whisper react transcription translation vite web-workers xenova-transformers

Last synced: 3 months ago
JSON representation

Host: GitHub
URL: https://github.com/axshatind/xethscribe
Owner: axshatInd
Created: 2024-09-07T09:58:57.000Z (9 months ago)
Default Branch: main
Last Pushed: 2024-09-07T13:43:34.000Z (9 months ago)
Last Synced: 2025-01-20T17:49:14.683Z (5 months ago)
Topics: css, html, javascript, machine-learning, openai-whisper, react, transcription, translation, vite, web-workers, xenova-transformers
Language: JavaScript
Homepage: https://xethscribe.vercel.app
Size: 61.5 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# XethScribe

XethScribe is a lightweight, AI-driven web application that offers real-time audio transcription and translation. It utilizes state-of-the-art models like OpenAI's Whisper for speech recognition and Xenova's NLLB-200 for translations, providing accurate and timestamped text outputs. This tool is ideal for transcribing conversations, speeches, or meetings and translating them efficiently.

## Features

- **Real-time Transcription**: Converts speech to text instantly with accurate timestamps.
- **Automatic Translation**: Supports multilingual translations using advanced AI models.
- **Lightweight Interface**: User-friendly interface for seamless file uploads and transcription playback.
- **On-the-fly Processing**: Handles audio streams and files with fast processing times.
- **Modular Design**: Easily customizable and extendable for additional features or integrations.

## Technologies Used

- **React**: For building the front-end user interface.
- **Tailwind CSS**: For responsive and modern styling.
- **Vite**: For fast bundling and development experience.
- **OpenAI Whisper**: For automatic speech recognition (ASR) in English.
- **Xenova NLLB-200**: For accurate and scalable translations between languages.
- **Web Workers**: For running AI models and transcription tasks in the background without blocking the UI.

## Getting Started

### Prerequisites

- **Node.js** (v16 or higher)
- **npm** (v7 or higher)

### Installation

1. Clone the repository:

```bash
git clone https://github.com/axshatInd/XethScribe.git
cd XethScribe
```

2. Install dependencies:

```bash
npm install
```

3. Start the development server:

```bash
npm run dev
```

4. Open [http://localhost:3000](http://localhost:3000) to view it in the browser.

### Build for Production

```bash
npm run build
```

## Usage

1. Upload an audio file or use a live audio stream.
2. The app will automatically transcribe the audio and display the results in real-time.
3. For translation, the output can be selected in different languages using the available options.

---

Feel free to contribute or submit any issues!

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/axshatind/xethscribe

Awesome Lists containing this project

README