https://github.com/taylor-eos/speech-input
Python script for transcribing speech with Whisper
- Host: GitHub
- URL: https://github.com/taylor-eos/speech-input
- Owner: Taylor-eOS
- Created: 2025-01-15T10:54:04.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-02-14T14:16:07.000Z (over 1 year ago)
- Last Synced: 2025-02-14T15:28:02.821Z (over 1 year ago)
- Language: Python
- Homepage:
- Size: 36.1 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
**A lightweight GUI tool for recording and transcribing speech to text on Linux using the built-in microphone**, designed to provide functionality similar to mobile device speech input tools.

This tool uses the `arecord` command for audio recording, which is compatible with most Linux systems and does not depend on specific hardware. It leverages Whisper's robust transcription capabilities, including support for multiple languages.
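As a rough sketch of the recording step, the snippet below builds an `arecord` command line and runs it with `subprocess`. The helper names, the 16 kHz mono format, and the fixed duration are assumptions made for illustration; the actual `record.py` may be organized differently.

```python
import subprocess


def build_arecord_command(output_path, seconds=5, rate=16000):
    """Return an arecord argument list (hypothetical helper, not taken from record.py)."""
    return [
        "arecord",
        "-q",                # suppress status messages
        "-f", "S16_LE",      # 16-bit little-endian samples
        "-r", str(rate),     # sample rate in Hz
        "-c", "1",           # mono
        "-t", "wav",         # write a WAV file
        "-d", str(seconds),  # stop after this many seconds
        output_path,
    ]


def record(output_path, seconds=5):
    """Record from the default capture device; raises CalledProcessError if arecord fails."""
    subprocess.run(build_arecord_command(output_path, seconds), check=True)
```

Calling `record("clip.wav", seconds=10)` would write a ten-second clip from the default input device, provided `arecord` (part of ALSA utilities) is installed.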
**Features**:
- Simple recording controls integrated into the GUI.
- Foreign language transcription: Enter the desired language code (e.g., `en` for English) before recording.
- Real-time status updates in the interface, so you know when to start speaking or when transcription is in progress.
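To illustrate how a language code might be passed through to Whisper, here is a minimal sketch. It assumes the `openai-whisper` package; the function names and the `base` model size are hypothetical choices, not confirmed details of this tool.

```python
def normalize_language(code):
    """Return a lowercase language code, or None so Whisper auto-detects the language."""
    code = (code or "").strip().lower()
    return code or None


def transcribe(wav_path, language_code="", model_name="base"):
    """Transcribe a WAV file (assumes the openai-whisper package is installed)."""
    import whisper  # imported lazily so the helper above works without it

    model = whisper.load_model(model_name)
    result = model.transcribe(wav_path, language=normalize_language(language_code))
    return result["text"].strip()
```

For example, `transcribe("clip.wav", "de")` would request German transcription, while an empty code leaves language detection to Whisper.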
For the best transcription results, speak slowly and clearly, leaving slight pauses between words. Stay close to your computer's microphone, and consider adjusting the microphone input volume; lower settings can sometimes improve clarity and reduce noise.
**Instructions**:
Install the dependencies listed in `requirements.txt` and run `record.py` to start the tool.