https://github.com/fikriaf/easy-recognity
Easy Recognity is a real-time, browser-based speech recognition app using combination JS and Python
https://github.com/fikriaf/easy-recognity
python selenium speech-recognition voice-to-text web-speech-api
Last synced: 6 months ago
JSON representation
Easy Recognity is a real-time, browser-based speech recognition app using combination JS and Python
- Host: GitHub
- URL: https://github.com/fikriaf/easy-recognity
- Owner: fikriaf
- License: apache-2.0
- Created: 2025-04-05T00:18:33.000Z (6 months ago)
- Default Branch: main
- Last Pushed: 2025-04-05T00:35:48.000Z (6 months ago)
- Last Synced: 2025-04-09T19:18:23.544Z (6 months ago)
- Topics: python, selenium, speech-recognition, voice-to-text, web-speech-api
- Language: HTML
- Homepage:
- Size: 10.6 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Easy Recognity 🎙️
Easy Recognity is a real-time, browser-based speech recognition app. It leverages the built-in Web Speech API for speech-to-text and uses Python (with Selenium) for automation.
## ✨ Features
- Live Speech-to-Text via browser
- Web automation using Python + Selenium
- Auto-stop when speech is detected
- Clean and responsive user interface## 📁 Project Structure
- `recognity.html` — Web interface using HTML + JavaScript (Web Speech API)
- `recognity.py` — Python automation script to simulate user interaction using Selenium
- `webdriver/` — (Optional) Directory for WebDriver binaries (if not using webdriver-manager)## 🚀 How to Run
1. Clone the Repository
```
git clone https://github.com/fikriaf/easy-recognity.git
cd easy-recognity
```2. Install Python Dependencies
```
pip install selenium webdriver-manager
```3. Run the Script
```
python recognity.py
```This script will:
- Launch Chrome in headless mode
- Open `recognity.html`
- Simulate clicking the "Start" button to start speech recognition
- Display the transcribed speech in terminal output## 📋 Sample Output
```
Listening...
Hello world this is a test
```## ⚙️ Notes
- Chrome is required (with Web Speech API support)
- The script uses `--use-fake-ui-for-media-stream` and `--use-fake-device-for-media-stream` to simulate mic access
- The function `SpeechRecognitionModel()` can be integrated with other automation or voice-control systems## 🔧 Compatibility
| Component | Status |
|------------------------|---------------|
| Chrome (WebSpeech API) | ✅ Supported |
| Firefox | ❌ Not supported |
| Linux / macOS / Windows | ✅ |
| Python 3.7+ | ✅ |## 👤 Author
Made with ❤️ by [fikriaf](https://github.com/fikriaf)