https://github.com/tomkam1702/ocr-translator
Real-time game subtitle translator with AI-powered OCR. Context-aware translation for 20+ languages. Free offline models + dirt cheap APIs. Perfect for gaming in foreign languages!
Last synced: about 2 months ago
- Host: GitHub
- URL: https://github.com/tomkam1702/ocr-translator
- Owner: tomkam1702
- License: other
- Created: 2025-05-04T12:09:42.000Z (about 1 year ago)
- Default Branch: main
- Last Pushed: 2026-05-02T15:06:29.000Z (2 months ago)
- Last Synced: 2026-05-02T16:29:32.077Z (2 months ago)
- Topics: deepl, game-translation, games, gemini, google-translate, in-game, language-learning-game, llm, marianmt, ocr, python, real-time-translation, subtitle-translation, subtitles, tesseract, translation, translator
- Language: Python
- Homepage: https://tomkam1702.github.io/OCR-Translator/
- Size: 29.4 MB
- Stars: 43
- Watchers: 2
- Forks: 8
- Open Issues: 7
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
README
# Game-Changing Translator v4
Copyright © 2025-2026 Tomasz Kamiński
**Author of [OHLC Forge](https://github.com/tomkam1702/OHLC-Forge): Professional tool for crypto traders**

## Overview
**Game-Changing Translator** is a state-of-the-art desktop application designed for real-time screen translation. Using powerful AI-driven OCR and context-aware engines, it captures text from any part of your screen (be it a game, a movie, or a document) and translates it instantly into over 100 languages.
Whether you're exploring the world of *The Witcher*, learning a new language through entertainment, or simply need to understand content that can't be copied, GCT v4 provides a seamless, immersive experience with floating overlays that stay on top of your content.
---
## Featured Game Demonstrations
* **The Witcher 3**: Revolutionary AI OCR & natural Polish-to-English translation
* **Kingdom Come: Deliverance II**: Czech-to-English translation
* **Star Wars: The Old Republic**: French-to-English translation
---
## Introducing Version 4: A New Era of Translation
**Version 4** is not just an update; it is a complete architectural redesign. Built with a brand-new engine from the ground up, it offers a smoother, more responsive experience and a state-of-the-art interface tailored for modern gaming.
### Redesigned GUI & Simple Mode
The new interface offers two distinct modes: **Simple** for hassle-free operation and **Custom** for granular control.
> [!TIP]
> **Update v4.0.1 - Mouse Click-Through**: Overlays now automatically become "transparent" to mouse clicks during translation. This allows you to interact with the game underneath without losing focus or accidentally dragging frames.
### Up and running in 3 simple steps:
1. **Set your target language**: Source language is auto-detected in Simple mode.
2. **Enter your API keys**: Mandatory Gemini key and optional DeepL key.
3. **Select areas and start**: Position your Source and Target overlays and hit **Start (~)**.
---
## Features
### FREE Features
* **Gemini AI OCR**: Industry-first AI-powered text recognition that handles stylised fonts and low-contrast backgrounds.
* **Gemini Translation**: Top-quality, context-aware translation in over 100 languages.
* **Sliding Context Window**: Remembers up to 5 previous subtitles to maintain narrative coherence.
* **Cost Monitoring**: Real-time token-level analytics and cost tracking.
* **Two-Tier Caching**: In-memory and file-based caching to save on API costs.
* **Native RTL Support**: Flawless bidirectional rendering for Arabic, Hebrew, Persian, etc.
* **Translation Prompt**: Inject custom instructions to define tone or game-specific context.
* **API Logs**: Comprehensive dual-layer logging for both OCR and Translation.
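
As a rough illustration of two items from the list above, the sliding context window and the two-tier cache could be sketched as follows. This is not GCT's actual code: `ContextWindow`, `TwoTierCache`, and the JSON file format are hypothetical names and choices made for this example. Only the limit of five remembered subtitles and the memory-plus-file layering come from the feature descriptions.

```python
import json
from collections import deque
from pathlib import Path
from typing import Optional


class ContextWindow:
    """Holds the most recent subtitles (hypothetical helper, not GCT's API)."""

    def __init__(self, max_items: int = 5) -> None:
        # A deque with maxlen drops the oldest entry automatically when full.
        self._items = deque(maxlen=max_items)

    def add(self, subtitle: str) -> None:
        self._items.append(subtitle)

    def as_prompt(self) -> str:
        """Join the remembered subtitles, oldest first, for use as context."""
        return "\n".join(self._items)


class TwoTierCache:
    """Tier 1: in-memory dict. Tier 2: JSON file (assumed storage format)."""

    def __init__(self, path: Path) -> None:
        self._path = path
        self._memory = {}

    def _read_file(self) -> dict:
        if not self._path.exists():
            return {}
        return json.loads(self._path.read_text(encoding="utf-8"))

    def get(self, source_text: str) -> Optional[str]:
        # Check the fast tier first, then fall back to the file.
        if source_text in self._memory:
            return self._memory[source_text]
        stored = self._read_file().get(source_text)
        if stored is not None:
            self._memory[source_text] = stored  # promote to the memory tier
        return stored

    def put(self, source_text: str, translation: str) -> None:
        self._memory[source_text] = translation
        entries = self._read_file()
        entries[source_text] = translation
        self._path.write_text(
            json.dumps(entries, ensure_ascii=False), encoding="utf-8"
        )
```

In a design like this, a caller would try `cache.get(text)` before sending an API request and store the result with `cache.put(text, translation)`, so a subtitle that repeats incurs no further API cost.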
### PRO Features
* **DeepL Translation**: Elite precision for Japanese, Chinese, and European scripts with free context subtitles.
* **Find Subtitles**: Automatically scans the screen to detect and lock onto subtitle areas.
* **Target on Source**: Automatically overlays the translation directly onto the original subtitle area.
* **Scan Wider**: Expands the capture area to prevent word truncation and AI hallucinations.
* **OCR Prompt**: Custom instructions for Gemini OCR to filter HUD elements or speaker names.
* **Custom Appearance**: Full control over background and text colours with native pickers.
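
To make the idea behind **Scan Wider** concrete, here is a minimal sketch of growing a capture rectangle while keeping it on screen. The `Rect` type, the `scan_wider` function, and the 10% default margin are assumptions for illustration; the README does not state how GCT sizes the expanded area.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rect:
    left: int
    top: int
    width: int
    height: int


def scan_wider(area: Rect, screen: Rect, margin_ratio: float = 0.1) -> Rect:
    """Grow `area` by `margin_ratio` of its size on each side, clamped to `screen`.

    Hypothetical helper: the margin value is an assumption, not GCT's setting.
    """
    dx = round(area.width * margin_ratio)
    dy = round(area.height * margin_ratio)
    left = max(screen.left, area.left - dx)
    top = max(screen.top, area.top - dy)
    right = min(screen.left + screen.width, area.left + area.width + dx)
    bottom = min(screen.top + screen.height, area.top + area.height + dy)
    return Rect(left, top, right - left, bottom - top)
```

Capturing slightly more than the user-drawn frame means a word that spills past the frame edge is still seen whole by the OCR step.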
---
## PRO Feature Showcase
### Target on Source & Find Subtitles
These features work together to create an "invisible" translator. **Find Subtitles** scans your screen to detect where text appears, while **Target on Source** places the translation directly on top of the original subtitles.
**Step 1: Initial Scan**

*The capture frame starts at a default size.*
**Step 2: Adaptive Growth**

*The frame expands in real-time to fit longer subtitles.*
**Step 3: Locked & Immersive**

*The frame locks at the perfect width, providing a seamless translation overlay.*
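
The three steps above (default size, growth, lock) can be sketched as a small state machine. Everything here is an assumption for illustration: the `AdaptiveFrame` class, its default sizes, and the rule of locking after a few consecutive observations without growth are not taken from GCT's source.

```python
class AdaptiveFrame:
    """Grows to fit the widest subtitle seen, then locks (hypothetical sketch)."""

    def __init__(self, default_width: int = 400, max_width: int = 1600,
                 padding: int = 20, stable_frames: int = 3) -> None:
        self.width = default_width
        self.max_width = max_width
        self.padding = padding
        self.stable_frames = stable_frames  # assumed lock criterion
        self.locked = False
        self._stable = 0

    def observe(self, text_width: int) -> int:
        """Feed the pixel width of the latest detected subtitle; return frame width."""
        if self.locked:
            return self.width
        needed = min(self.max_width, text_width + 2 * self.padding)
        if needed > self.width:
            # A longer subtitle appeared: grow and restart the stability count.
            self.width = needed
            self._stable = 0
        else:
            self._stable += 1
            if self._stable >= self.stable_frames:
                self.locked = True
        return self.width
```

For example, feeding widths 300, 700, 500, 600, 650 grows the frame once to 740 and then locks it, after which later observations no longer change the width.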
### OCR Prompt
Take direct control over what the AI "sees". Filter out complex HUD elements, minimaps, and speaker names to focus only on the dialogue.
Result: "Bang. Brain all up the walls." The speaker name "Vincenzo:" is stripped, and all HUD elements are ignored. The debug log on the right confirms the exact OCR output.
---
## Important Information
> [!CAUTION]
> ### Licensing & API Costs โ Important Note
> Please distinguish between the **GCT Software Licence** and **Third-Party API Costs**:
> * **FREE Features:** These are unlocked in the GCT software for everyone. However, using them requires a connection to the **Google Gemini API**, which carries its own usage costs.
> * **PRO Features:** These require a one-time purchase of a **GCT PRO Licence** to unlock advanced functionality within the program. This fee covers only software access and does **not** include or cover any API costs.
> * **Independent API Services:** GCT is a professional interface for AI services provided by **Google** and **DeepL**. These are independent commercial entities. You are responsible for all costs incurred through their respective APIs (with the exception of the 500,000 characters/month provided in DeepL's free tier).
> * **No Affiliation:** Game-Changing Translator and its author are entirely independent and have **no affiliation** with Google or DeepL. GCT is a tool designed to facilitate the use of these third-party paid services.
> [!WARNING]
> ### Compatibility & Version 4 Requirements
> Please review these technical requirements before proceeding:
> * **API-Only Architecture:** Version 4 is built entirely around third-party AI APIs. Operation is **impossible** without a valid **Gemini API Key**.
> * **Stay on v3.9.6:** If you do not have (and do not plan to obtain) a Gemini API key, you should **not** update to version 4. Please remain on **version 3.9.6**, which is the final release supporting free offline OCR (Tesseract) and offline translation models (MarianMT).
> * **Try Before You Buy:** Do not purchase the **GCT PRO Licence** before thoroughly testing the **FREE** version. Ensure the software works correctly on your system and that the Gemini-powered OCR and translation meet your expectations.
> [!NOTE]
> ### Updating from v3.9.6 or v4.0.0
> Due to significant architectural changes in v4, automatic updates from versions **3.9.6** and **4.0.0** are not supported. If you are currently on either of these versions, please download and install the latest release **manually** from the [Releases](https://github.com/tomkam1702/OCR-Translator/releases) page. Automatic updates will work normally starting from version **4.0.1**.
---
## Deprecated Features
Version 4 marks a significant shift towards high-quality AI-driven workflows. As a result, the following features have been retired:
* **Tesseract** OCR and **MarianMT** translation: These traditional/local technologies no longer keep up with the state-of-the-art results provided by modern AI.
* **OpenAI** and **Google Translate** support: Retired to streamline the application towards the most effective and cost-efficient engines.
---
## Links
* **[Full User Manual](https://tomkam1702.github.io/OCR-Translator/docs/user-manual.html)**
* **[Get GCT PRO on Gumroad](https://tomkam17.gumroad.com/l/gct)**
* **[Official Website](https://tomkam1702.github.io/OCR-Translator/)**
---
## Licence
This project is proprietary software. The source code is provided for educational and evaluation purposes under a restrictive **End User License Agreement (EULA)**.
**You may:**
* Use the Free Edition for personal and evaluation purposes.
* Review the source code for learning.
**You may NOT:**
* Redistribute or sell the software.
* Modify the software to bypass PRO Licence checks.
* Use the code for commercial gain without permission.
For full details, see the [LICENSE](LICENSE) file.
---
### Author's Portfolio
**[OHLC Forge](https://github.com/tomkam1702/OHLC-Forge)**: Professional OHLC daily data reconstruction for Binance and Bybit.