https://github.com/leapward-koex/namida-ocr

A purely browser-based OCR tool designed for recognizing, copying, and pronouncing Japanese text

chrome-extension edge-extension firefox-addon japanese-language language-learning optical-character-recognition text-to-speech yomitan

Last synced: 4 months ago

Host: GitHub
URL: https://github.com/leapward-koex/namida-ocr
Owner: Leapward-Koex
License: gpl-3.0
Created: 2024-12-28T22:32:26.000Z (6 months ago)
Default Branch: master
Last Pushed: 2025-01-14T09:35:54.000Z (5 months ago)
Last Synced: 2025-01-14T10:53:07.565Z (5 months ago)
Topics: chrome-extension, edge-extension, firefox-addon, japanese-language, language-learning, optical-character-recognition, text-to-speech, yomitan
Language: TypeScript
Homepage:
Size: 66.6 MB
Stars: 0
Watchers: 1
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: LICENSE

README

# Namida OCR

**Namida OCR** is a completely local OCR browser extension for **Chrome**, **Firefox**, and **Edge**. It lets you take a “snip” (screenshot) of any part of your current tab, upscale it (via either basic linear upscaling or ESRGAN), and then run OCR on the snipped region using Tesseract.js. The OCR currently supports Japanese vertical text and automatically copies the recognized text to your clipboard, making it easy to use with online dictionaries like [Yomitan](https://github.com/yomidevs/yomitan) or manual translation tools. Namida OCR can also speak the recognized text aloud using your browser’s text-to-speech capabilities.

***

## Features

- **Local OCR**
All OCR processing is done locally in your browser using [Tesseract.js](https://github.com/naptha/tesseract.js). No external servers are involved.

- **Snip & Upscale**
By default, **Alt + Q** on Windows and **Option + Q** on macOS activates snipping mode. The selected image region is then upscaled using one of two methods:
  - **Linear Upscaling** (fast, basic)
  - **ESRGAN** (higher-quality AI-based upscaling)

- **Japanese Vertical Text Support**
Namida OCR includes trained data for Japanese vertical text recognition, making it ideal for reading manga, visual novels, or other sources with vertical text layout.

- **Clipboard Copy**
Upon successful OCR, the recognized text is automatically copied to your clipboard so you can quickly paste it into a dictionary or translation tool.

- **Furigana**
Display the phonetic reading of kanji in either hiragana or katakana.

- **Text-to-Speech (TTS)**
Namida OCR includes the option to speak the recognized text aloud using your browser’s TTS engine.
  - **Chrome**: High-quality remote Japanese voices are included by default.
  - **Firefox & Edge (Windows)**: Requires a Japanese language pack with TTS installed.
  - **Edge**: Can also use high-quality **“natural”** voices if available via the Windows language pack.

- **Privacy-Friendly**
No internet connection is required for OCR or upscaling, which use local models bundled with the extension. Text-to-speech also works offline when a locally installed voice is selected; Chrome's remote voices need a network connection.
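
To give a feel for what the fast "linear" upscaling option does, here is a minimal sketch of bilinear interpolation on a grayscale pixel buffer. It is an illustration under stated assumptions, not code from Namida OCR: the function name `upscaleBilinear`, the flat `number[]` image layout, and the corner-aligned sampling are all hypothetical. In a browser the same effect is usually obtained by drawing the image onto a larger canvas with smoothing enabled.

```typescript
// Hypothetical helper, not taken from the Namida OCR source.
// Upscales a grayscale image (one number per pixel, row-major order)
// by bilinear interpolation with corner-aligned sampling.
function upscaleBilinear(
  src: number[],
  width: number,
  height: number,
  factor: number
): { data: number[]; width: number; height: number } {
  const outW = Math.round(width * factor);
  const outH = Math.round(height * factor);
  const data: number[] = new Array(outW * outH);

  for (let y = 0; y < outH; y++) {
    // Map the output row back to a fractional source row.
    const sy = outH > 1 ? (y * (height - 1)) / (outH - 1) : 0;
    const y0 = Math.floor(sy);
    const y1 = Math.min(y0 + 1, height - 1);
    const fy = sy - y0;

    for (let x = 0; x < outW; x++) {
      // Map the output column back to a fractional source column.
      const sx = outW > 1 ? (x * (width - 1)) / (outW - 1) : 0;
      const x0 = Math.floor(sx);
      const x1 = Math.min(x0 + 1, width - 1);
      const fx = sx - x0;

      // Blend the four neighbouring source pixels.
      const top = src[y0 * width + x0] * (1 - fx) + src[y0 * width + x1] * fx;
      const bottom = src[y1 * width + x0] * (1 - fx) + src[y1 * width + x1] * fx;
      data[y * outW + x] = top * (1 - fy) + bottom * fy;
    }
  }
  return { data, width: outW, height: outH };
}
```

Interpolation only smooths between existing pixels, which is why it is fast but cannot recover detail the way a learned upscaler such as ESRGAN can.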

## Usage

1. **Activate Snip Mode**
Press **Alt + Q** on Windows or **Option + Q** on macOS on any web page. A snipping overlay will appear.

2. **Select the Region**
Click and drag to highlight the area you want to OCR.

3. **Upscale & OCR**
   - Namida OCR upscales the snipped region using your chosen method (linear or ESRGAN).
   - Tesseract.js then performs OCR on the upscaled image.

4. **Copy to Clipboard**
The recognized text is automatically copied to your clipboard. You can then paste it into any dictionary, translation app, or text editor.

5. **Speak the Text** *(Optional)*
If enabled in settings, you can speak the recognized text aloud using your browser’s TTS capabilities. Simply click the "Speak" button in the recognition window.
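
The click-and-drag step can be sketched as a small geometry helper that turns the two drag endpoints into a crop rectangle. This is a hypothetical illustration, not the extension's code: the names `Point`, `Rect`, and `dragToCropRect` are invented here, and it assumes the tab screenshot is captured at the device pixel ratio while mouse coordinates arrive in CSS pixels.

```typescript
interface Point { x: number; y: number; }
interface Rect { x: number; y: number; width: number; height: number; }

// Hypothetical helper: convert a drag (start and end points in CSS pixels)
// into a crop rectangle in screenshot pixels. The drag may go in any
// direction, so the top-left corner is taken from the smaller coordinates.
function dragToCropRect(start: Point, end: Point, devicePixelRatio: number): Rect {
  const left = Math.min(start.x, end.x);
  const top = Math.min(start.y, end.y);
  const width = Math.abs(end.x - start.x);
  const height = Math.abs(end.y - start.y);

  // Assumption: the screenshot has devicePixelRatio pixels per CSS pixel.
  return {
    x: Math.round(left * devicePixelRatio),
    y: Math.round(top * devicePixelRatio),
    width: Math.round(width * devicePixelRatio),
    height: Math.round(height * devicePixelRatio),
  };
}
```

For example, dragging from (200, 150) up and left to (100, 50) on a display with a pixel ratio of 2 yields a 200 × 200 pixel crop whose top-left corner is at (200, 100) in the screenshot.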

## Settings

- **Furigana display**
  - **None** – Displays no additional kana above kanji.
  - **Hiragana** – Displays hiragana above kanji.
  - **Katakana** – Displays katakana above kanji.

- **Upscaling Mode**
  - **Linear** – Uses basic canvas scaling (faster but lower quality).
  - **ESRGAN** – AI-based upscaling for sharper text.

- **Supported Languages**
  - Japanese (`jpn_vert`)

- **Enable TTS**
  - Option to enable or disable the "Speak" button for recognized text.

- **Preferred TTS Voice**
  - Choose which TTS voice to use when speaking recognized text. The available options depend on your browser and system configuration:
    - **Chrome**: Includes high-quality remote Japanese voices.
    - **Firefox & Edge**: Requires a Japanese language pack with TTS support installed.
    - **Edge (Windows)**: Can use advanced **"natural"** voices from the Windows language pack.
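
As an illustration of the furigana display setting, the sketch below switches a reading between hiragana and katakana. It relies on the fact that the two Unicode blocks are laid out in parallel, 0x60 code points apart. The names `FuriganaMode` and `formatReading` are hypothetical, and the sketch assumes the reading for a kanji run is already available as a kana string; how Namida OCR obtains readings is not described here.

```typescript
// Hypothetical type mirroring the three furigana options.
type FuriganaMode = "none" | "hiragana" | "katakana";

// Hiragana (U+3041..U+3096) and katakana (U+30A1..U+30F6) are 0x60 apart.
const KANA_OFFSET = 0x60;

function hiraganaToKatakana(text: string): string {
  return text.replace(/[\u3041-\u3096]/g, (ch) =>
    String.fromCharCode(ch.charCodeAt(0) + KANA_OFFSET)
  );
}

function katakanaToHiragana(text: string): string {
  return text.replace(/[\u30A1-\u30F6]/g, (ch) =>
    String.fromCharCode(ch.charCodeAt(0) - KANA_OFFSET)
  );
}

// Returns the reading to show above the kanji, or null when furigana is off.
function formatReading(reading: string, mode: FuriganaMode): string | null {
  if (mode === "none") return null;
  if (mode === "hiragana") return katakanaToHiragana(reading);
  return hiraganaToKatakana(reading);
}
```

Characters outside the two ranges, such as the long vowel mark ー, pass through unchanged.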

## Notes

- For the best experience with TTS on Firefox or Edge, ensure your system has a Japanese language pack with text-to-speech capabilities installed. On Edge, you can access **natural** voices through the Windows settings.
- Namida OCR is ideal for users looking to OCR Japanese text, including vertical text layouts commonly found in manga, visual novels, or other Japanese media.
- All processing is performed locally within the browser, ensuring privacy and offline functionality.
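
Because the available voices differ by browser and system, a voice picker has to cope with whatever list it is given. The sketch below shows one possible selection policy and is not the extension's actual logic. `VoiceInfo` mirrors three fields of the Web Speech API's `SpeechSynthesisVoice`; the function name `pickJapaneseVoice` and the rule of preferring voices whose name contains "natural" are assumptions made for illustration.

```typescript
// Minimal shape mirroring the SpeechSynthesisVoice fields used here.
interface VoiceInfo {
  name: string;
  lang: string;          // language tag, e.g. "ja-JP"
  localService: boolean; // true when synthesis happens on the device
}

// Hypothetical policy: the user's saved voice first, then a "natural"
// voice if one exists, then the first Japanese voice in the list.
function pickJapaneseVoice(
  voices: VoiceInfo[],
  preferredName?: string
): VoiceInfo | undefined {
  const japanese = voices.filter((v) => {
    // Some platforms report "ja_JP" instead of "ja-JP".
    const tag = v.lang.toLowerCase().replace("_", "-");
    return tag === "ja" || tag.startsWith("ja-");
  });
  if (japanese.length === 0) return undefined; // no Japanese voice installed

  const preferred = japanese.find((v) => v.name === preferredName);
  if (preferred) return preferred;

  // Assumption: high-quality voices can be spotted by "natural" in the name.
  const natural = japanese.find((v) => /natural/i.test(v.name));
  return natural ?? japanese[0];
}
```

In a browser, the input list would typically come from `speechSynthesis.getVoices()`, and an `undefined` result is the cue to tell the user that a Japanese language pack with TTS support is missing.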
