Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Sanster/text_renderer
Generate text images for training deep learning ocr model
https://github.com/Sanster/text_renderer
crnn ocr synthtext
Last synced: 4 months ago
JSON representation
Generate text images for training deep learning ocr model
- Host: GitHub
- URL: https://github.com/Sanster/text_renderer
- Owner: Sanster
- License: mit
- Created: 2018-05-28T06:33:14.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2022-01-17T07:58:32.000Z (about 3 years ago)
- Last Synced: 2024-10-29T19:59:10.255Z (4 months ago)
- Topics: crnn, ocr, synthtext
- Language: Python
- Size: 12.6 MB
- Stars: 1,390
- Watchers: 43
- Forks: 384
- Open Issues: 60
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# New version release:https://github.com/oh-my-ocr/text_renderer
# Text Renderer
Generate text images for training deep learning OCR model (e.g. [CRNN](https://github.com/bgshih/crnn)).
Support both latin and non-latin text.# Setup
- Ubuntu 16.04
- python 3.5+Install dependencies:
```
pip3 install -r requirements.txt
```# Demo
By default, simply run `python3 main.py` will generate 20 text images
and a labels.txt file in `output/default/`.data:image/s3,"s3://crabby-images/984f6/984f6856f7d29862434eb25e2424db48df0e97ae" alt="example1.jpg"
data:image/s3,"s3://crabby-images/59c2c/59c2c6f03b24fbfc62e0cde0e34ccd664eeb3a12" alt="example2.jpg"data:image/s3,"s3://crabby-images/0e035/0e0353f5cf430b42f9114312a4136bb840e8c272" alt="example3.jpg"
data:image/s3,"s3://crabby-images/d2a49/d2a49dbc1a3f5387d98136ba4172e5168289f17c" alt="example4.jpg"# Use your own data to generate image
1. Please run `python3 main.py --help` to see all optional arguments and their meanings.
And put your own data in corresponding folder.2. Config text effects and fraction in `configs/default.yaml` file(or create a
new config file and use it by `--config_file` option), here are some examples:|Effect name|Image|
|------------|----|
|Origin(Font size 25)|data:image/s3,"s3://crabby-images/d4d82/d4d820450c091a28aa981195d911e37f206fa6fe" alt="origin"|
|Perspective Transform|data:image/s3,"s3://crabby-images/ca48d/ca48db7a40d81f4d8d174672fbe43b78c5ff6357" alt="perspective"|
|Random Crop|data:image/s3,"s3://crabby-images/89ed7/89ed78e92e1171d89d01c4fc327c1e9b3e17bc8e" alt="rand_crop"|
|Curve|data:image/s3,"s3://crabby-images/928cd/928cd54e949d891f36d28cf2d0b73c7b0bc5d9c9" alt="curve"|
|Light border|data:image/s3,"s3://crabby-images/f6df3/f6df3557435f28f97d0c06927a87b389d9b33616" alt="light border"|
|Dark border|data:image/s3,"s3://crabby-images/7b627/7b627ec52c16cc7db5ea9fe4f6204cc3ae2eeca6" alt="dark border"|
|Random char space big|data:image/s3,"s3://crabby-images/db171/db171c406a2b8709a5c3ab31e968ecb87b034541" alt="random char space big"|
|Random char space small|data:image/s3,"s3://crabby-images/03f44/03f4417cf3ced53642abc78e2872cb3d3865d80d" alt="random char space small"|
|Middle line|data:image/s3,"s3://crabby-images/3379f/3379fe5a60be373c906a75e2407f98e7bc84a0cc" alt="middle line"|
|Table line|data:image/s3,"s3://crabby-images/c321d/c321da30ae0ba609cef2059ee0e0e6438b9e92fe" alt="table line"|
|Under line|data:image/s3,"s3://crabby-images/12ee2/12ee2706fc9bcd53f6e21c374939388e93005932" alt="under line"|
|Emboss|data:image/s3,"s3://crabby-images/62001/620012051bc4f821035045af853f5bf38b8b0e15" alt="emboss"|
|Reverse color|data:image/s3,"s3://crabby-images/5c812/5c812a0bdf32d8cac5952395be8635e742eff162" alt="reverse color"|
|Blur|data:image/s3,"s3://crabby-images/4af4d/4af4d96b0fa97021068f66be4217e76dee90bc76" alt="blur"|
|Text color|data:image/s3,"s3://crabby-images/9860e/9860ec777971fa503edb13e2c15e71cd134e2f95" alt="font_color"|
|Line color|data:image/s3,"s3://crabby-images/b2f84/b2f849f2f74d8b7b176683950e5f7eb2c7364289" alt="line_color"|3. Run `main.py` file.
# Strict mode
For no-latin language(e.g Chinese), it's very common that some fonts only support
limited chars. In this case, you will get bad results like these:data:image/s3,"s3://crabby-images/76ef7/76ef7cec2125eb3dc8363becb2b6adac5291a484" alt="bad_example1"
data:image/s3,"s3://crabby-images/4e4d5/4e4d5e4eec92847a1b9c17628b21f0fb4ce12391" alt="bad_example2"
data:image/s3,"s3://crabby-images/670b9/670b93e24f14b60194c60edd3d381d2e1947b99a" alt="bad_example3"
Select fonts that support all chars in `--chars_file` is annoying.
Run `main.py` with `--strict` option, renderer will retry get text from
corpus during generate processing until all chars are supported by a font.# Tools
You can use `check_font.py` script to check how many chars your font not support in `--chars_file`:
```bash
python3 tools/check_font.pychecking font ./data/fonts/eng/Hack-Regular.ttf
chars not supported(4971):
['第', '朱', '广', '沪', '联', '自', '治', '县', '驼', '身', '进', '行', '纳', '税', '防', '火', '墙', '掏', '心', '内', '容', '万', '警','钟', '上', '了', '解'...]
0 fonts support all chars(5071) in ./data/chars/chn.txt:
[]
```# Generate image using GPU
If you want to use GPU to make generate image faster, first compile opencv with CUDA.
[Compiling OpenCV with CUDA support](https://www.pyimagesearch.com/2016/07/11/compiling-opencv-with-cuda-support/)Then build Cython part, and add `--gpu` option when run `main.py`
```
cd libs/gpu
python3 setup.py build_ext --inplace
```# Debug mode
Run `python3 main.py --debug` will save images with extract information.
You can see how perspectiveTransform works and all bounding/rotated boxes.data:image/s3,"s3://crabby-images/2a61a/2a61a736b0b107839972f705862695b72c9db1d6" alt="debug_demo"
# Todo
See https://github.com/Sanster/text_renderer/projects/1## Citing text_renderer
If you use text_renderer in your research, please consider use the following BibTeX entry.
```BibTeX
@misc{text_renderer,
author = {weiqing.chu},
title = {text_renderer},
howpublished = {\url{https://github.com/Sanster/text_renderer}},
year = {2021}
}
```