https://github.com/stefanhuber/txgen
Generate training data for scene text recognition
- Host: GitHub
- URL: https://github.com/stefanhuber/txgen
- Owner: stefanhuber
- Created: 2019-11-11T08:41:38.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2023-01-05T00:45:25.000Z (almost 2 years ago)
- Last Synced: 2024-09-14T01:54:42.039Z (2 months ago)
- Language: JavaScript
- Size: 987 KB
- Stars: 0
- Watchers: 2
- Forks: 1
- Open Issues: 13
Metadata Files:
- Readme: README.md
# txgen
> Generate training data for scene text recognition
## Installation
```
npm i txgen   # add -g for a global install
```

## CLI example 1
Generates 500 character images from the regex pattern `[0-9A-NP-Z#]`, each `64x64` pixels with a padding of `4` pixels. Backgrounds are read from `./backgrounds` and fonts from `./fonts`. The output is written to `./chars-train`; with the `-d` option, each file is placed in a directory named after its character and given an incremental file name (e.g., `A/1.jpg`, `A/2.jpg`, ...).
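To see which characters the pattern in this example admits, the character class can be enumerated with a short Node.js sketch. This is only an illustration of the regex itself, not of how txgen samples from it; the helper name `matchingChars` is hypothetical.

```javascript
// Pattern from CLI example 1, anchored so exactly one character must match.
const charPattern = /^[0-9A-NP-Z#]$/;

// Hypothetical helper: collect every printable ASCII character
// (codes 32..126) that the given regex accepts.
function matchingChars(regex) {
  const chars = [];
  for (let code = 32; code <= 126; code++) {
    const ch = String.fromCharCode(code);
    if (regex.test(ch)) chars.push(ch);
  }
  return chars;
}

const accepted = matchingChars(charPattern);
console.log(accepted.join('')); // #0123456789ABCDEFGHIJKLMNPQRSTUVWXYZ
console.log(accepted.length);   // 36
```

The class accepts `#`, the ten digits, and the letters `A` to `Z` without `O`, 36 characters in total. Assuming txgen names each `-d` directory after its character as described above, `./chars-train` would contain at most 36 subdirectories. The txgen call for this example is: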
```bash
txgen -b ./backgrounds -f ./fonts -o ./chars-train -p "[0-9A-NP-Z#]" --number 500 -w 64 -h 64 -a 4 -d
```

## CLI example 2
Like the example above, but with a different regex, a padding of `10` pixels, and no explicit width: the width is calculated dynamically from the length of the generated text. The output is written to `./serial-numbers`.
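Because the image width depends on the length of the generated text, it helps to know which strings the pattern admits. The following Node.js sketch tests a few sample strings against the regex; the samples are hypothetical and chosen only for illustration, and the sketch says nothing about how txgen itself samples strings or computes the width.

```javascript
// Pattern from CLI example 2, anchored so the whole string must match.
const serialPattern = /^[A-NP-Z0-9#][A-NP-Z0-9\-]{2,10}[A-NP-Z0-9]?$/;

// Hypothetical sample strings, not produced by txgen.
const samples = [
  '#A7',           // shortest possible length (3)
  'SN-2048-XK',    // hyphens are allowed in the middle part
  'ABCDEFGHIJKL',  // longest possible length (12)
  'A1',            // too short
  'ABCDEFGHIJKLM', // too long (13)
  'NO-1',          // contains the excluded letter O
];

for (const s of samples) {
  console.log(s.padEnd(14), serialPattern.test(s) ? 'matches' : 'no match');
}
```

The pattern consists of one leading character, 2 to 10 middle characters, and an optional trailing character, so matching strings are 3 to 12 characters long. The txgen call for this example is: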
```bash
txgen -b ./backgrounds -f ./fonts -o ./serial-numbers -p "[A-NP-Z0-9#][A-NP-Z0-9\-]{2,10}[A-NP-Z0-9]?" --number 500 -h 64 -a 10 -d
```

## CLI Options
| Option | Description | Required |
| --- | --- | --- |
| `-b` | Path to directory with background image files | ✓ |
| `-f` | Path to directory with .ttf font files (e.g., downloaded from [dafont](https://www.dafont.com)) | ✓ |
| `-o` | Output directory of generated files | |
| `-t` | A text string which should be generated | |
| `-a` | Padding in pixels between image border and generated text | |
| `-p` | A regex pattern which defines strings to be generated (use either `-t` or `-p`) | |
| `-n`, `--number` | The total number of output images which should be generated | |
| `-s` | The font size which should be used in pixels | |
| `-w` | The width of the output image in pixels | |
| `-h` | The height of the output image in pixels | |
| `-d` | The generated output should be placed in directories with text/pattern name and incremental filenames | |

## Examples
![example](examples/1.jpg "example")
![example](examples/2.jpg "example")
![example](examples/3.jpg "example")
![example](examples/4.jpg "example")
![example](examples/5.jpg "example")
![example](examples/6.jpg "example")
![example](examples/7.jpg "example")
![example](examples/8.jpg "example")
![example](examples/9.jpg "example")
![example](examples/10.jpg "example")
![example](examples/11.jpg "example")
![example](examples/12.jpg "example")
![example](examples/13.jpg "example")
![example](examples/14.jpg "example")
![example](examples/15.jpg "example")
![example](examples/16.jpg "example")
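
When generating several datasets from a script, the flags from the CLI options table can be assembled programmatically. The sketch below is a minimal illustration in Node.js: `buildTxgenArgs` and its option names (`backgrounds`, `fonts`, `output`, and so on) are hypothetical and not part of txgen; only the short flags come from the table. It treats `-b` and `-f` as required and rejects `-t` combined with `-p`, as the table states.

```javascript
// Hypothetical helper (not part of txgen): maps an options object to the
// short flags listed in the CLI options table.
function buildTxgenArgs(opts) {
  if (!opts.backgrounds || !opts.fonts) {
    throw new Error('backgrounds (-b) and fonts (-f) are required');
  }
  if (opts.text !== undefined && opts.pattern !== undefined) {
    throw new Error('use either text (-t) or pattern (-p), not both');
  }

  const args = ['-b', opts.backgrounds, '-f', opts.fonts];

  // Flags that take a value; each is added only when its option is set.
  const valueFlags = [
    ['-o', opts.output],
    ['-t', opts.text],
    ['-p', opts.pattern],
    ['-n', opts.number],
    ['-s', opts.fontSize],
    ['-a', opts.padding],
    ['-w', opts.width],
    ['-h', opts.height],
  ];
  for (const [flag, value] of valueFlags) {
    if (value !== undefined) args.push(flag, String(value));
  }

  // -d is a switch without a value.
  if (opts.directories) args.push('-d');
  return args;
}

// Settings from CLI example 1.
const txgenArgs = buildTxgenArgs({
  backgrounds: './backgrounds',
  fonts: './fonts',
  output: './chars-train',
  pattern: '[0-9A-NP-Z#]',
  number: 500,
  padding: 4,
  width: 64,
  height: 64,
  directories: true,
});
console.log(txgenArgs.join(' '));
```

The resulting array can be passed to Node's `child_process.execFileSync('txgen', txgenArgs)`, assuming txgen is installed globally and available on the `PATH`. Passing the arguments as an array avoids shell quoting issues with regex patterns.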