Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/MetythornPenn/ocr_toolkits

Collection of functions to work with ocr and synthetic data generator
https://github.com/MetythornPenn/ocr_toolkits

Last synced: 3 months ago
JSON representation

Collection of functions to work with ocr and synthetic data generator

Host: GitHub
URL: https://github.com/MetythornPenn/ocr_toolkits
Owner: MetythornPenn
License: mit
Created: 2024-03-07T13:29:35.000Z (11 months ago)
Default Branch: main
Last Pushed: 2024-06-14T03:48:15.000Z (8 months ago)
Last Synced: 2024-10-14T22:02:39.173Z (4 months ago)
Language: Jupyter Notebook
Homepage:
Size: 3.13 MB
Stars: 2
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
- Support: docs/support.md

Awesome Lists containing this project

awesome-khmer-language - khmerocr_tools

README

        # OCR toolkits

## Introduction

Collection of functions to work with ocr and synthetic data generater

## Features

- Generate synthetic images containing Khmer text

- Customize text content from a file

- Choose from multiple font styles

- Option to apply random blur effect to images

- Generate corresponding labels for each image

## Installation

You can install the Khmer Text Image Generator using pip:

```bash

pip install ocr_toolkits

```

## Usage

### move files from a folder to another folder filter by extension

```python

from ocr_toolkits import move_files_ext

# Move all files from 'src' directory to 'dst' directory

move_files_ext(

    src_dir = 'src', 

    dst_dir = 'dst1',

)

# Move only .jpg files from 'src' directory to 'dst' directory

move_files_ext(

    src_dir = 'src', 

    dst_dir = 'dst1',

    ext = '.jpg'

)

```

### change files extension from a folder filter by extension

```python

from ocr_toolkits import change_files_ext

# Example usage

change_files_ext(

  src_dir ='src', 

  dst_dir = 'dst', 

  ext = '.png'

)

```

### delete files from a folder to another folder filter by extension

```python

from ocr_toolkits import delete_files_ext

delete_files_ext(

    dir = 'dst',

    ext = '.jpg',

)

```

### autocorrect gender

```python

from ocr_toolkits.postprocess import autocorrect_gender

corrected_gender_eng = autocorrect_gender("ប្រុ", return_eng=False)

print(corrected_gender_eng)  # Output: Male

# Example usage with return_eng=False (Cambodian output)

corrected_gender_kh = autocorrect_gender("ស្រ", return_eng=False)

print(corrected_gender_kh)  # Output: ស្រី

```

### resize image 

```python

from ocr_toolkits import resize_image

resized_image = resize_image(

    image_path='./images/img.jpg', 

    width=555,

    height=555,

    save=True,

    save_path='save.jpg'

)

```

### resize image 

```python

from ocr_toolkits import resize_h_128

resized_image = resize_h_128(

    image_path='./images/img.jpg', 

    save=True,

    save_path='save.jpg'

)

```

### resize image 

```python

from ocr_toolkits import to_grayscale

resized_image = to_grayscale(

    image_path='./images/img.jpg', 

    save=True,

    save_path='save.jpg'

)

```

### synthetic data

- create text file to words list eg. dict.txt and put all khmer words you want to gnerate or download [sample data here](https://github.com/MetythornPenn/khmerocr_tools/blob/main/dict.txt)

- create a folder call font and download all font from this link : [font](https://github.com/MetythornPenn/khmerocr_tools/tree/main/font)

- create python script to generate data eg. test.py

```python

from khmerocr_tools import synthetic_data

# Set parameters

image_height = 128

output_folder = 'output'

output_labels_file = 'output/labels.txt'

text_file_path = "dict.txt"

font_option = [1, 2]  

# Generate images and labels

synthetic_data(

    text_file_path, 

    image_height, 

    output_folder, 

    output_labels_file, 

    font_option=font_option, 

    random_blur=True

)

```

## Parameters

- `image_height`: Height of the generated images in pixels.

- `output_folder`: Path to the folder where generated images will be saved.

- `output_labels_file`: Path to the file where labels will be saved.

- `text_file_path`: Path to the text file containing Khmer text for generation.

- `font_option`: List of integers representing font options. 

  - 1 for AKbalthom KhmerLer Regular.

  - 2 for Khmer MEF1 Regular.

  - 3 for Khmer OS Battambang Regular.

  - 4 for Khmer OS Muol Light Regular.

  - 5 for Khmer OS Siemreap Regular.

  - Use an empty list [] to select all available fonts.

- `random_blur`: Boolean flag indicating whether to apply random blur effect to images.

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.