https://github.com/j03-dev/tarzan

Python package for optical character recognition, CNN

cnn keras ocr opencv tensorflow


# Tarzan
Tarzan is a simple Python package for optical character recognition (OCR).
It is built with [tensorflow](https://www.tensorflow.org), developed by Google,
and [opencv-python](https://docs.opencv.org).
It uses a CNN (convolutional neural network) for image recognition.
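
To give a rough idea of what a convolutional layer computes, here is a minimal pure-Python sketch of a single 2D convolution without padding. It is not Tarzan's implementation (the package relies on tensorflow for this), and the function name `conv2d_valid` is a hypothetical one chosen only for illustration.

```python
def conv2d_valid(image, kernel):
    """Slide `kernel` over `image` (both lists of lists) without padding.

    Like most CNN layers, this computes a cross-correlation: the kernel
    is not flipped before it is applied.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    output = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            # Weighted sum of the image patch under the kernel.
            total = 0
            for i in range(kh):
                for j in range(kw):
                    total += image[r + i][c + j] * kernel[i][j]
            row.append(total)
        output.append(row)
    return output


# A 4x4 image with a vertical edge between columns 1 and 2.
image = [[0, 0, 1, 1] for _ in range(4)]
# A 2x2 kernel that responds to dark-to-bright transitions, left to right.
kernel = [[-1, 1], [-1, 1]]

feature_map = conv2d_valid(image, kernel)
print(feature_map)  # [[0, 2, 0], [0, 2, 0], [0, 2, 0]]
```

The feature map is non-zero only where the edge sits. A CNN learns many such kernels from the training data instead of having them written by hand.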

## Build this package

```bash
python -m pip install --upgrade build
python -m build
```

## Install package

```bash
python -m pip install dist/tarzan-0.0.1-py3-none-any.whl
```

## Train your own OCR model with a dataset
#### Example
```python
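from tarzan import OcrModel

# The two arguments are the training and testing data directories
# (as the path names in this example suggest).
ocr_model = OcrModel(
    'dataset-a-z/data/training_data',
    'dataset-a-z/data/testing_data'
)

# Train the model and save it, then save the class labels.
ocr_model.train_and_save(path="model_ocr_v1.model")
ocr_model.save_classes(path="classes")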
```

This example uses the [standard-ocr-dataset](https://www.kaggle.com/datasets/preatcher/standard-ocr-dataset) from Kaggle.
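
Before training, it can help to check that the dataset was unpacked where the example expects it. The sketch below assumes that each split directory (`training_data`, `testing_data`) contains one subfolder per character class with image files inside. That layout is an assumption, not something this README states, and `count_images_per_class` is a hypothetical helper that is not part of Tarzan.

```python
from pathlib import Path

# File extensions treated as images (an assumption; adjust as needed).
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def count_images_per_class(root):
    """Return {class_name: image_count} for each subdirectory of `root`."""
    counts = {}
    for class_dir in sorted(Path(root).iterdir()):
        if class_dir.is_dir():
            counts[class_dir.name] = sum(
                1
                for f in class_dir.iterdir()
                if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
            )
    return counts


if __name__ == "__main__":
    # Paths taken from the training example above.
    for split in ("training_data", "testing_data"):
        split_dir = Path("dataset-a-z/data") / split
        if split_dir.is_dir():
            print(split, count_images_per_class(split_dir))
        else:
            print(f"{split_dir} not found")
```

An empty result, or a class with zero images, usually means the archive was extracted to a different path than the one passed to `OcrModel`.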