Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/watersink/ocrsegment

a deep learning model for page layout analysis / segmentation.
https://github.com/watersink/ocrsegment

Last synced: 10 days ago
JSON representation

a deep learning model for page layout analysis / segmentation.

Awesome Lists containing this project

README

        

# OCR Segmentation
a deep learning model for page layout analysis / segmentation.

## dependencies
tensorflow1.8
>
python3

## dataset:
[uw3-framed-lines-degraded-000](https://storage.googleapis.com/tmb-ocr/uw3-framed-lines-degraded-000.tgz)

## make training labels
python3 data_pre_process.py

## train
python3 train_test.py
## test
python3 segmentation.py
![image]( https://github.com/watersink/ocrsegment/blob/master/make_training_labels/W001.png)
![image]( https://github.com/watersink/ocrsegment/blob/master/make_training_labels/out.png)
![image]( https://github.com/watersink/ocrsegment/blob/master/lines/0.png)

## references
[Multi-Dimensional Recurrent Neural Networks](https://arxiv.org/abs/0705.2011)

[Robust_ Simple Page Segmentation Using Hybrid Convolutional MDLSTM Networks](https://github.com/wanghaisheng/awesome-ocr/files/2042377/Robust_.Simple.Page.Segmentation.Using.Hybrid.Convolutional.MDLSTM.Networks.pdf)

[https://github.com/NVlabs/ocroseg](https://github.com/NVlabs/ocroseg)

[https://github.com/philipperemy/tensorflow-multi-dimensional-lstm](https://github.com/philipperemy/tensorflow-multi-dimensional-lstm)