Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/vsymbol/CUTIE

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)
https://github.com/vsymbol/CUTIE

computer-vision deep-learning text-extraction

Last synced: 10 days ago
JSON representation

CUTIE (TensorFlow implementation of Convolutional Universal Text Information Extractor)

Host: GitHub
URL: https://github.com/vsymbol/CUTIE
Owner: vsymbol
Created: 2019-01-15T06:18:27.000Z (almost 6 years ago)
Default Branch: master
Last Pushed: 2022-12-08T05:25:04.000Z (almost 2 years ago)
Last Synced: 2024-08-02T11:14:52.251Z (3 months ago)
Topics: computer-vision, deep-learning, text-extraction
Language: Python
Homepage:
Size: 2.87 MB
Stars: 156
Watchers: 16
Forks: 78
Open Issues: 18
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# CUTIE
TensorFlow implementation of the paper "CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor."
Xiaohui Zhao [Paper Link](https://arxiv.org/abs/1903.12363v4)

----
CUTIE 是用于“票据文档” 2D 关键信息提取/命名实体识别/槽位填充算法。
使用CUTIE前，需先使用OCR算法对“票据文档” 中的文字执行检测和识别，而后将格式化的文本输入入CUTIE网络，具体流程可参照论文。

CUTIE can be considered as one type of 2-Dimensional Key Information Extraction, 2-D NER (Named Entity Recognition) or a 2-Dimensional 2D Slot Filling algorithm.
Before training / inference with CUTIE, prepare your structured texts in your scanned document images with any type of OCR algorithm. Refer to the CUTIE paper for details about the procedure.

### Results

Result evaluated on 4,484 receipt documents, including taxi receipts, meals entertainment receipts, and hotel receipts, with 9 different key information classes. (AP / softAP)
|Method | #Params | Taxi | Hotel |
| ----------|:---------:| :-----: | :-----: |
| CloudScan | - | 82.0 / - | 60.0 / - |
| BERT | 110M | 88.1 / - | 71.7 / - |
| CUTIE |**14M** |**94.0 / 97.3**|**74.6 / 87.0**|

![Taxi](https://github.com/vsymbol/CUTIE/raw/master/others/example_1.jpg)

![Hotel](https://github.com/vsymbol/CUTIE/raw/master/others/example_2.jpg)

### Installation & Usage

```
pip install -r requirements.txt
```

1. Generate your own dictionary with main_build_dict.py / main_data_tokenizer.py
2. Train your model with main_train_json.py

CUTIE achieves best performance with rows/cols well configured. For more insights, refer to statistics in the file (others/TrainingStatistic.xlsx).

![Chart](https://github.com/vsymbol/CUTIE/raw/master/others/chart.jpg)

### Others

For information about the input example, refer to [issue discussion](https://github.com/vsymbol/CUTIE/issues/7).
- Apply any OCR tool that help you detecting and recognizing words in the scanned document image.
- Label image OCR results with key information class as the .json file in the invoice_data folder. (thanks to @4kssoft)