https://github.com/ayumiymk/aster.pytorch

ASTER in Pytorch
https://github.com/ayumiymk/aster.pytorch

aster computer-vision ocr pytorch scene-text text-recognition text-rectification

Last synced: 8 months ago
JSON representation

ASTER in Pytorch

Host: GitHub
URL: https://github.com/ayumiymk/aster.pytorch
Owner: ayumiymk
License: mit
Created: 2019-07-13T06:56:34.000Z (over 6 years ago)
Default Branch: master
Last Pushed: 2021-12-09T11:26:06.000Z (almost 4 years ago)
Last Synced: 2025-03-28T19:10:03.929Z (8 months ago)
Topics: aster, computer-vision, ocr, pytorch, scene-text, text-recognition, text-rectification
Language: Python
Homepage:
Size: 117 KB
Stars: 676
Watchers: 11
Forks: 168
Open Issues: 23
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # ASTER: Attentional Scene Text Recognizer with Flexible Rectification

This repository implements the ASTER in pytorch. Origin software could be found in [here](https://github.com/bgshih/aster).

ASTER is an accurate scene text recognizer with flexible rectification mechanism. The research paper can be found [here](https://ieeexplore.ieee.org/abstract/document/8395027/).

![ASTER Overview](overview.png)

## Installation

```

conda env create -f environment.yml

```

## Train

[**NOTE**] Some users say that they can't reproduce the reported performance with minor modification, like [1](https://github.com/ayumiymk/aster.pytorch/issues/17#issuecomment-527380815) and [2](https://github.com/ayumiymk/aster.pytorch/issues/17#issuecomment-528718596). I haven't try other settings, so I can't guarantee the same performance with different settings. The users should just run the following script without any modification to reproduce the results.

```

bash scripts/stn_att_rec.sh

```

## Test

You can test with .lmdb files by

```

bash scripts/main_test_all.sh

```

Or test with single image by

```

bash scripts/main_test_image.sh

```

## Pretrained model

The pretrained model is available on our [release page](https://github.com/ayumiymk/aster.pytorch/releases/download/v1.0/demo.pth.tar). Download `demo.pth.tar` and put it to somewhere. Before running, modify the `--resume` to the location of this file.

## Reproduced results

|               | IIIT5k |  SVT |  IC03 |  IC13 |  IC15 | SVTP  |  CUTE |

|:-------------:|:------:|:----:|:-----:|:-----:|:-----:|:-----:|:-----:|

|  ASTER (L2R)  |  92.67 |   -  | 93.72 | 90.74 |    -  | 78.76 | 76.39 |

| ASTER.Pytorch |  93.2  | 89.2 | 92.2  |   91  |  78.0 |  81.2 |  81.9 |

At present, the bidirectional attention decoder proposed in ASTER is not included in my implementation. 

You can use the codes to bootstrap for your next text recognition research project.

## Data preparation

We give an example to construct your own datasets. Details please refer to `tools/create_svtp_lmdb.py`.

We also provide datasets for [training](https://pan.baidu.com/s/1BMYb93u4gW_3GJdjBWSCSw&shfl=sharepset) (password: wi05) and [testing](https://drive.google.com/open?id=1U4mGLlsm9Ade1-gQOyd6He5R0yiaafYJ).

## Citation

If you find this project helpful for your research, please cite the following papers:

```

@article{bshi2018aster,

  author    = {Baoguang Shi and

               Mingkun Yang and

               Xinggang Wang and

               Pengyuan Lyu and

               Cong Yao and

               Xiang Bai},

  title     = {ASTER: An Attentional Scene Text Recognizer with Flexible Rectification},

  journal   = {IEEE Transactions on Pattern Analysis and Machine Intelligence}, 

  volume    = {41},

  number    = {9},

  pages     = {2035--2048},

  year      = {2019},

}

@inproceedings{ShiWLYB16,

  author    = {Baoguang Shi and

               Xinggang Wang and

               Pengyuan Lyu and

               Cong Yao and

               Xiang Bai},

  title     = {Robust Scene Text Recognition with Automatic Rectification},

  booktitle = {2016 {IEEE} Conference on Computer Vision and Pattern Recognition,

               {CVPR} 2016, Las Vegas, NV, USA, June 27-30, 2016},

  pages     = {4168--4176},

  year      = {2016}

}

```

IMPORTANT NOTICE: Although this software is licensed under MIT, our intention is to make it free for academic research purposes. If you are going to use it in a product, we suggest you [contact us](xbai@hust.edu.cn) regarding possible patent issues.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ayumiymk/aster.pytorch

Awesome Lists containing this project

README