Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/opconty/keras_std
This repository is the official implementation of Semantic-based Scene Text Detector(STD for short), a new scene text detection method, inspired by the idea of CSP, which is used on Pedestrian Detection.
https://github.com/opconty/keras_std
keras ocr scene-text-detectors text-detection
Last synced: 7 days ago
JSON representation
This repository is the official implementation of Semantic-based Scene Text Detector(STD for short), a new scene text detection method, inspired by the idea of CSP, which is used on Pedestrian Detection.
- Host: GitHub
- URL: https://github.com/opconty/keras_std
- Owner: opconty
- Created: 2019-07-16T08:44:55.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2019-07-16T08:47:25.000Z (almost 5 years ago)
- Last Synced: 2024-03-15T00:38:18.426Z (4 months ago)
- Topics: keras, ocr, scene-text-detectors, text-detection
- Language: Python
- Homepage: https://mp.weixin.qq.com/s/F5twPy_6fjpwQuCRxMPGIA
- Size: 3.33 MB
- Stars: 2
- Watchers: 3
- Forks: 1
- Open Issues: 2
-
Metadata Files:
- Readme: readme.md
Lists
README
# STD: A Semantic-based Scene Text Detector
This repository is the official implementation of Semantic-based Scene Text Detector(STD for short), a new scene text detection method, inspired by the idea of [CSP](https://arxiv.org/abs/1904.02948), which is used on Pedestrian Detection. we evaluated STD on Icdar2013 dataset with a limited computing resources(only one GeForce GTX 1080), and achived 0.81 in F-measure with a single image scale.
![demo](./examples/paper_samples.png)
## Introduction
STD is an efficient and fast method for scene text detection:
- no any further post-processings, like NMS.
- anchor-free.
- easy to generate training labels.
- only one step process to get final bounding boxes.## Training
Download (gived below) SynthTextDetectionEnglish and Icdar13 dataset, training on SynthText about 5 epochs(up to you if you have more power and time), and finetune on icdar13 dataset.
Training image resize to 384x384 on SynthText, and finetune on Icdar13 with image size 384x384 for the first stage, then resize to larger size(e.g. 640) for the second stage, and decrease learning rate(e.g. 10x) synchronously until converges.
## Inference
Refer to [predict.py](./predict.py)
## Examples
![icdar13](./examples/what_a_mess.png)
## Dataset
SynthTextDetectionEnglish contains more than 300K images, download training image and annotations from [baiduyun, code:eq92](https://pan.baidu.com/s/1DT-vRyXtRXArc7Xj5XrYJA).
Icdar2013 dataset: training and testing images from [official site](https://rrc.cvc.uab.es/?ch=2&com=downloads) for Task 2.1 Text Localization. Download reconstructed annotations from [baiduyun, code:d9gb](https://pan.baidu.com/s/1deB5AoE3cqXoupr1FE51Ng).
## Limitations
STD only for vertical or horizontal text detection. and we are working on arbitrary direction text detection now, wouldn't be too long.
Any questions or suggestions,please drop a comment or contact me,email: [email protected].
## License
This project is released under the [Apache 2.0 license](https://www.apache.org/licenses/LICENSE-2.0).
## Citation
If you use our codebase in your work, please cite this project.
a paper or technical report will be released soon.And besides, you are welcomed to join us to maintain this project.
```
@misc{std2019,
author = {Gao Lijun},
title = {STD: A Semantic-based Scene Text Detector},
howpublished = {\url{https://github.com/opconty/keras_std}},
year = {2019}
}
```