Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/abdur75648/urdu-text-detection
Text line detection for Urdu OCR (UTRNet)
https://github.com/abdur75648/urdu-text-detection
contournet document-processing ocr text-detection urdu-ocr urdu-text-detection utrnet
Last synced: 21 days ago
JSON representation
Text line detection for Urdu OCR (UTRNet)
- Host: GitHub
- URL: https://github.com/abdur75648/urdu-text-detection
- Owner: abdur75648
- Created: 2023-01-15T06:43:27.000Z (almost 2 years ago)
- Default Branch: main
- Last Pushed: 2024-01-31T10:35:54.000Z (11 months ago)
- Last Synced: 2024-04-23T00:11:52.742Z (8 months ago)
- Topics: contournet, document-processing, ocr, text-detection, urdu-ocr, urdu-text-detection, utrnet
- Language: Python
- Homepage: https://abdur75648.github.io/UTRNet/
- Size: 48.5 MB
- Stars: 3
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Urdu OCR - Text Line Detection Inference
**Released as a supplement of [UTRNet: High-Resolution Urdu Text Recognition](https://github.com/abdur75648/UTRNet-High-Resolution-Urdu-Text-Recognition)**
[![UTRNet](https://img.shields.io/badge/UTRNet:%20High--Resolution%20Urdu%20Text%20Recognition-blueviolet?logo=github&style=flat-square)](https://github.com/abdur75648/UTRNet-High-Resolution-Urdu-Text-Recognition)
[![Website](https://img.shields.io/badge/Website-Visit%20Here-darkgreen?style=flat-square)](https://abdur75648.github.io/UTRNet/)
[![arXiv](https://img.shields.io/badge/arXiv-2306.15782-darkred.svg)](https://arxiv.org/abs/2306.15782)
[![SpringerLink](https://img.shields.io/badge/Springer-Page-darkblue.svg)](https://link.springer.com/chapter/10.1007/978-3-031-41734-4_19)
[![SpringerLink](https://img.shields.io/badge/Springer-PDF-blue.svg)](https://rdcu.be/dkbIF)
[![Demo](https://img.shields.io/badge/Demo-Online-brightgreen.svg)](https://abdur75648-urduocr-utrnet.hf.space)## Steps to run the code
***Note** - Due to several dependency issues in [maskrcnn-benchmark](https://github.com/facebookresearch/maskrcnn-benchmark), YoloV8 is used for text line detection. The model is finetuned on the [UrduDoc](https://paperswithcode.com/dataset/urdudoc) dataset.* Clone the repository
* Install the dependencies
```bash
pip install torch==2.0.1 ultralytics==8.1.8
```
* Download [yolov8m_UrduDoc.pt](https://github.com/abdur75648/urdu-text-detection/releases/download/v1.0.0/yolov8m_UrduDoc.pt)
* Put `test.jpg`
* Run `python3 detect.py`
* The output will be saved as `output.jpg`## Note
The code & trained model is for research purposes only and must not be used for any other purpose without the author's explicit permission.## Citation
If you use the code/model/dataset, please cite the following paper:```BibTeX
@article{rahman2023utrnet,
title={UTRNet: High-Resolution Urdu Text Recognition In Printed Documents},
author={Abdur Rahman and Arjun Ghosh and Chetan Arora},
journal={arXiv preprint arXiv:2306.15782},
year={2023},
eprint={2306.15782},
archivePrefix={arXiv},
primaryClass={cs.CV},
doi = {https://doi.org/10.48550/arXiv.2306.15782},
url = {https://arxiv.org/abs/2306.15782}
}
```### License
[![Creative Commons License](https://i.creativecommons.org/l/by-nc-sa/4.0/88x31.png)](http://creativecommons.org/licenses/by-nc-sa/4.0/). This work is licensed under a [Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License](http://creativecommons.org/licenses/by-nc-sa/4.0/) for Noncommercial (academic & research) purposes only and must not be used for any other purpose without the author's explicit permission.