https://github.com/abdur75648/urdu-text-detection
Text line detection for Urdu OCR (UTRNet)
https://github.com/abdur75648/urdu-text-detection
contournet document-processing ocr text-detection urdu-ocr urdu-text-detection utrnet
Last synced: 2 months ago
JSON representation
Text line detection for Urdu OCR (UTRNet)
- Host: GitHub
- URL: https://github.com/abdur75648/urdu-text-detection
- Owner: abdur75648
- Created: 2023-01-15T06:43:27.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2024-10-08T05:21:03.000Z (9 months ago)
- Last Synced: 2025-03-30T14:41:55.334Z (3 months ago)
- Topics: contournet, document-processing, ocr, text-detection, urdu-ocr, urdu-text-detection, utrnet
- Language: Python
- Homepage: https://abdur75648.github.io/UTRNet/
- Size: 48.5 MB
- Stars: 6
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Urdu OCR - Text Line Detection Inference
**Released as a supplement of [UTRNet: High-Resolution Urdu Text Recognition](https://github.com/abdur75648/UTRNet-High-Resolution-Urdu-Text-Recognition)**
[](https://github.com/abdur75648/UTRNet-High-Resolution-Urdu-Text-Recognition)
[](https://abdur75648.github.io/UTRNet/)
[](https://arxiv.org/abs/2306.15782)
[](https://link.springer.com/chapter/10.1007/978-3-031-41734-4_19)
[](https://rdcu.be/dkbIF)
[](https://abdur75648-urduocr-utrnet.hf.space)## Steps to run the code
***Note** - Due to several dependency issues in [maskrcnn-benchmark](https://github.com/facebookresearch/maskrcnn-benchmark), YoloV8 is used for text line detection. The model is finetuned on the [UrduDoc](https://paperswithcode.com/dataset/urdudoc) dataset.* Clone the repository
* Install the dependencies
```bash
pip install torch==2.0.1 ultralytics==8.1.8
```
* Download [yolov8m_UrduDoc.pt](https://github.com/abdur75648/urdu-text-detection/releases/download/v1.0.0/yolov8m_UrduDoc.pt)
* Put `test.jpg`
* Run `python3 detect.py`
* The output will be saved as `output.jpg`## Note
The code & trained model is for research purposes only and must not be used for any other purpose without the author's explicit permission.## Citation
If you use the code/model/dataset, please cite the following paper:```BibTeX
@article{rahman2023utrnet,
title={UTRNet: High-Resolution Urdu Text Recognition In Printed Documents},
author={Abdur Rahman and Arjun Ghosh and Chetan Arora},
journal={arXiv preprint arXiv:2306.15782},
year={2023},
eprint={2306.15782},
archivePrefix={arXiv},
primaryClass={cs.CV},
doi = {https://doi.org/10.48550/arXiv.2306.15782},
url = {https://arxiv.org/abs/2306.15782}
}
```### License
[](http://creativecommons.org/licenses/by-nc-sa/4.0/). This work is licensed under a [Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License](http://creativecommons.org/licenses/by-nc-sa/4.0/) for Noncommercial (academic & research) purposes only and must not be used for any other purpose without the author's explicit permission.