https://github.com/rapidai/rapidocr
đ Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
https://github.com/rapidai/rapidocr
chineseocr crnn dbnet easyocr ocr onnxocr onnxruntime openvino paddleocr rapidocr rapidocronnxruntime
Last synced: 10 months ago
JSON representation
đ Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.
- Host: GitHub
- URL: https://github.com/rapidai/rapidocr
- Owner: RapidAI
- License: apache-2.0
- Created: 2021-01-04T08:02:09.000Z (about 5 years ago)
- Default Branch: main
- Last Pushed: 2025-04-26T08:55:07.000Z (11 months ago)
- Last Synced: 2025-05-07T19:47:42.748Z (10 months ago)
- Topics: chineseocr, crnn, dbnet, easyocr, ocr, onnxocr, onnxruntime, openvino, paddleocr, rapidocr, rapidocronnxruntime
- Language: Python
- Homepage: https://rapidai.github.io/RapidOCRDocs
- Size: 19.6 MB
- Stars: 4,028
- Watchers: 51
- Forks: 433
- Open Issues: 27
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
README
Â
Open source OCR for the security of the digital world
Â
[įŽäŊ䏿](./docs/README_zh.md) | English
### Introduction
đ Introducing the foremost multi-platform, multi-lingual OCR tool that boasts unparalleled speed, expansive support, and complete openness. This exceptional software is entirely free and renowned for facilitating swift offline deployments. Core to its efficiency is the ONNXRuntime inference engine, offering 4 to 5 times the speed of PaddlePaddle's engine while ensuring no memory leaks.
đĻ **Supported Languages**: It inherently supports Chinese and English, with self-service conversion required for additional languages. Please refer [here](https://rapidai.github.io/RapidOCRDocs/blog/2022/09/28/%E6%94%AF%E6%8C%81%E8%AF%86%E5%88%AB%E8%AF%AD%E8%A8%80/) for specific language support details.
đ **Rationale**: Acknowledging the limitations in [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)'s architecture, we embarked on a mission to simplify OCR inference across diverse platforms. This endeavor culminated in converting PaddleOCR's model to the versatile ONNX format and seamlessly integrating it into Python, C++, Java, and C# environments.
đ **Etymology**: Derived from its essence, RapidOCR embodies lightness, velocity, affordability, and intelligence. Rooted in deep learning, this OCR technology underscores AI's prowess and emphasizes compact models, prioritizing swiftness without compromising efficacy.
đ **Usage Scenarios**:
- **Instant Deployment**: If the pre-existing models within our repository suffice, simply leverage RapidOCR for swift deployment.
- **Customization**: In case of specific requirements, refine PaddleOCR with your data and proceed with RapidOCR deployment, ensuring tailored results.
If our repository proves beneficial to your endeavors, kindly consider leaving a star â on GitHub to show your appreciation. It means the world to us!
### Visualization ([more](https://rapidai.github.io/RapidOCRDocs/visualization/))
### Installation
```bash
pip install onnxruntime
pip install rapidocr
```
### Usage
```python
from rapidocr import RapidOCR
engine = RapidOCR()
img_url = "https://github.com/RapidAI/RapidOCR/blob/main/python/tests/test_files/ch_en_num.jpg?raw=true"
result = engine(img_url)
print(result)
result.vis("vis_result.jpg")
```
### Documentation
Full documentation can be found on [docs](https://rapidai.github.io/RapidOCRDocs/), in Chinese.
### Who use?
Used by [link](https://github.com/RapidAI/RapidOCR/discussions/286)
### Acknowledgements
- Many thanks to [DeliciaLaniD](https://github.com/DeliciaLaniD) for fixing the misplaced start position of scan animation in ocrweb.
- Many thanks to [zhsunlight](https://github.com/zhsunlight) for the suggestion about parameterized call GPU reasoning and the careful and thoughtful testing.
- Many thanks to [lzh111222334](https://github.com/lzh111222334) for fixing some bugs of rec preprocessing under python version.
- Many thanks to [AutumnSun1996](https://github.com/AutumnSun1996) for the suggestion in the [#42](https://github.com/RapidAI/RapidOCR/issues/42).
- Many thanks to [DeadWood8](https://github.com/DeadWood8) for providing the [document](https://rapidai.github.io/RapidOCRDocs/install_usage/rapidocr_web/nuitka_package) which packages rapidocr_web to exe by Nuitka.
- Many thanks to [Loovelj](https://github.com/Loovelj) for fixing the bug of sorting the text boxes. For details see [issue 75](https://github.com/RapidAI/RapidOCR/issues/75).
### đ Code Contributors
### [Sponsor](https://rapidai.github.io/RapidOCRDocs/sponsor/)
> [!IMPORTANT]
>
> If you want to sponsor the project, you can directly click the **Buy me a coffee** image, please write a note (e.g. your github account name) to facilitate adding to the sponsorship list below.
>
>
| Sponsor | Applied Products |
| :-------: | :----------: |
|
|
|
|
| - |
### Citation
If you find this project useful in your research, please consider cite:
```bibtex
@misc{RapidOCR 2021,
title={{Rapid OCR}: OCR Toolbox},
author={RapidAI Team},
howpublished = {\url{https://github.com/RapidAI/RapidOCR}},
year={2021}
}
```
### âī¸ Stargazers over time
[](https://starchart.cc/RapidAI/RapidOCR)
### License
The copyright of the OCR model is held by Baidu, while the copyrights of all other engineering scripts are retained by the repository's owner.
This project is released under the [Apache 2.0 license](./LICENSE).
