Skip to content

RapidAI/RapidOCR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

1,481 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Shows an illustrated sun in light mode and a moon with stars in dark mode.
ย 
Open source OCR for the security of the digital world
ย 

Open in Colab PyPI SemVer2.0

Join our Discord

็ฎ€ไฝ“ไธญๆ–‡ | English

๐Ÿ“ Introduction

RapidOCR is a completely open-source, free OCR tool that supports multi-platform, multi-language operation and rapid offline deployment. Its core advantages lie in extreme speed and extensive compatibility.

Supported Languages: Default support for Chinese and English recognition. For other supported languages, please refer to the documentation: Model List.

Project Origin: Considering that PaddleOCR still has room for optimization in engineering aspects, we innovatively converted the models in PaddleOCR into the highly compatible ONNX format to simplify and accelerate the inference deployment of OCR models on various terminal devices. Furthermore, we achieved seamless cross-platform porting based on multiple programming languages such as Python, C++, Java, and C#, enabling developers to get started easily and integrate efficiently.

Name Implication: The name "RapidOCR" embodies our core expectations for the product: Rapid (simple operation, fast response), Good & Economical (low resource consumption, high cost-effectiveness), and Intelligent (achieving accurate and efficient recognition relying on deep learning technology). We focus on leveraging the advantages of artificial intelligence to create compact yet powerful models, relentlessly pursuing speed while ensuring excellent recognition results.

User Guide:

  • Direct Deployment: If the models provided in this repository meet your needs, simply refer to the Official Documentation to quickly complete the deployment and usage of RapidOCR.
  • Custom Fine-tuning: If the existing models cannot meet specific scenario requirements, you can fine-tune them using your own data based on PaddleOCR, and then apply the optimized models to the RapidOCR deployment process to achieve personalized customization.

If you find this project helpful for your work or study, we kindly ask you to give us a โญ Star to provide valuable support and encouragement!

๐ŸŽฅ Visualization

Demo

๐Ÿ› ๏ธ Installation

pip install rapidocr onnxruntime

๐Ÿ“‹ Usage

from rapidocr import RapidOCR

engine = RapidOCR()

img_url = "https://www.modelscope.cn/models/RapidAI/RapidOCR/resolve/master/resources/test_files/ch_en_num.jpg"
result = engine(img_url)
print(result)

result.vis("vis_result.jpg")

๐Ÿ“š Documentation

Full documentation can be found on docs, in Chinese.

๐Ÿ‘ฅ Who use? (more)

For more projects that use RapidOCR, you are welcome to register at the registration address. Registration is solely for product promotion.

๐Ÿ™ Acknowledgements

  • Many thanks to PaddleOCR for everything.
  • Many thanks to PaddleOCR2Pytorch for providing the converted PyTorch format models.
  • Many thanks to PaddleX for providing the document models.
  • Many thanks to DeliciaLaniD for fixing the misplaced start position of scan animation in ocrweb.
  • Many thanks to zhsunlight for the suggestion about parameterized call GPU reasoning and the careful and thoughtful testing.
  • Many thanks to lzh111222334 for fixing some bugs of rec preprocessing under python version.
  • Many thanks to AutumnSun1996 for the suggestion in the #42.
  • Many thanks to DeadWood8 for providing the document which packages rapidocr_web to exe by Nuitka.
  • Many thanks to Loovelj for fixing the bug of sorting the text boxes. For details see issue 75.

๐Ÿค Contribution Guide

This repository contains the Python component of RapidOCR. Components for other languages have been migrated to separate repositories.

For the complete workflow on contributing to Python development, please refer to: Python CONTRIBUTING.

๐ŸŽ– Code Contributors

๐ŸŒŸ Sponsors & Backers

RapidOCR is an Apache2.0-licensed open source project with its ongoing development made possible entirely by the support of these awesome backers. If you'd like to join them, please consider sponsoring RapidOCR's development.

Sponsors

Sponsors Application Introduction
Quicker Your fingertip toolbox

Backers


@zg-young

@Eunsolfs

@youzzhang

๐Ÿ“œ Citation

If you find this project useful in your research, please consider cite:

@misc{RapidOCR 2021,
    title={{Rapid OCR}: OCR Toolbox},
    author={RapidAI Team},
    howpublished = {\url{https://github.com/RapidAI/RapidOCR}},
    year={2021}
}

โญ๏ธ Stargazers over time

Stargazers over time

โš–๏ธ License

The copyright of the OCR model is held by Baidu, while the copyrights of all other engineering scripts are retained by the repository's owner.

This project is released under the Apache 2.0 license.

About

๐Ÿ“„ Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, MNN, PaddlePaddle and PyTorch.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Sponsor this project

Contributors