5 million parameters level the scale of billion-level large models: Baidu PaddleOCR surpasses Tesseract to top the GitHub OCR charts

BlockBeatNews

According to monitoring by 1M AI News, Baidu’s open-source OCR toolkit PaddleOCR has surpassed Google’s long-established OCR engine Tesseract (73,200 stars) with 73,300 stars on GitHub, making it the highest-rated OCR project on GitHub. The third-ranked MinerU has 57,500 stars. PaddleOCR was open-sourced in 2020, supporting over 100 languages and covering more than 160 countries and regions.

PaddleOCR has recently undergone intensive updates, with the release of PP-OCRv5 last week featuring only 5 million parameters, achieving accuracy comparable to that of billion-parameter visual language models on standard OCR benchmarks; PaddleOCR-VL-1.5 set a new record with an accuracy of 94.5% on the document parsing benchmark OmniDocBench v1.5.

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.
Comment
0/400
No comments