paddlepaddle/paddleocr
active
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Python
Stars: 83,304
Forks: 10,844
Open issues: 154
24h: +76 (+0.1%)
7d: +520 (+0.7%)
Star history (7 days)
Last checked: 28m ago
Last pushed: 16h ago
Next check: just now