-
Please share an example that is handled correctly by pytesseract but not by the Tesseract embedded in PyMuPDF.
-
I'm using PyMuPDF 1.24.13 at the moment. According to discussion #3254, PyMuPDF should be able to handle orientation issues for text capture, but OCR was not mentioned. Is this auto-rotation handled in the OCR process as well?
Additionally, I have some scanned PDFs with mixed page rotations (user error), and the OCR process is not producing usable text except for the cover page (which normally would not be present). We're able to deal with these files using pytesseract in testing, but would prefer to stick with PyMuPDF if possible.
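To make the question concrete, here is a minimal sketch of one possible workaround, not a confirmed PyMuPDF feature. It uses Tesseract's orientation detection (OSD) through pytesseract to estimate each page's rotation, then runs PyMuPDF's own OCR on the corrected page. The function names `corrected_rotation` and `ocr_with_auto_rotation` are hypothetical. It assumes pytesseract, Pillow, and Tesseract language data (including `osd`) are installed, that the OSD `rotate` value is the clockwise correction to apply, and that full-page OCR honours the page's `/Rotate` setting.

```python
def corrected_rotation(current: int, osd_rotate: int) -> int:
    """Combine the page's current rotation with the OSD correction (degrees)."""
    return (current + osd_rotate) % 360


def ocr_with_auto_rotation(path: str, dpi: int = 300, language: str = "eng") -> list[str]:
    """Return OCR text per page, rotating each page upright first (sketch)."""
    # Imported here so the helper above stays usable without the OCR stack.
    import io

    import pymupdf
    import pytesseract
    from PIL import Image

    texts = []
    with pymupdf.open(path) as doc:
        for page in doc:
            # Render the page as it is currently rotated and let OSD inspect it.
            pix = page.get_pixmap(dpi=dpi)
            img = Image.open(io.BytesIO(pix.tobytes("png")))
            try:
                osd = pytesseract.image_to_osd(
                    img, output_type=pytesseract.Output.DICT
                )
                rotate = int(osd["rotate"])
            except pytesseract.TesseractError:
                rotate = 0  # OSD can fail on pages with very little text

            # Assumption: adding the OSD value to /Rotate turns the page upright.
            page.set_rotation(corrected_rotation(page.rotation, rotate))

            # Full-page OCR with the Tesseract embedded in PyMuPDF.
            tp = page.get_textpage_ocr(dpi=dpi, language=language, full=True)
            texts.append(page.get_text(textpage=tp))
    return texts
```

Calling `ocr_with_auto_rotation("scan.pdf")` (with a placeholder file name) would return one string per page. The rotation is only changed in memory, since the document is never saved.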