-
Notifications
You must be signed in to change notification settings - Fork 184
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Idea : merge digital extraction result and OCR result. #24
Comments
pdf不就是直接就能解析出文字么,提高准确性体现在哪? |
如果pdf含有公式,直接解析会保留文字,但是会破坏公式。 |
所以我理解可以这么做,转成图片后用P2T的MFD检测出数学公式所在位置,然后在原始PDF里把这些位置的文字替换为识别出的Latex表示即可。 |
没错,比如说: |
是的,对于数字原生pdf,版面恢复可以结合Layout-Parser/layout-parser: A Unified Toolkit for Deep Learning Based Document Image Analysis: https://github.com/Layout-Parser/layout-parser 二者可以融合而一下 |
Hi, I'm currently developing a pdf parser specialised for math pdf. The non-OCR solutions offer great accuracy for text because they are simply extracted, not detected optically. So, is it possible to merge the non-OCR results and Pix2Text results to improve the accuracy?
嗨,我目前正在开发一个专门用于数学 PDF 的解析器。非 OCR 解决方案对于文本具有很高的准确性,因为它们是直接提取的,而不是通过光学检测。那么,是否可以将非 OCR 结果和 Pix2Text 结果合并以提高准确性呢?
The text was updated successfully, but these errors were encountered: