Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

docling identified my entire page as a picture #357

Open
aodingpeng opened this issue Nov 18, 2024 · 4 comments
Open

docling identified my entire page as a picture #357

aodingpeng opened this issue Nov 18, 2024 · 4 comments
Assignees
Labels
bug Something isn't working

Comments

@aodingpeng
Copy link

aodingpeng commented Nov 18, 2024

Bug

I need to identify this page, but it seems that Docling has recognized my page as an image

image

file:
ISO IEC 23090-5DUP.pdf

Is there any way to solve this problem?

@aodingpeng aodingpeng added the bug Something isn't working label Nov 18, 2024
@mllife
Copy link

mllife commented Nov 18, 2024

This is a scanned document. You should use OCR argument to parse it.

@aodingpeng
Copy link
Author

This is a scanned document. You should use OCR argument to parse it.

I added OCR to my final command, but the layout analysis still referred to the image

@aodingpeng
Copy link
Author

这是扫描的文档。您应该使用 OCR 参数来解析它。
我是新手,请问如何才能启动源码?

@cau-git
Copy link
Contributor

cau-git commented Nov 19, 2024

@aodingpeng I will investigate this issue. My suspicion is that the layout of this page is wrongly detected as a full page picture, hence all content in the detected picture is lost (so far Docling ignores in-picture text). OCR won't solve this alone.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants