-
Notifications
You must be signed in to change notification settings - Fork 532
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Leverage word bbox from pdf-parser-v2 in the layout- and table-model #285
Comments
Working on implementation, it will require some refactoring of page_preprocessing_model as well as table_structure_model. |
thank you very much |
@maxmnemonic you can leverage this new feature in docling-parse (DS4SD/docling-parse#57) |
@PeterStaar-IBM , any update to https://github.com/DS4SD/docling-ibm-models for table-parsing to utilise this? |
It is coming ;) |
@PeterStaar-IBM , I am using your table model and integrated to my code based on the tests which uses iocr.parse format https://github.com/DS4SD/docling-ibm-models/blob/main/tests/test_data/samples/ADS.2007.page_123.png_iocr.parse_format.json ; what are changes required for me? It already had word based bboxes. |
Requested feature
We have much finer grained bbox information using the docling-parse-v2, which could be easily leveraged by layout and table model for improved accuracy.
The text was updated successfully, but these errors were encountered: