Describe the bug
Currently, unstructured and unstructured-inference use different default layout models, so there are many differences in the elements extracted by the two libraries.
This PR was initially created to close GitHub Issue #1604 (Synchronizing the default
layout model), but since it was already resolved in PR
[#1607](#1607), this
PR now only adds the visualization script used to investigate the issue.
### Summary
- add a Python script to annotate elements
PDF:
[references.pdf](https://github.com/Unstructured-IO/unstructured/files/12778270/references.pdf)
### Evaluation
```
PYTHONPATH=. python examples/layout-analysis/visualization.py references.pdf hi_res
```
Describe the bug
Currently, `unstructured` and `unstructured-inference` use different default layout models, so there are many differences in the elements extracted by the two libraries. (`unstructured-inference` PR #201)
To Reproduce
PDF: references.pdf
unstructured
unstructured-inference
Screenshots
Elements extracted with the `unstructured` library
Elements extracted with the `unstructured-inference` library
Expected behavior
The elements extracted by the two libraries shouldn't be too different.
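One rough way to quantify how different the two outputs are is to compare how many elements of each category each library produced. The sketch below is only an illustration: `category_diff` is a hypothetical helper, and the category labels are made-up sample data, not results from `references.pdf` or from either library.

```python
from collections import Counter


def category_diff(elements_a, elements_b):
    """Return per-category count differences (a minus b), omitting equal counts."""
    counts_a = Counter(elements_a)
    counts_b = Counter(elements_b)
    categories = set(counts_a) | set(counts_b)
    return {
        cat: counts_a[cat] - counts_b[cat]
        for cat in sorted(categories)
        if counts_a[cat] != counts_b[cat]
    }


# Hypothetical category labels, standing in for the element types
# each library would report for the same PDF.
from_unstructured = ["Title", "NarrativeText", "NarrativeText", "ListItem"]
from_inference = ["Title", "NarrativeText", "Table"]

print(category_diff(from_unstructured, from_inference))
# → {'ListItem': 1, 'NarrativeText': 1, 'Table': -1}
```

An empty result would mean the two libraries agree on category counts, though not necessarily on bounding boxes or text content.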
Environment Info
Additional context
This issue is related to issue #1602.