Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for Mixed Document Types #734

Open
aakankshaduggal opened this issue Jan 13, 2025 · 0 comments
Open

Support for Mixed Document Types #734

aakankshaduggal opened this issue Jan 13, 2025 · 0 comments
Labels
enhancement New feature or request

Comments

@aakankshaduggal
Copy link

Requested feature

Docling currently encounters issues when processing markdown documents that contain HTML elements, such as tables. This limitation affects the robustness of our data generation processes, particularly when dealing with documents that mix markdown syntax with HTML tags. Enhancing Docling's parsing capabilities to support mixed document types will enable seamless processing of documents that combine markdown with HTML tags, thereby supporting more complex data scenarios and contributing to the accuracy and utility of generated datasets.

Alternatives

...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant