text extraction while font Encoding is a PDFStream object #279

Closed
cuteufo opened this issue Jul 26, 2019 · 3 comments

Comments

@cuteufo

cuteufo commented Jul 26, 2019

I am encountering issues while extracting text from a page whose font Encoding points to a stream object with embedded CMap data.
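
For readers unfamiliar with this case: in a composite (Type 0) font, the Encoding entry may be either the name of a predefined CMap or a stream whose data is a CMap program mapping character codes to CIDs. The sketch below is only an illustration of that idea using the standard library; it is not pdfminer's implementation. The sample bytes and the helper names `parse_cid_ranges` and `code_to_cid` are hypothetical, and it assumes the stream data has already been decoded (any filters removed).

```python
import re

# Hypothetical sample: a fragment of a decoded CMap program, as it might
# appear in the data of an Encoding stream.
SAMPLE_CMAP = b"""
1 begincodespacerange
<0000> <FFFF>
endcodespacerange
2 begincidrange
<0000> <00FF> 0
<0100> <01FF> 256
endcidrange
"""


def parse_cid_ranges(data):
    """Collect (first_code, last_code, first_cid) tuples from cidrange blocks."""
    ranges = []
    # Each block sits between the begincidrange and endcidrange operators.
    for block in re.findall(rb"begincidrange(.*?)endcidrange", data, re.S):
        # An entry is two hex strings followed by a decimal CID.
        entry = rb"<([0-9A-Fa-f]+)>\s*<([0-9A-Fa-f]+)>\s*(\d+)"
        for lo, hi, cid in re.findall(entry, block):
            ranges.append((int(lo, 16), int(hi, 16), int(cid)))
    return ranges


def code_to_cid(code, ranges):
    """Map a character code to a CID, or return None if no range covers it."""
    for lo, hi, first_cid in ranges:
        if lo <= code <= hi:
            return first_cid + (code - lo)
    return None


ranges = parse_cid_ranges(SAMPLE_CMAP)
cid = code_to_cid(0x0105, ranges)
```

A real CMap can also contain `begincidchar` entries and `usecmap` references, which this sketch deliberately ignores.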

@pietermarsman
Member

Hi @cuteufo, you mention that you have a stream object with embedded CMap data. Recently, @fakabbir did a lot of work on that in PR #264. It is ready for merging, but has not yet been merged or released. Perhaps you could check out his code locally and see if that solves your problem.

@fakabbir
Contributor

Hi @cuteufo. Any updates on the issue? You can pull the latest changes from #283 to test your case.

@jstockwin
Member

I am closing this due to lack of activity. Both the referenced PRs have been merged. If this is still an issue, please either comment, re-open, or file a new issue.
