PyPDF2 can't read French Accented Characters (œ) #532

Closed
hamzaAmier opened this issue Jan 3, 2020 · 2 comments
Labels
is-bug: From a user's perspective, this is a bug (a violation of the expected behavior with a compliant PDF)
workflow-text-extraction: From a user's perspective, text extraction is the affected feature/workflow

Comments

@hamzaAmier

Hello,
I'm working on a text mining project using PyPDF2 and I'm having trouble extracting the character "œ" (a French character). The `extractText()` method of a page object doesn't detect it at all.

Thank you for your help.
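
A minimal reproduction along these lines might help narrow the problem down. It is only a sketch: it assumes the PyPDF2 1.x API (`PdfFileReader`, `getPage`, `extractText`) that was current when this issue was opened, and the helper names `extract_first_page_text` and `missing_characters` are hypothetical, not part of PyPDF2.

```python
def extract_first_page_text(pdf_path):
    """Return the text of the first page, using the PyPDF2 1.x API."""
    # Imported lazily so the helper below can be used without PyPDF2 installed.
    # Assumption: PyPDF2 1.x; later releases renamed these calls.
    from PyPDF2 import PdfFileReader

    with open(pdf_path, "rb") as f:
        reader = PdfFileReader(f)
        return reader.getPage(0).extractText()


def missing_characters(extracted, expected_chars):
    """Return, sorted, the expected characters absent from the extracted text."""
    return sorted(ch for ch in set(expected_chars) if ch not in extracted)


# Hypothetical usage, with "sample.pdf" standing in for the affected file:
#   text = extract_first_page_text("sample.pdf")
#   print(missing_characters(text, "œŒ"))
```

If `missing_characters` reports "œ" for a page that visibly contains it, the character is being lost during extraction rather than later in the text mining pipeline.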

@MartinThoma MartinThoma added the is-bug From a users perspective, this is a bug - a violation of the expected behavior with a compliant PDF label Apr 8, 2022
@MartinThoma
Member

I think #464 might solve this

@MartinThoma MartinThoma added the workflow-text-extraction From a users perspective, text extraction is the affected feature/workflow label Apr 16, 2022
@MartinThoma
Member

Duplicates #524

2 participants