Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x97 in position 8811: invalid start byte #181

Open
sentry-io bot opened this issue Feb 27, 2024 · 0 comments

Comments

@sentry-io
Copy link

sentry-io bot commented Feb 27, 2024

Sentry Issue: DOCTOR-E

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x97 in position 8811: invalid start byte
  File "doctor/views.py", line 101, in extract_doc_content
    content, err, returncode = extract_from_html(fp)
  File "doctor/tasks.py", line 339, in extract_from_html
    content = f.read()

This is linked to the courtlistener Sentry issue https://freelawproject.sentry.io/issues/5017932231/?project=5257254, the events were registered at almost the same time.

Also, is one of the causes of this issue freelawproject/courtlistener#3811

Filed by @grossir

grossir added a commit to grossir/doctor that referenced this issue Feb 27, 2024
UnicodeDecodeError was bubbling because except clause was not catching it

Solves freelawproject#181
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: No status
Development

No branches or pull requests

0 participants