Farsi in URL wrongly encoded to "Double UTF-8" #490

Open
manu-cyber opened this issue Nov 1, 2024 · 0 comments
@manu-cyber
I encountered an issue where Farsi characters in a redirect URL are percent-encoded incorrectly by wpull.

The quote below contains pretty much all the information I have on the topic, but I’m happy to answer additional questions if I can.

17:13:18 <JAA> c3manu: Re 8c1e0e2phxz76zhg3famykftb, not entirely sure what's going on there. https://paydarymelli.ir/fa/tag/1/%D8%AF%D8%A7%D8%B9%D8%B4%DB%8C redirects to 'Location: /fa/tags/1114/1/داعشی', which wpull then encodes as https://paydarymelli.ir/fa/tags/1114/1/%C3%98%C2%AF%C3%98%C2%A7%C3%98%C2%B9%C3%98%C2%B4%C3%9B%C2%8C rather than https://paydarymelli.ir/fa/tags/1114/1/%D8%AF%D8%A7%D8%B9%D8%B4%DB%8C
17:13:24 <JAA> for some reason, i.e. double UTF-8.
17:13:43 <JAA> I guess it's the same class of issue we've seen before regarding unencoded Location headers.
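
For reference, here is a minimal sketch that reproduces the two encodings from the log using only the Python standard library. It assumes the unencoded `Location` header bytes are decoded as Latin-1 and then percent-encoded as UTF-8; that is a guess at the mechanism, not something confirmed in wpull's code. The helper `undo_double_utf8` is hypothetical and only there to show the transformation is reversible.

```python
from urllib.parse import quote, unquote

# Percent-encoded tag from the original request URL (taken from the log above).
expected = "%D8%AF%D8%A7%D8%B9%D8%B4%DB%8C"

# Raw bytes as a server would send them in an unencoded Location header.
header_bytes = unquote(expected).encode("utf-8")

# Assumption: the header bytes are decoded as Latin-1 instead of UTF-8.
misdecoded = header_bytes.decode("latin-1")

# Percent-encoding the mis-decoded text as UTF-8 gives the "double UTF-8" form.
double_encoded = quote(misdecoded)

# Decoding the header bytes as UTF-8 first gives the expected form.
single_encoded = quote(header_bytes.decode("utf-8"))


def undo_double_utf8(segment: str) -> str:
    """Hypothetical helper: strip one layer of Latin-1/UTF-8 mojibake."""
    return quote(unquote(segment).encode("latin-1").decode("utf-8"))


print(double_encoded)
print(single_encoded)
print(undo_double_utf8(double_encoded))
```

Under that assumption, the first line printed matches the path segment wpull produced, and the other two match the expected one.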

cc @JustAnotherArchivist
