Handle non-ASCII chars correctly #22

Boudewijn26 · 2016-01-18T12:27:04Z

According to #21 tokens aren't correctly calculated when they contain non-ASCII chars. I found and fixed the bug, as well as adding a couple extra tests.

As per pndurette#21 non ASCII chars were not correctly being calculated. This due to an error in the calculate_token function. The diff should be self explanatory when comparing it with the token-script.js. I added a couple of tests to make sure all unicode chars are correctly being calculated.

Boudewijn26 · 2016-01-18T12:41:42Z

Any suggestions as to properly handle both Python 2.7 and 3.x unicode string handling without too much ado? (Besides dropping support for Python 3.2?)

As it turns out a lot of the token-script was just doing the utf-8 encoding of a piece of text. Python can also do that, so now it's way simpler.

desbma · 2016-01-18T20:22:02Z

b"h\xc3\xa9".decode("utf-8") would return text hé for Python 2.7, 3.2 and 3.4.

Otherwise there is six.text_type, but that would introduce a new dependency.

pndurette · 2016-01-18T23:10:42Z

From what I read, dropping Python 3.2 is the thing to do and what most projects are doing. So I guess we can agree on this. Thanks a lot for this (fast) fix @Boudewijn26! Will release this shortly.

Handle non-ASCII chars correctly

pndurette and others added 10 commits January 13, 2016 00:52

Version bump 1.1.0

3f8d0ff

Quotes around password in .travis.yml

a66dcc6

Added pip upgrade to .travis.yml 'script'

a04e559

Revert .travis.yml changes, fix bad deploy username

4761f4c

Added matrix restriction (python 3.4) for deploy in .travis.yml

7db3915

Made setup.py use README.md, TravisCI might have a symlink issue

15952e9

Add MANIFEST.in, with README.md and CHANGES.txt

b6c674f

Version bump 1.1.2

b652774

Make tests compatible with Python 2.7 string handling.

b30a0ba

Replace handwritten encoding to python call

98ff275

As it turns out a lot of the token-script was just doing the utf-8 encoding of a piece of text. Python can also do that, so now it's way simpler.

pndurette added a commit that referenced this pull request Jan 24, 2016

Merge pull request #22 from Boudewijn26/master

81e60ca

Handle non-ASCII chars correctly

pndurette merged commit 81e60ca into pndurette:develop Jan 24, 2016

pndurette mentioned this pull request Jan 25, 2016

Wrong token with accentuated chars #21

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Handle non-ASCII chars correctly #22

Handle non-ASCII chars correctly #22

Boudewijn26 commented Jan 18, 2016

Boudewijn26 commented Jan 18, 2016

desbma commented Jan 18, 2016

pndurette commented Jan 18, 2016

Handle non-ASCII chars correctly #22

Handle non-ASCII chars correctly #22

Conversation

Boudewijn26 commented Jan 18, 2016

Boudewijn26 commented Jan 18, 2016

desbma commented Jan 18, 2016

pndurette commented Jan 18, 2016