Bigint encoding takes quadratic O(n^2) time #29

Sekenre · 2018-11-06T13:00:57Z

Does the long integer encoding code have O(n^2) time complexity?
Let's say I have a 10,000-bit integer.

Originally posted by @fsssosei in #28 (comment)

The repeated divisions and inserting at the beginning of the list get noticeably slow once you get to about 100000 digits.

>>> timeit(lambda: cbor2.dumps(2**200), number=50000)
1.3028403969080813
>>> timeit(lambda: cbor2.dumps(2**2000), number=50000)
12.468400787521446
>>> timeit(lambda: cbor2.dumps(2**20000), number=50000)
710.3362544665841
>>>
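For illustration, here is a minimal sketch of the divide-and-insert pattern described above. It is a reconstruction from that description, not a copy of the library's code, and the name `int_to_bytes_slow` is hypothetical. Each `divmod` on an n-byte integer costs roughly O(n), and so does each insert at the front of the list, so n iterations add up to roughly O(n^2).

```python
def int_to_bytes_slow(value):
    """Convert a non-negative int to big-endian bytes, one byte per division.

    Hypothetical helper that mirrors the slow pattern; not cbor2's code.
    """
    if value < 0:
        raise ValueError("only non-negative integers are handled here")
    digits = []
    while value:
        # Dividing an n-byte integer by 256 touches every byte of it.
        value, remainder = divmod(value, 256)
        # Inserting at index 0 shifts every element already in the list.
        digits.insert(0, remainder)
    # Zero still needs one byte of output.
    return bytes(bytearray(digits or [0]))
```

Doubling the size of the integer should roughly quadruple the run time of this function, which matches the shape of the timings above.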

Sekenre · 2018-11-06T13:08:51Z

I have a fix for this in my repo and will merge it in once I've done some more testing.

The algorithm used for bigint encoding comes from Stack Overflow and is perfectly reasonable for arbitrary bases in C, but carries a lot of overhead in Python.

The most efficient method I've found so far converts the integer into its hexadecimal representation and then unhexlifies the string.
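A minimal sketch of that hex-based approach, assuming non-negative input. The function name `int_to_bytes_hex` is made up for this example and the merged fix may differ in detail. The only real subtlety is that `unhexlify` needs an even number of hex digits, so a leading zero is added when necessary.

```python
from binascii import unhexlify


def int_to_bytes_hex(value):
    """Convert a non-negative int to big-endian bytes via its hex string.

    Hypothetical helper for illustration; not necessarily the merged code.
    """
    if value < 0:
        raise ValueError("only non-negative integers are handled here")
    hex_digits = "%x" % value  # formatting in base 16 is a single C-level call
    if len(hex_digits) % 2:
        # unhexlify() rejects odd-length input, so pad to whole bytes.
        hex_digits = "0" + hex_digits
    return unhexlify(hex_digits)
```

Both steps run in C and, as far as I know, in time roughly linear in the size of the integer, so the Python-level loop disappears entirely.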

Sekenre · 2018-11-06T18:52:55Z

Performance improvement on Python 3.6:

>>> timeit(lambda: cbor2.dumps(2**20000), number=50000)
3.508080048715488

Solution: Encode to base16 string and then decode to bytes.
* Most performant big integer encodings for 2.7 and >=3.2
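The ">=3.2" part presumably refers to `int.to_bytes`, which was added in Python 3.2. That reading is an assumption on my part. A sketch of what that path could look like (the helper name `int_to_bytes_builtin` is hypothetical):

```python
def int_to_bytes_builtin(value):
    """Convert a non-negative int to big-endian bytes with int.to_bytes.

    Hypothetical helper; requires Python 3.2 or later.
    """
    if value < 0:
        raise ValueError("only non-negative integers are handled here")
    # Round the bit length up to whole bytes; zero still takes one byte.
    length = (value.bit_length() + 7) // 8 or 1
    return value.to_bytes(length, "big")
```

On Python 2.7 there is no `int.to_bytes`, so the hex string route is the natural fallback there.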

Sekenre · 2018-11-12T10:27:37Z

The fix has been merged and will be in the next release.

…gronholm#31)
Solution: Encode to base16 string and then decode to bytes.
* Most performant big integer encodings for 2.7 and >=3.2
Problem: Bug agronholm#15, CBOR maps can contain unhashable types as keys
Solution: Use a context manager for decoding keys and set items which makes sure to use a hashable type. Also indefinite length bytestring decoding returns bytes() instead of bytearray().
Fixes for flake8 tests
Problem: HashableMap is unclear and bogus
Solution: Make it clear we are creating an immutable type and call it FrozenDict to be like the builtin frozenset()
Remove premature optimization
Add a test for nested immutable maps
Test should verify a value that is not nested in a key uses mutable type
Whitespace fixes
Add tests for unused magic methods on FrozenDict
An extra space is a terrible thing
Fixing merge conflict: Switched from absolute to relative imports
This was requested by Mercurial developers to facilitate vendoring. See agronholm#20 for the discussion.
Make the immutable flag a read-only property and document it's use
Added FrozenDict to encoder for roundtrip decode->encode
Minor whitespace fix

…gronholm#31)
Solution: Encode to base16 string and then decode to bytes.
* Most performant big integer encodings for 2.7 and >=3.2

Sekenre self-assigned this Nov 6, 2018

Sekenre added a commit that referenced this issue Nov 12, 2018

Fix for #29 encoding integers > 2**64 takes quadratic time (#31)

549d8d9

Solution: Encode to base16 string and then decode to bytes.
* Most performant big integer encodings for 2.7 and >=3.2

Sekenre closed this as completed Nov 12, 2018
