Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HyperLogLog structure way above average in size #189

Closed
seiflotfy opened this issue Jul 3, 2017 · 2 comments
Closed

HyperLogLog structure way above average in size #189

seiflotfy opened this issue Jul 3, 2017 · 2 comments

Comments

@seiflotfy
Copy link
Contributor

seiflotfy commented Jul 3, 2017

Redis and Elasticsearch use a precision 14 by default for HyperLogLog which 16KB as opposed to veneur which uses 262KB. Any reason behind that? Also in the [paper by Google]

Here you seem to use precision 18!

(https://stefanheule.com/papers/edbt13-hyperloglog.pdf) most simulations were done with 2^14 registers. Any reason why not use precision 14?

@cory-stripe
Copy link
Contributor

Good question! I don't recall why 18 was chosen. 14 certainly seems to be the consensus from the above paper. I will discuss with the team and see if have any good reasons.

@cory-stripe
Copy link
Contributor

Hey @seiflotfy! In #190 we merged a new HLL implementation that defaults to 14 registers, so in the upcoming 1.7 release, we'll be good to go! Closing this as a result.

Thanks for the suggestion and sorry for the wait!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants