-
Notifications
You must be signed in to change notification settings - Fork 604
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(mssql): add hashbytes and test for binary output hash fns #8107
Conversation
86e29d2
to
4efcdbf
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Overall LGTM. Just a few docstring requests.
And I guess now we can commence debate over the method name? 🐎
I like the name hexdigest
for the new operation, seems clear and matches the equivalent python stdlib name.
Should we discuss deprecating whichever other method overlaps with the new |
feat(mssql): add hexdigest Apply suggestions from code review Co-authored-by: Jim Crist-Harif <[email protected]>
So it would be hashbytes, although currently there is a difference in that hashbytes returns the binary hash and there are at least a few backends that support both of those options. |
…project#8107) This adds support for `ops.HashBytes` to `mssql` and also adds a test for that functionality so it's easier to port when we merge in the epic split branch. I've also added a new op, `HashHexDigest` which returns the hexdigest of various cryptographic hashing functions since I imagine this is what many users are _probably_ after. This newer op (and corresponding `hexdigest` method) can also support many more backends, as most of them default to returning the string hex digest and not the raw binary output. I tried to be very accurate in the `notimpl` and `notyet` portions of both tests and I think I've done that. For now, only exposing DuckDB, Pyspark, and MSSQL so we don't add a huge extra burden for the epic split but also address the user request in And I guess now we can commence debate over the method name? 🐎 Resolves ibis-project#8082 --------- Co-authored-by: Jim Crist-Harif <[email protected]>
Description of changes
This adds support for
ops.HashBytes
tomssql
and also adds a test for that functionality so it's easier to port when we merge in the epic split branch.I've also added a new op,
HashHexDigest
which returns the hexdigest of various cryptographic hashing functions since I imagine this is what many users are probably after. This newer op (and correspondinghexdigest
method) can also support many more backends, as most of them default to returning the string hex digest and not the raw binary output.I tried to be very accurate in the
notimpl
andnotyet
portions of both tests and I think I've done that.For now, only exposing DuckDB, Pyspark, and MSSQL so we don't add a huge extra burden for the epic split but also address the user request in #8082
And I guess now we can commence debate over the method name? 🐎
Issues closed
Resolves #8082