Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Locality sensitive hashing annotation #69

Merged
merged 5 commits into from
Dec 13, 2022
Merged

Locality sensitive hashing annotation #69

merged 5 commits into from
Dec 13, 2022

Conversation

Uinelj
Copy link
Member

@Uinelj Uinelj commented Sep 26, 2022

Adds a TLSH hash value as annotation (when possible).

This aims to ease the computation of OSCAR diffs (each document has a URL+TLSH, which could help build a "similarity score" that would help identify similar pages throughout differrent OSCAR releases.

@Uinelj Uinelj added this to the v1.3.0 milestone Sep 26, 2022
@Uinelj
Copy link
Member Author

Uinelj commented Sep 26, 2022

TODO:

  • Push changes on tlsh-rs

@Uinelj Uinelj marked this pull request as ready for review December 13, 2022 11:49
@Uinelj Uinelj merged commit 1ba6d5b into dev Dec 13, 2022
@Uinelj Uinelj deleted the dev-diffing branch December 13, 2022 13:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant