Narek/storage upgrade #262

Merged
Ngalstyan4 merged 31 commits into main from narek/storage-upgrade on Feb 7, 2024

Conversation

Ngalstyan4
Contributor

No description provided.

codecov bot commented Jan 22, 2024

Codecov Report

Attention: 5 lines in your changes are missing coverage. Please review.

Comparison is base (2d6cbcf) 77.26% compared to head (dc216a9) 75.68%.
Report is 3 commits behind head on main.

Additional details and impacted files
@@            Coverage Diff             @@
##             main     #262      +/-   ##
==========================================
- Coverage   77.26%   75.68%   -1.58%     
==========================================
  Files          23       23              
  Lines        1869     1929      +60     
  Branches      480      505      +25     
==========================================
+ Hits         1444     1460      +16     
- Misses        220      247      +27     
- Partials      205      222      +17     
Files                           Coverage Δ
src/hnsw/build.c                80.15% <100.00%> (ø)
src/hnsw/external_index.c       85.78% <100.00%> (-4.64%) ⬇️
src/hnsw/scan.c                 81.13% <100.00%> (-0.43%) ⬇️
src/hnsw/utils.c                62.12% <100.00%> (+0.58%) ⬆️
src/hnsw/validate_index.c       46.81% <100.00%> (+0.18%) ⬆️
src/hnsw/insert.c               79.16% <87.50%> (-0.25%) ⬇️
src/hnsw.c                      79.55% <75.00%> (-0.92%) ⬇️
src/hnsw/usearch_storage.cpp    90.47% <90.47%> (ø)

... and 1 file with indirect coverage changes

@Ngalstyan4 Ngalstyan4 force-pushed the narek/storage-upgrade branch 2 times, most recently from fd4316a to dcce645 Compare February 4, 2024 06:48
uint32 neighbors_nr;
uint32 neighbors_max;
uint32 *neighbors;
uint32 unused;

LDB_VI_READ_NODE_CHUNK(vi_node, vi_node->vn_label, node_tape, &tape_pos, node_tape_size);
LDB_VI_READ_NODE_CHUNK(vi_node, vi_node->vn_dim, node_tape, &tape_pos, node_tape_size);
Contributor Author

Deleted because the new binary index does not contain this information, so there is no room for inconsistency.

When I ran the old command on a machine that had
a really old Lantern (v0.0.3), I noticed that
it was never updated to the latest version
and the old version ran instead.
This makes sure that does not happen with our default docker command.
Update usearch
Remove hnswlib dependency
Get closer to upstream usearch API
Switch to using usearch_search_ef, the new upstream-compatible API in the C API of usearch
Update storage level for external index generation
…ndomness

Newer storage made the index slightly smaller, which
put the old m=4 sift 1k index near the border of a filled
index page. As a result, depending on the level distribution
in a run, the regression test would produce a differently sized
index and fail.

Here we increase m slightly to make it very unlikely that a test will
fail as a result of level randomness.
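
For context, a minimal sketch of why level randomness can change the index size. It assumes the textbook HNSW level assignment, level = floor(-ln(u) / ln(m)) for a uniform u in (0, 1]; usearch's actual sampling code may differ. The names `sample_level` and `count_upper_nodes` are hypothetical and do not come from this PR. Nodes above level 0 store extra neighbor lists, so a different count of such nodes between runs means a differently sized index.

```c
#include <assert.h>
#include <stdlib.h>

/*
 * Hypothetical helper: textbook HNSW level for a uniform draw u in (0, 1].
 * Computes the same level as floor(-ln(u) / ln(m)), up to floating-point
 * rounding, by counting how many times u can be scaled by m while it is
 * still at most 1/m. This avoids a dependency on libm.
 */
static int sample_level(double u, int m)
{
    int level = 0;

    assert(m > 1);
    assert(u > 0.0 && u <= 1.0);

    while (u <= 1.0 / (double)m) {
        u *= (double)m;
        level++;
    }
    return level;
}

/*
 * Hypothetical helper: count how many of n nodes land above level 0
 * for a given seed. Uses the C library rand(), which is an assumption
 * made for illustration only.
 */
static int count_upper_nodes(unsigned seed, int n, int m)
{
    int upper = 0;

    srand(seed);
    for (int i = 0; i < n; i++) {
        /* Map rand() into (0, 1] so that u is never zero. */
        double u = ((double)rand() + 1.0) / ((double)RAND_MAX + 1.0);

        if (sample_level(u, m) > 0)
            upper++;
    }
    return upper;
}
```

Under this assumption, `count_upper_nodes(seed, 1000, 4)` is expected to be around 1000/4 and will generally differ from seed to seed, which is the kind of run-to-run variation that can push an index sitting near a page boundary onto one more or one fewer page.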
@Ngalstyan4 Ngalstyan4 force-pushed the narek/storage-upgrade branch 2 times, most recently from 96be1b7 to 3cc22f4 Compare February 7, 2024 11:15
@Ngalstyan4 Ngalstyan4 merged commit 1caa6bf into main Feb 7, 2024
39 of 45 checks passed
@Ngalstyan4 Ngalstyan4 deleted the narek/storage-upgrade branch February 7, 2024 20:29