Add Querying Functionality to OSB #409

jmazanec15 · 2022-05-23T04:09:13Z

Description

Adds the ability to run a query workload from a data set with the OpenSearch Benchmark tool for k-NN workloads. Refactors some of the code to better share components across extensions.

In addition, adds unit tests for the custom param sources.

Recall metrics are tracked in opensearch-project/opensearch-benchmark#199 and are not covered in this PR.

Issues Resolved

#373

Check List

- New functionality includes testing.
  - All tests pass
- New functionality has been documented.
  - New functionality has javadoc added
- Commits are signed as per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Adds random query workloads to both the train and no-train test procedures. Adds custom parameter source to produce the queries. Add usage of the parameter source to both json files. Updated documentation. Signed-off-by: John Mazanec <[email protected]>

Adds custom param source that will allow users to pull queries from a data set as opposed to using random queries. Along with this, refactored parameter sources to share common functionality. Updated README Signed-off-by: John Mazanec <[email protected]>

Reads query vecs from data set in batches to avoid making too many disk reads. Batch size is hardcoded to 100. Signed-off-by: John Mazanec <[email protected]>
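As a rough illustration of the batching idea in this commit, here is a minimal sketch that serves vectors one at a time while fetching them from the backing store in slices. The class name `BatchedVectorReader`, its attributes, and the `reads` counter are hypothetical and are not taken from the PR; the only detail from the commit message is the batch size of 100. Any sliceable sequence stands in for the on-disk data set.

```python
from typing import List, Sequence

BATCH_SIZE = 100  # the commit message states the batch size is hardcoded to 100


class BatchedVectorReader:
    """Hypothetical reader: yields one vector per call, reads the store in batches."""

    def __init__(self, data_set: Sequence[List[float]], batch_size: int = BATCH_SIZE):
        self.data_set = data_set
        self.batch_size = batch_size
        self.offset = 0       # index of the next row to fetch from the data set
        self.batch: List[List[float]] = []
        self.batch_pos = 0    # position of the next vector inside the current batch
        self.reads = 0        # number of slice reads issued (for illustration only)

    def __iter__(self):
        return self

    def __next__(self) -> List[float]:
        # Refill the in-memory batch only when it has been fully consumed.
        if self.batch_pos >= len(self.batch):
            if self.offset >= len(self.data_set):
                raise StopIteration
            end = min(self.offset + self.batch_size, len(self.data_set))
            self.batch = list(self.data_set[self.offset:end])  # one read per batch
            self.reads += 1
            self.offset = end
            self.batch_pos = 0
        vector = self.batch[self.batch_pos]
        self.batch_pos += 1
        return vector


# Usage: 250 vectors are served with 3 slice reads instead of 250 single-row reads.
data = [[float(i)] for i in range(250)]
reader = BatchedVectorReader(data)
vectors = list(reader)
```

With an on-disk data set, each slice would correspond to one disk read, so the number of reads drops by roughly a factor of the batch size.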

Add custom query recall runner so that we can eventually compute the recall of queries. Currently, recall value is hard coded but this will be implemented in the future. Signed-off-by: John Mazanec <[email protected]>

Add ability to compute recall score for the custom query runner. Currently, to compute recall, it checks how many of the top k returned results appear in the ground truth set. Signed-off-by: John Mazanec <[email protected]>
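The recall check described in this commit can be sketched as follows. This is not the PR's code: the function name `recall_at_k` and its parameters are hypothetical, and dividing the hit count by k is an assumption, since the commit message only says that the top k results are checked against the ground truth set.

```python
from typing import Iterable, Sequence


def recall_at_k(returned_ids: Sequence[str], ground_truth_ids: Iterable[str], k: int) -> float:
    """Fraction of the top-k returned ids that appear in the ground truth set.

    Assumption: the score is normalized by k (not by the ground truth size).
    """
    if k <= 0:
        raise ValueError("k must be positive")
    truth = set(ground_truth_ids)
    # Count how many of the first k returned ids are true neighbors.
    hits = sum(1 for doc_id in returned_ids[:k] if doc_id in truth)
    return hits / k


# Usage: two of the four returned ids ("a" and "c") are in the ground truth.
score = recall_at_k(["a", "b", "c", "d"], ["a", "c", "x", "y"], k=4)
```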

Cleans up documentation and tracks with addition of query and compute recall functionality. Signed-off-by: John Mazanec <[email protected]>

codecov-commenter · 2022-05-23T04:24:58Z

Codecov Report

Merging #409 (52563b1) into main (a5dd71c) will not change coverage.
The diff coverage is n/a.

@@            Coverage Diff            @@
##               main     #409   +/-   ##
=========================================
  Coverage     84.01%   84.01%           
  Complexity      911      911           
=========================================
  Files           130      130           
  Lines          3879     3879           
  Branches        359      359           
=========================================
  Hits           3259     3259           
  Misses          458      458           
  Partials        162      162


travisbenedict · 2022-05-24T14:49:11Z

I haven't used this functionality myself but this seems like it should work.

Is any of the data defined here showing up in the results? Is it just recall that's missing?

If you haven't already you could try configuring OpenSearch Benchmark to write to an OpenSearch cluster. That would give you access to the full set of raw metrics.

jmazanec15 · 2022-05-24T16:18:12Z

@travisbenedict I've tried a few variations of it, but no, only query latency, throughput, service time, and error rate get output. The Rally docs say that custom metrics are added to the metadata about the operation, but I am not sure how to find or generate those if the tool is not connected to an OpenSearch cluster. Also, ideally, I would like to get the results in the summary.

Here is a current sample of the results: https://gist.github.com/jmazanec15/82b91eaad4af8acd773fbc97ba25b638.

Removes recall calculation from benchmarking logic as this is delayed until opensearch-project/opensearch-benchmark#199 can be implemented. Signed-off-by: John Mazanec <[email protected]>

Removes random query. Random query may be misleading if the distribution of the index data is significantly different than that of the randomness. Signed-off-by: John Mazanec <[email protected]>

Signed-off-by: John Mazanec <[email protected]>

Adds unit tests for param sources for benchmarking. In addition, adds a test utility to create data sets dynamically. Signed-off-by: John Mazanec <[email protected]>

Signed-off-by: John Mazanec <[email protected]>

martin-gaievski

A few general questions:

- How are we going to handle OSB version updates? I think we use officially supported extension points, but I just want to re-confirm that we minimize the chances of breaking things on our end when upgrading to a new OSB version.
- Do you want to use multiple clients for queries in our benchmarks (say, for a k-NN release)? We probably need to come up with some formula to estimate the number of clients based on the cluster configuration.

martin-gaievski · 2022-06-21T16:25:25Z

benchmarks/osb/extensions/param_sources.py

+        Returns:
+            The parameter source for this particular partion
+        """
+        if self.num_vectors % total_partitions != 0:


Does this mean that the data set size must be divisible by the number of parallel clients?
If so, I think in the next revision we need to relax this requirement and divide evenly, except for the last client, which will take the remainder.

Yes, that's a good point. I can update this in a future PR. I will create an issue once this PR is merged.
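The relaxed scheme suggested in this thread (divide evenly and let the last client take the remainder) could look roughly like the sketch below. It is not the code from `param_sources.py`: the function name `partition_bounds` and its parameters are hypothetical, and the half-open `[start, end)` convention is an assumption.

```python
from typing import Tuple


def partition_bounds(num_vectors: int, total_partitions: int, partition_index: int) -> Tuple[int, int]:
    """Return the half-open [start, end) range of vectors for one client.

    Every client gets num_vectors // total_partitions vectors, except the
    last one, which also takes the remainder.
    """
    if total_partitions <= 0:
        raise ValueError("total_partitions must be positive")
    if not 0 <= partition_index < total_partitions:
        raise ValueError("partition_index out of range")

    per_partition = num_vectors // total_partitions
    start = partition_index * per_partition
    if partition_index == total_partitions - 1:
        end = num_vectors  # last client absorbs the remainder
    else:
        end = start + per_partition
    return start, end


# Usage: 10 vectors split across 3 clients; the last client gets 4 vectors.
bounds = [partition_bounds(10, 3, i) for i in range(3)]
```

One trade-off of this approach is that the last client can carry up to `total_partitions - 1` extra vectors, which is negligible for large data sets.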

jmazanec15 · 2022-06-21T18:09:28Z

How are we going to handle OSB version updates? I think we use officially supported extension points, but I just want to re-confirm that we minimize the chances of breaking things on our end when upgrading to a new OSB version.

@martin-gaievski Good question. I think it will most likely be addressed at a later date. Right now, we don't release the benchmarks as artifacts, and we hard-code the dependency on OSB in requirements.txt. I think eventually we will want to move things to https://github.com/opensearch-project/opensearch-benchmark-workloads/, and when we do that we can ensure version compatibility.

jmazanec15 · 2022-06-21T18:13:06Z

do you want to use multiple clients for queries in our benchmarks (say for k-NN release)? We probably need to come up with some formula to estimate number of clients based on cluster configuration.

Yes, I think this PR will focus more on providing the functionality of the benchmarking tool. In a future PR, we will make decisions on configuration. We need some kind of standardization of performance testing for releases.

jmazanec15 added 6 commits May 21, 2022 17:33

Read query vecs from data set in batches

61601a4

Add custom query runner for recall

bc96d92

Add ability to compute recall score

6a1b27b

Clean up docs and procedures

52563b1

jmazanec15 added the Infrastructure label (Changes to infrastructure, testing, CI/CD, pipelines, etc.) May 23, 2022

jmazanec15 requested a review from a team May 23, 2022 04:09

jmazanec15 marked this pull request as draft May 23, 2022 04:09

jmazanec15 mentioned this pull request May 23, 2022

Support Benchmarking K-NN Plugin and Vectorsearch Workload opensearch-project/opensearch-benchmark#103

Closed

jmazanec15 mentioned this pull request May 31, 2022

Make output metrics extendable opensearch-project/opensearch-benchmark#199

Open

jmazanec15 added 4 commits June 6, 2022 20:41

Remove recall calculation from benchmark

14f31fa

Remove random query

16b8de5

Minor bug fixes in data set

17ea4a8

Signed-off-by: John Mazanec <[email protected]>

Add unit tests for param sources for benchmarks

2a32401

jmazanec15 force-pushed the issue-373 branch from 2cfb80e to 6a8d331 Compare June 20, 2022 21:28

jmazanec15 added 2 commits June 20, 2022 15:10

Style fixes

f1f3c58

Signed-off-by: John Mazanec <[email protected]>

Update README

294d1c6

Signed-off-by: John Mazanec <[email protected]>

jmazanec15 force-pushed the issue-373 branch from 6a8d331 to 294d1c6 Compare June 20, 2022 22:10

Fix broken test

95ee6f7

Signed-off-by: John Mazanec <[email protected]>

jmazanec15 marked this pull request as ready for review June 20, 2022 22:15

martin-gaievski reviewed Jun 21, 2022

jmazanec15 requested a review from martin-gaievski June 21, 2022 18:25

martin-gaievski approved these changes Jun 21, 2022

VijayanB approved these changes Jun 21, 2022

jmazanec15 merged commit b59dcff into opensearch-project:main Jun 21, 2022

jmazanec15 mentioned this pull request Jun 21, 2022

[Benchmarking] Allow number of vectors to not be divisible by number of clients #426

Closed

jmazanec15 added the v2.1.0 label Jun 22, 2022
