[Introduce Aggregate Subcommand] Enhance opensearch-benchmark compare command (#630)
Labels: enhancement
This issue is based on one of the proposed priorities in RFC #627.
Background
The existing compare subcommand in OpenSearch Benchmark (OSB) allows users to compare the results of two benchmark test executions by providing the unique IDs (UIDs) of the test executions. However, users have expressed interest in comparing aggregated results from two or more groups of test executions, rather than just two individual tests.
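For reference, today's comparison between two individual test executions looks like this (the IDs are placeholders):

```sh
# Existing behavior: compare two individual test executions by their UIDs
opensearch-benchmark compare \
    --baseline=<baseline_test_execution_id> \
    --contender=<contender_test_execution_id>
```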
Proposed Design
To address this requirement, we propose enhancing the existing compare subcommand to support comparing two aggregated test results.
When comparing two aggregated test results, a validation step will be added to ensure that the underlying workload (the type of operations being performed, the data set being used, etc.) is consistent across the test executions being aggregated. For example, a group that mixes geonames and nyc_taxis test executions would be rejected, since the aggregated numbers would not describe a single workload.
The enhanced compare subcommand will allow users to specify the IDs of two aggregated test results; OSB will perform the necessary validations before comparing them, and the output will display the performance differences between the two groups.
Example Usage:
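A sketch of what the enhanced invocation could look like. The flags are unchanged from today's compare subcommand; the assumption here is that each group of test executions has already been aggregated into a single result that is addressable by its own ID (the aggregate IDs shown are hypothetical):

```sh
# Proposed behavior: pass the IDs of two aggregated results instead of
# individual test execution UIDs. OSB validates that each aggregate was
# built from test executions of the same workload before comparing.
opensearch-benchmark compare \
    --baseline=<aggregated_result_id_1> \
    --contender=<aggregated_result_id_2>
```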
Proposed Priority
The ability to compare two aggregated test results is a feature frequently requested by users. It will enable more accurate and representative performance comparisons by reducing the impact of variability and outliers, particularly when evaluating the effect of changes or optimizations across different configurations or workloads.