Support for fixed number of requests #633

tgerdesnv · 2024-05-07T13:45:26Z

new CLI arg --request-count, which when specified tells PA exactly how many requests to issue for the experiment

Works with concurrency, request-rate, and custom load

Also:

Combined thread_config classes into one, so that the base LoadWorker class has access to it
Cleaned up some printing

… now)

dyastremsky

Nice refactors, especially ThreadConfig. This looks splendid!

matthewkotila

Looks good overall

src/c++/perf_analyzer/request_rate_manager.cc

…equest-count

* first pass. Hardcoded values * Working for concurrency (hardcoded whenever count windows is used for now) * working for req rate as well * Add CLI. Add/fix unit tests * Remove hack. Restore all normal functionality * Refactor thread config into one class. Add more testing * Rename arg to request-count * Fix request rate bug * Update info print * fix corner case * move fixme to a story tag * add assert to avoid corner case * rename variables * self review #1 * copyright changes * add doxygen to functions * Don't allow sweeping over multiple concurrency or request rate with request-count

* Fix empty response bug * Fix unused variable Fix test Initialize logger to capture logs Add unit test Change to _ instead of removing Check if args.model is not None fix artifact path Support Python 3.8 in GenAI-Perf (#643) Add automation to run unit tests and check code coverage for GenAI-Perf against Python 3.10 (#640) Changes to support Ensemble Top Level Response Caching (#560) Support for fixed number of requests (#633) * first pass. Hardcoded values * Working for concurrency (hardcoded whenever count windows is used for now) * working for req rate as well * Add CLI. Add/fix unit tests * Remove hack. Restore all normal functionality * Refactor thread config into one class. Add more testing * Rename arg to request-count * Fix request rate bug * Update info print * fix corner case * move fixme to a story tag * add assert to avoid corner case * rename variables * self review #1 * copyright changes * add doxygen to functions * Don't allow sweeping over multiple concurrency or request rate with request-count fix test (#637) Support custom artifacts directory and improve default artifacts directory (#636) * Add artifacts dir option and more descriptive profile export filename * Clean up * fix input data path * Add tests * create one to one plot dir for each profile run * change the directory look * add helper method Extend genai perf plots to compare across multiple runs (#635) * Modify PlotManager and plots classes * Support plots for multiple runs -draft * Fix default plot visualization * Remove artifact * Set default compare directory * Support generating parquet files * Remove annotations and fix heatmap * Fix errors * Fix pre-commit * Fix CodeQL warning * Remove unused comments * remove x axis tick label for boxplot * Add logging and label for heatmap subplots * Allow users to adjust width and height * fix grammer --------- Co-authored-by: Hyunjae Woo <[email protected]> Generate plot configurations for plot manager (#632) * Introduce PlotConfig and PlotConfigParser class * Port preprocessing steps and introduce ProfileRunData * Create plot configs for default plots * fix minor bug * Fix comment * Implement parse method in PlotConfigParser * refactor * fix test * Add test * Address feedback * Handle custom endpoint Add more metadata to profile export JSON file (#627) * Add more metadata to profile export data * Fix minor bug * refactor Add compare subcommand (#623) * Move for better visibility * Add compare subparser * Add subcommand compare * Fix test * Add ticket * add --files option and minor fix * Fix tests * Add unit tests * Address feedback * Fix minor error and add section header Revert "Changes to support Ensemble Top Level Response Caching (#560) (#642)" This reverts commit cc6a3b2. Changes to support Ensemble Top Level Response Caching (#560) (#642)

* first pass. Hardcoded values * Working for concurrency (hardcoded whenever count windows is used for now) * working for req rate as well * Add CLI. Add/fix unit tests * Remove hack. Restore all normal functionality * Refactor thread config into one class. Add more testing * Rename arg to request-count * Fix request rate bug * Update info print * fix corner case * move fixme to a story tag * add assert to avoid corner case * rename variables * self review #1 * copyright changes * add doxygen to functions * Don't allow sweeping over multiple concurrency or request rate with request-count

tgerdesnv changed the title ~~Support for fixed num requests~~ Support for fixed number of requests May 7, 2024

tgerdesnv added 14 commits May 8, 2024 13:15

first pass. Hardcoded values

76a4cb7

Working for concurrency (hardcoded whenever count windows is used for…

fef9012

… now)

working for req rate as well

ee026a9

Add CLI. Add/fix unit tests

80adc50

Remove hack. Restore all normal functionality

72c5a2a

Refactor thread config into one class. Add more testing

d95c2d4

Rename arg to request-count

32c791f

Fix request rate bug

1fa3842

Update info print

421e0c5

fix corner case

4a15592

move fixme to a story tag

4a5b465

add assert to avoid corner case

20c421a

rename variables

184ad34

self review #1

94c01d2

tgerdesnv force-pushed the tgerdes-fixed-num-req branch from 82d142a to 94c01d2 Compare May 8, 2024 18:15

tgerdesnv marked this pull request as ready for review May 8, 2024 18:15

dyastremsky approved these changes May 8, 2024

View reviewed changes

tgerdesnv added 2 commits May 8, 2024 15:02

copyright changes

51a0456

add doxygen to functions

7b52c93

matthewkotila reviewed May 9, 2024

View reviewed changes

src/c++/perf_analyzer/request_rate_manager.cc Show resolved Hide resolved

Don't allow sweeping over multiple concurrency or request rate with r…

4e7620d

…equest-count

matthewkotila approved these changes May 9, 2024

View reviewed changes

dyastremsky approved these changes May 9, 2024

View reviewed changes

tgerdesnv merged commit c3cf131 into main May 9, 2024
3 checks passed

tgerdesnv deleted the tgerdes-fixed-num-req branch May 9, 2024 19:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for fixed number of requests #633

Support for fixed number of requests #633

tgerdesnv commented May 7, 2024 •

edited

Loading

dyastremsky left a comment

matthewkotila left a comment

Support for fixed number of requests #633

Support for fixed number of requests #633

Conversation

tgerdesnv commented May 7, 2024 • edited Loading

dyastremsky left a comment

Choose a reason for hiding this comment

matthewkotila left a comment

Choose a reason for hiding this comment

tgerdesnv commented May 7, 2024 •

edited

Loading