An experimental "test harness" for ZooKeeper.

- Install Locust. (Tested with version 0.11.0. See note in
  `locust_max_load_seeker.py` if you use a newer version.)
- Install Kazoo. (Versions 2.6.1 and `HEAD` as of 2019-08-06 have been
  tested. `HEAD` is required for SASL support.)
- Install and configure ZooKeeper (left as an exercise for the reader).
- `export ZK_LOCUST_HOSTS=<ensemble>`.
- Run a simple test using the Web UI:

  ```
  $ locust -f locust_set.py
  INFO/locust.main: Starting web monitor at *:8089
  INFO/locust.main: Starting Locust 0.11.1
  ```

  In the Web UI (at `http://localhost:8089` by default), enter e.g. 128
  (users), 32 (/second), and activate "Start swarming." Click "STOP",
  then kill the `locust` command when satisfied.
- Run a 7-worker instance, "distributed" as processes on a single machine:

  ```
  $ mkdir -p tmp
  $ ./multi-locust.sh 7 tmp -f locust_set.py
  locust.main: Starting web monitor at *:8089
  locust.main: Starting Locust 0.11.1
  locust.runners: Client 'teek_14bd0d516df3487a8d173d6cd5018fdf' reported as ready. Currently 1 clients ready to swarm.
  […]
  locust.runners: Client 'teek_370e1be12b7a454284dc3b3bee37c709' reported as ready. Currently 7 clients ready to swarm.
  ```

  Note how it is now possible to keep a multicore machine busy.
- Run headless (`--no-web`) 7-worker instances on a number of test cases,
  collecting (some) statistics, using the provided Make recipe:

  ```
  $ make
  […]
  $ ls out/*.csv | wc -l
  18
  $ tail -n 8 out/set_and_get.log
  Percentage of the requests completed within given times
   Name     # reqs    50%    66%    75%    80%    90%    95%    98%    99%   100%
  --------------------------------------------------------------------------------------
   get      207341     28     29     30     30     32     34     40     47     61
   set       20819     28     29     30     31     32     35     42     48     61
  --------------------------------------------------------------------------------------
   Total    228160     28     29     30     30     32     34     40     47     61
  ```
- Run many-worker instances across a fleet of machines. (Left as an
  exercise for the reader.)
A number of utilities are provided in the `zk_locust` module:

- `KazooLocustClient`: A Locust "client" object which provides helper
  methods as well as direct access to the Kazoo client object via
  `get_zk_client`;
- `ZKLocustClient`: Similar to `KazooLocustClient`, but its backend is a
  thin wrapper around `zkpython`, which allows exercising the "official"
  ZooKeeper client library;
- `ZKLocust`: A Locust subclass which can host task sets and is
  automatically initialized with an instance of `KazooLocustClient`
  (default) or `ZKLocustClient` as the client;
- `LocustTimer`: A Python "context manager" which makes it easy to time
  requests or segments.
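
For orientation, here is a minimal sketch of how these pieces fit
together. This is not one of the provided locustfiles, and the exact
`LocustTimer` constructor arguments and success-reporting conventions
are assumptions; check the `zk_locust` module before copying it:

```python
# Minimal sketch of a locustfile built on the zk_locust utilities.
# Assumes ZKLocust and LocustTimer are importable from zk_locust; the
# LocustTimer call signature shown here is an assumption.
from locust import TaskSet, task

from zk_locust import ZKLocust, LocustTimer


class SimpleTasks(TaskSet):
    @task
    def set_and_get(self):
        # self.client is the KazooLocustClient (or ZKLocustClient)
        # instance that ZKLocust created for this Locust user.
        zk = self.client.get_zk_client()
        with LocustTimer('set_and_get'):  # time the enclosed segment
            zk.ensure_path('/zk-locust-sketch')
            zk.set('/zk-locust-sketch', b'payload')
            zk.get('/zk-locust-sketch')


class Simple(ZKLocust):
    task_set = SimpleTasks
```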
The `locust_extra.stats` module enables the collection of an extended
set of statistics (compared to Locust's `--csv` parameter).

A Locust script can enable the monitor by including the following bit:

```python
from locust_extra.stats import register_extra_stats

register_extra_stats()
```

The `--stats-collect`/`LOCUST_EXTRA_STATS_COLLECT` and
`--stats-csv`/`LOCUST_EXTRA_STATS_CSV` parameters direct the module to
dump Locust statistics as a "time series." (See "Parameters" below for
details.)
The `locust_extra.control` module provides a mechanism and utilities
for dynamically controlling the Locust "runner," including dynamically
changing the number of clients.

An example is provided in `locust_set_with_controller.py`.
The `zk_metrics` module integrates ZooKeeper metrics with the Locust
Web application and CSV output. A Locust script can use the following
bit to register a ZooKeeper metrics monitor:

```python
from zk_metrics import register_zk_metrics

register_zk_metrics()
```

If the Web UI is enabled via the
`--zk-metrics-collect`/`ZK_LOCUST_ZK_METRICS_COLLECT` collection
parameter (see "Parameters" below), the monitor adds a page under
`/zk-metrics` in the Locust UI. That page, when open, regularly polls
the servers of the ensemble to gather metrics

1. whose "instant" value can be displayed on the "Statistics" page;
2. whose "history" can be plotted on the "Charts" page;
3. which are recorded to the file specified via
   `--zk-metrics-csv`/`ZK_LOCUST_ZK_METRICS_CSV`.

Points 1 and 2 can help interactive exploration, whereas point 3
enables "offline" data analysis. Alternatively, the collection
parameter can be set to an integer number of milliseconds to perform
Locust-side polling without setting up a Web UI.
The `zk_dispatch` module provides a mechanism and utilities for
manipulating the ZooKeeper ensemble. It can, for example, disable the
leader and/or other members, forcing an election or a migration of the
affected clients.

An example is provided in `locust_set_with_dispatcher.py`.
Most parameters can either be controlled by "flag" arguments to the
`parameterized-locust.sh` wrapper script (starting with `--`), or by
setting (upper-case) environment variables. (Note that the wrapper
systematically clears the latter when the flag argument is known.)

- `--hosts`, `ZK_LOCUST_HOSTS`: A ZooKeeper "connect string" including
  the addresses of the ensemble;
- `--client`, `ZK_LOCUST_CLIENT`: Selects the `ZKLocust` backend, unless
  overridden by a subclass. Valid values include `kazoo` (default) and
  `zkpython`;
- `--pseudo-root`, `ZK_LOCUST_PSEUDO_ROOT`: A "pseudo root" for tests.
  Note that this is not a "chroot" in the ZooKeeper sense; it is purely
  advisory;
- `--min-wait`, `ZK_LOCUST_MIN_WAIT`: The default value of the
  `min_wait` Locust setting used by `ZKLocust` subclasses (defaults to
  `0`);
- `--max-wait`, `ZK_LOCUST_MAX_WAIT`: The default value of the
  `max_wait` Locust setting used by `ZKLocust` subclasses (defaults to
  `ZK_LOCUST_MIN_WAIT`);
- `--key-size`, `ZK_LOCUST_KEY_SIZE`: The (advisory) byte length of the
  key names to be generated by tests;
- `--val-size`, `ZK_LOCUST_VAL_SIZE`: The (advisory) byte length of the
  payloads to be generated by tests;
- `--exception-behavior`, `ZK_LOCUST_EXCEPTION_BEHAVIOR`: Where
  possible, choose one of the following behaviors when an exception is
  thrown by the active ZooKeeper backend:
  - `log-failure` (default): Marks the request as a Locust "request
    failure," causing it to (1) be included in statistics, but (2)
    influence the request latency;
  - `try-suppress`: Swallow the exception, causing the attempted request
    to only participate in statistics via its absence (decreasing the
    reported req/s);
  - `propagate`: Let Python propagate the exception to the caller,
    causing the Locust client to die.
- `--kazoo-handler`, `KAZOO_LOCUST_HANDLER`: Selects the Kazoo
  concurrency "handler." Valid values include `threading` and `gevent`.
  The default depends on Kazoo, but normally corresponds to `threading`;
- `--kazoo-sasl-options`, `KAZOO_LOCUST_SASL_OPTIONS`: An optional
  JSON-encoded dictionary of SASL options for the Kazoo backend. The
  default is to not authenticate with the server;
- `KAZOO_LOCUST_CREATE_CLIENT`: If set to a non-empty string, contains
  the name of a module and "function" to invoke to instantiate Kazoo
  clients. E.g.:

  ```
  export KAZOO_LOCUST_CREATE_CLIENT='my.custom.kazoo.create_client'
  ```

  That callable ought to act as a replacement for the `KazooClient`
  constructor, and is invoked with the "normal" set of constructor
  arguments (see the sketch after this parameter list);
- `--zk-metrics-collect`, `ZK_LOCUST_ZK_METRICS_COLLECT`: Determines the
  collection method used by the `zk_metrics` monitor; either `web` (the
  default) to have the monitor driven by the Web UI, or a millisecond
  delay, for no Web UI and Locust-side polling;
- `--zk-metrics-csv`, `ZK_LOCUST_ZK_METRICS_CSV`: Path to a CSV file to
  be created by the `zk_metrics` monitor;
- `--stats-collect`, `LOCUST_EXTRA_STATS_COLLECT`: A millisecond delay
  for extended statistics collection, or `0` (the default) to disable
  it;
- `--stats-csv`, `LOCUST_EXTRA_STATS_CSV`: Path to the CSV file in which
  to collect extended statistics;
- `--stats-distrib`, `LOCUST_EXTRA_STATS_DISTRIB`: Path to the file in
  which to collect full (rounded) distributions;
- `ZK_DISPATCH_DISABLE_SCRIPT`, `ZK_DISPATCH_ENABLE_SCRIPT`: The
  `zk_dispatch` module does not implement ensemble member disable/enable
  operations directly, but rather delegates them to these scripts, as
  they tend to be very environment-specific. These shell commands are
  executed in an environment which contains additional variables
  denoting the targeted member:
  - `ZK_MEMBER_HOST_AND_PORT`: The "full" member specification as
    extracted from the ZooKeeper connect string, e.g. `member2:2181`;
  - `ZK_MEMBER_HOST`: The member's host name, e.g. `member2`;
  - `ZK_MEMBER_PORT`: The member's port number, e.g. `2181`;
  - `ZK_MEMBER_STATE`: The last-known state of the targeted member; one
    of `unknown`, `follower` or `leader`.

  E.g.:

  ```
  export ZK_DISPATCH_ENABLE_SCRIPT='my-zk-enable "$ZK_MEMBER_HOST"'
  ```

- `--bench-*`: As a special case, an open-ended set of "benchmark"
  parameters is accepted; those are not validated and simply "forwarded"
  to corresponding `ZK_LOCUST_BENCH_*` variables. E.g.,
  `--bench-the-answer 42` is equivalent to:

  ```
  export ZK_LOCUST_BENCH_THE_ANSWER=42
  ```
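
As noted above for `KAZOO_LOCUST_CREATE_CLIENT`, the referenced
callable simply stands in for the `KazooClient` constructor. Here is a
minimal sketch, using the hypothetical module path from the example
(`my.custom.kazoo`); the customization it applies is purely
illustrative:

```python
# my/custom/kazoo.py -- hypothetical module referenced by
# KAZOO_LOCUST_CREATE_CLIENT='my.custom.kazoo.create_client'
from kazoo.client import KazooClient


def create_client(*args, **kwargs):
    # Called with the "normal" KazooClient constructor arguments and
    # expected to return a Kazoo client instance.  The timeout tweak
    # below is only an example of a site-specific customization.
    kwargs.setdefault('timeout', 30.0)
    return KazooClient(*args, **kwargs)
```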

The `parameterized-locust.sh` script can be directed to generate a
report after the Locust runner exits. (Alternatively, reports can be
"manually" generated at any time after completion using the reporting
script; see "Reporting Parameters for `report.py`" below.)

Here are some reporting parameters it admits:

- `--report-dir`: The name of a directory in which to store the
  collected metrics and generate a "human-readable" report;
- `--report-jobs`, `--report-option`, `--report-nb`, `--report-no-nb`,
  `--report-md`, `--report-no-md`: Forwarded to the report generator.
  E.g., `--report-no-nb` corresponds to `report.py`'s `--no-nb`. See
  "Reporting Parameters for `report.py`", or `./report.py --help`, for
  more information.

It is recommended not to use `--zk-metrics-csv` or `--stats-csv` in
conjunction with `--report-dir`, to allow the script to use
conventionally named files in that directory.
The `report.py` report generator can be used to generate reports from
one or more existing datasets. Here are some of its parameters (run
`./report.py --help` for more):

- `--md`/`--no-md`: Generate a Markdown-based report;
- `--nb`/`--no-nb`: Generate a Jupyter notebook;
- `--option <key value>`: Set a single "named" report or plot option
  (see below).

"Named" report/plot options are passed via the `--option` flag, which
can be specified a number of times. E.g.:

```
./report.py ... \
    --option latencies.shade False \
    --option '*.width' 10 ...
```

With Markdown-based reports, such options are "statically" applied at
report generation time. When generating Jupyter notebooks, they just
replace the default `plot_options` stanza in "General setup," and can
later be interactively tweaked.
The "naming" keys themselves are made of two parts, "category" and
"option," separated by a dot. `*` can be used as a wildcard matching
all categories. The categories are:

- `latencies`: "Operation Latencies" plot;
- `client_count`: "ZK Client Count" plot;
- `request_frequency`: "ZK Client Requests" plot;
- `outstanding_requests`: "ZooKeeper Outstanding Requests" plot;
- `clients`: "ZooKeeper Clients" plot;
- `nodes`: "ZooKeeper Nodes" plot;
- `watch_count`: "ZooKeeper Watches" plot;
- `errors`: All "Errors" plots.

Common plot options are (`*` means: valid for all applicable
categories):

- `*.width`, `*.height`: The (floating-point) width (resp. height) of a
  plot in Matplotlib units (normally inches);
- `*.bottom`, `*.top`: (Floating-point) "anchors" for the Y axis,
  propagated to Matplotlib via `set_ylim`. Both default to `None`, which
  means "auto";
- `*.per_worker`: A boolean indicating whether to include a
  "distribution" of per-worker (Locust "slave") curves on relevant plots
  (default: `True`);
- `latencies.shade`: A boolean indicating whether to shade the latencies
  plot (default: `True`).

The included `locust_*.py` files are "locustfiles," and test various
aspects of the target ZooKeeper ensemble.

Most are composed of boilerplate code, and use "operations" and "task
sets" which have been factored out to the `zk_locust.ops` and
`zk_locust.task_sets` modules, respectively.

One notable exception is `locust_sequence.py`, which implements a
complete "suite" of tests to be run sequentially.
- TODO(ddiederen): Generate more representative loads.

Running the `locust_sequence.py` suite in standalone mode, unleashing
256 clients on `$MY_ENSEMBLE`, collecting metrics every 500ms, and
finally generating a report in `../my-report-3`:
```
./parameterized-locust.sh \
    --hosts "$MY_ENSEMBLE" \
    --kazoo-handler gevent \
    --min-wait 150 \
    --max-wait 300 \
    --stats-collect 500 \
    --zk-metrics-collect 500 \
    --bench-step-duration 15 \
    --report-dir ../my-report-3 \
    -- \
    --reset-stats --no-web \
    -c 256 -r 64 \
    -f locust_sequence.py
```

The CSV files produced by `--no-web` do not contain full histograms,
but rather focus on tail latencies. While these do not lend themselves
to smooth curves, they can still be quickly visualized and compared by
plotting:
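
Here is a minimal sketch using pandas and Matplotlib; the file name and
the percentile column names are assumptions based on the distribution
CSVs emitted by Locust 0.11's `--csv` option, so adjust them to match
the headers of the actual `out/*.csv` files:

```python
# Plot the percentile columns of a Locust "distribution" CSV.
# The path and column names below are assumptions; check the header of
# the generated out/*.csv files and adjust accordingly.
import matplotlib.pyplot as plt
import pandas as pd

percentiles = ['50%', '66%', '75%', '80%', '90%', '95%', '98%', '99%', '100%']

df = pd.read_csv('out/set_and_get_distribution.csv')  # hypothetical path
for _, row in df.iterrows():
    plt.plot(percentiles, row[percentiles], marker='o', label=row['Name'])

plt.xlabel('Percentile')
plt.ylabel('Latency (ms)')
plt.legend()
plt.show()
```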