Fix mysql metric for innodb row lock time #7289

sayap · 2020-08-05T08:43:57Z

What does this PR do?

Previously, the code retrieved the internal innodb metric returned by
SHOW STATUS first, and then tried to add the row lock time from any
in-flight query on top of it, based on SHOW ENGINE INNODB STATUS.

The intention was probably to make the metric more accurate, because
the internal innodb metric for row lock time will only be incremented
once a query has completed (either successfully or after exceeding
innodb_lock_wait_timeout).

However, the addition actually didn't work, because we are just merging
2 python dictionaries. So, we would end up with either the accumulating
counter value from SHOW STATUS when there is no in-flight query
waiting for row lock, or the manually calculated value from parsing
SHOW ENGINE INNODB STATUS otherwise. Flipping between these 2 values,
the rate calculated by datadog agent would be useless and confusing.

This is fixed by returning the internal innodb metric as-is. Trying to
add the row lock time from in-flight queries would introduce complexity
without adding much benefit, as the default innodb_lock_wait_timeout
is only 50 seconds anyway.

Motivation

The wrong metric made the graph looks confusing, and also prevented us from setting the alert threshold.

Additional Notes

Review checklist (to be filled by reviewers)

Feature or bugfix MUST have appropriate tests (unit, integration, e2e)
PR title must be written as a CHANGELOG entry (see why)
Files changes must correspond to the primary purpose of the PR as described in the title (small unrelated changes should have their own PR)
PR must have changelog/ and integration/ labels attached

Previously, the code retrieved the internal innodb metric returned by `SHOW STATUS` first, and then tried to add the row lock time from any in-flight query on top of it, based on `SHOW ENGINE INNODB STATUS`. The intention was probably to make the metric more accurate, because the internal innodb metric for row lock time will only be incremented once a query has completed (either successfully or after exceeding `innodb_lock_wait_timeout`). However, the addition actually didn't work, because we are just merging 2 python dictionaries. So, we would end up with either the accumulating counter value from `SHOW STATUS` when there is no in-flight query waiting for row lock, or the manually calculated value from parsing `SHOW ENGINE INNODB STATUS` otherwise. Flipping between these 2 values, the rate calculated by datadog agent would be useless and confusing. This is fixed by returning the internal innodb metric as-is. Trying to add the row lock time from in-flight queries would introduce complexity without adding much benefit, as the default `innodb_lock_wait_timeout` is only 50 seconds anyway.

codecov · 2020-08-05T08:59:37Z

Codecov Report

Merging #7289 into master will decrease coverage by 8.74%.
The diff coverage is n/a.

Impacted Files	Coverage Δ
mysql/datadog_checks/mysql/innodb_metrics.py	`63.94% <ø> (ø)`
elastic/tests/test_config.py
...kroachdb/datadog_checks/cockroachdb/cockroachdb.py
ibm_db2/tests/test_integration_e2e.py
cockroachdb/datadog_checks/cockroachdb/metrics.py
postgres/tests/common.py
crio/tests/test_crio.py
airflow/datadog_checks/airflow/airflow.py
...s_dev/tests/tooling/config_validator/test_utils.py
disk/tests/metrics.py
... and 778 more

sayap requested a review from a team as a code owner August 5, 2020 08:43

hithwen added integration/mysql changelog/Changed labels Aug 5, 2020

hithwen approved these changes Aug 7, 2020

View reviewed changes

hithwen merged commit 3a0f4a4 into DataDog:master Aug 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix mysql metric for innodb row lock time #7289

Fix mysql metric for innodb row lock time #7289

sayap commented Aug 5, 2020

codecov bot commented Aug 5, 2020

Fix mysql metric for innodb row lock time #7289

Fix mysql metric for innodb row lock time #7289

Conversation

sayap commented Aug 5, 2020

What does this PR do?

Motivation

Additional Notes

Review checklist (to be filled by reviewers)

codecov bot commented Aug 5, 2020

Codecov Report