release-21.2: sql: consistent aggregated timestamp when flushing sql stats #72008

Closed
wants to merge 1 commit

Conversation


@Azhng (Contributor) commented Oct 26, 2021

Backport 1/1 commits from #71731.

/cc @cockroachdb/release


Previously, when SQL Stats were flushed to the system tables, the
aggregatedTs column was computed for each stats entry individually.
This means that if a flush starts near the end of an hour, different
stats rows can be assigned different aggregated timestamps. The
flusher currently flushes statement statistics first, and only after
the statement statistics are flushed does it flush transaction
statistics to the system table. This makes it likely that the
transaction statistics are assigned a different aggregatedTs than
the statement statistics. Consequently, when the frontend fetches
SQL Stats through the CombinedStmtStats handler, it by default
performs a range scan over a 1-hour interval on the system table.
Statement statistics that were assigned a different aggregatedTs are
therefore omitted from the result.

This commit changes the flusher to compute aggregatedTs only once,
before the flush actually happens, and to assign that aggregatedTs
to *all* stmt/txn stats rows. Statements executed in the same
aggregation interval can still be looked up via the corresponding
statement fingerprint IDs stored in the transaction stats metadata.

Follow up to #71596

Release note: None
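
To make the change concrete, here is a minimal Go sketch of the idea, assuming an hourly aggregation interval; the type and function names are hypothetical and do not reflect the actual persistedsqlstats code:

```go
package main

import (
	"fmt"
	"time"
)

// aggregationInterval mirrors the 1-hour aggregation window described above.
const aggregationInterval = time.Hour

// computeAggregatedTs truncates "now" down to the start of the current
// aggregation interval, so every row written by one flush shares the value.
func computeAggregatedTs(now time.Time) time.Time {
	return now.Truncate(aggregationInterval)
}

// stmtStats and txnStats stand in for the in-memory statistics entries.
type stmtStats struct{ fingerprintID string }
type txnStats struct{ stmtFingerprintIDs []string }

// flush assigns a single aggregatedTs to all statement and transaction rows,
// instead of recomputing the timestamp per row (which can straddle an hour
// boundary when the flush starts near the end of the hour).
func flush(stmts []stmtStats, txns []txnStats, now time.Time) {
	aggregatedTs := computeAggregatedTs(now) // computed exactly once per flush

	for _, s := range stmts {
		fmt.Printf("write stmt %s with aggregated_ts %s\n", s.fingerprintID, aggregatedTs)
	}
	for _, t := range txns {
		fmt.Printf("write txn %v with aggregated_ts %s\n", t.stmtFingerprintIDs, aggregatedTs)
	}
}

func main() {
	flush(
		[]stmtStats{{fingerprintID: "abc123"}},
		[]txnStats{{stmtFingerprintIDs: []string{"abc123"}}},
		time.Now(),
	)
}
```

Truncating to the hour matches the 1-hour interval that the CombinedStmtStats handler scans, so statement and transaction rows produced by the same flush always fall into the same bucket.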


blathers-crl bot commented Oct 26, 2021

Thanks for opening a backport.

Please check the backport criteria before merging:

  • Patches should only be created for serious issues or test-only changes.
  • Patches should not break backwards-compatibility.
  • Patches should change as little code as possible.
  • Patches should not change on-disk formats or node communication protocols.
  • Patches should not add new functionality.
  • Patches must not add, edit, or otherwise modify cluster versions; or add version gates.
If some of the basic criteria cannot be satisfied, ensure that the exceptional criteria are satisfied within.
  • There is a high priority need for the functionality that cannot wait until the next release and is difficult to address in another way.
  • The new functionality is additive-only and only runs for clusters which have specifically “opted in” to it (e.g. by a cluster setting).
  • New code is protected by a conditional check that is trivial to verify and ensures that it only runs for opt-in clusters.
  • The PM and TL on the team that owns the changed code have signed off that the change obeys the above rules.

Add a brief release justification to the body of your PR to justify this backport.

Some other things to consider:

  • What did we do to ensure that a user who doesn’t know & care about this backport has no idea that it happened?
  • Will this work in a cluster of mixed patch versions? Did we test that?
  • If a user upgrades a patch version, uses this feature, and then downgrades, what happens?

@cockroach-teamcity (Member)

This change is Reviewable


@Azhng (Contributor, Author) commented Nov 18, 2021

Closing in favor of #72941

@Azhng closed this Nov 18, 2021