[TEX] Part 7: wiring data collection #6340

ergo14 · 2022-10-31T14:57:08Z

General

Wiring reportExpectationsCollectedData in the right spots and add metrics

Priority:
P2
Concerns / possible downsides (what feedback would you like?):
Did I wrap all the spots where transactions are used to run tasks?
Open to suggestions for metric namespace name (expectations is a little too broad). Always, note we expect to have another metrics namespace for the violations in future TEX PRs.
Is documentation needed?:
No

Compatibility

Does this PR create any API breaks (e.g. at the Java or HTTP layers) - if so, do we have compatibility?:
No
Does this PR change the persisted format of any data - if so, do we have forward and backward compatibility?:
No
The code in this PR may be part of a blue-green deploy. Can upgrades from previous versions safely coexist? (Consider restarts of blue or green nodes.):
Yes
Does this PR rely on statements being true about other products at a deployment - if so, do we have correct product dependencies on these products (or other ways of verifying that these statements are true)?:
No
Does this PR need a schema migration?
No

Testing and Correctness

What, if any, assumptions are made about the current state of the world? If they change over time, how will we find out?:
No-op wiring, existing tests should suffice
What was existing testing like? What have you done to improve it?:
N/A
If this PR contains complex concurrent or asynchronous code, is it correct? The onus is on the PR writer to demonstrate this.:
N/A
If this PR involves acquiring locks or other shared resources, how do we ensure that these are always released?:
N/A

Execution

How would I tell this PR works in production? (Metrics, logs, etc.):
N/A
Has the safety of all log arguments been decided correctly?:
N/A
Will this change significantly affect our spending on metrics or logs?:
N/A
How would I tell that this PR does not work in production? (monitors, etc.):
N/A
If this PR does not work as expected, how do I fix that state? Would rollback be straightforward?:
recall and rollback
If the above plan is more complex than “recall and rollback”, please tag the support PoC here (if it is the end of the week, tag both the current and next PoC):
N/A

Scale

Would this PR be expected to pose a risk at scale? Think of the shopping product at our largest stack.:
N/A
Would this PR be expected to perform a large number of database calls, and/or expensive database calls (e.g., row range scans, concurrent CAS)?:
N/A
Would this PR ever, with time and scale, become the wrong thing to do - and if so, how would we know that we need to do something differently?:
N/A

Development Process

Where should we start reviewing?:
ExpectationsAwareTransaction
If this PR is in excess of 500 lines excluding versions lock-files, why does it not make sense to split it?:

changelog-app · 2022-10-31T14:57:17Z

Generate changelog in `changelog/@unreleased`

Type

Description

[TEX] data collection WIP

Check the box to generate changelog(s)

Generate changelog entry

changelog-app · 2022-11-11T20:01:40Z

Generate changelog in `changelog/@unreleased`

Type

Description

Transactional expectations: enables metrics collections for key value service read calls on transactions post-mortem.
Metrics tracked per transaction: transaction age, number of bytes read, number of Atlas kvs method queries and number of bytes read for the worse Atlas kvs call (one with the most bytes read).

Check the box to generate changelog(s)

Generate changelog entry

ergo14 · 2022-11-23T14:58:45Z

...ain/java/com/palantir/atlasdb/transaction/api/expectations/ExpectationsAwareTransaction.java

@@ -22,7 +22,6 @@
 * Implementors of this interface provide methods useful for tracking transactional expectations and whether
 * they were breached as well as relevant metrics and alerts. Transactional expectations represent transaction-level
 * limits and rules for proper usage of AtlasDB transactions (e.g. reading too much data overall).
- * Todo(aalouane): move this out of API once part 4 is merged
 */


tried to move this to client for 8 minutes but broke some imports that i couldn't fix quickly, we can look at this together

paired offline, resolved!

mdaudali

paired offline, lgtm! Cutting an RC to test on internal proxy to check metrics are as you expect

(Lets add tests to verify that report doesn't run if the transaction is (!definitively committed || aborted)

mdaudali · 2022-11-24T17:20:45Z

...ain/java/com/palantir/atlasdb/transaction/api/expectations/ExpectationsAwareTransaction.java

@@ -22,7 +22,6 @@
 * Implementors of this interface provide methods useful for tracking transactional expectations and whether
 * they were breached as well as relevant metrics and alerts. Transactional expectations represent transaction-level
 * limits and rules for proper usage of AtlasDB transactions (e.g. reading too much data overall).
- * Todo(aalouane): move this out of API once part 4 is merged
 */


paired offline, resolved!

mdaudali · 2022-11-24T17:34:37Z

...sdb-impl-shared/src/main/java/com/palantir/atlasdb/transaction/impl/SnapshotTransaction.java

+
+    @Override
+    public void reportExpectationsCollectedData() {
+        if (!isDefinitivelyCommitted() && !isAborted()) {


While correct, I think this would be easier to read (by applying De Morgen laws) -> !(isDefinitivelyCommitted || isAborted)

[skip ci]

jeremyk-91

Broadly looks good.

Do we need tests for the updates?

jeremyk-91 · 2023-01-30T12:23:44Z

...sdb-impl-shared/src/main/java/com/palantir/atlasdb/transaction/impl/SnapshotTransaction.java

+        if (!List.of(State.COMMITTED, State.ABORTED, State.FAILED).contains(state.get())) {
+            log.error(
+                    "reportExpectationsCollectedData is called on an in-progress transaction",


You can probably reuse some of the logic from ensureStillRunning() - it isn't the same, but there should be similar bits you can use :)

ergo14 added 10 commits October 30, 2022 12:57

copy pasta

d5cc1e9

adding expectations measuring utilities & tests

8a3eef0

adding keyvalueservice data tracker & tests

037777e

adding iterators

b0f6609

Merge branch 'tex-pr-1c0' into tex-pr-1d2

0f6b859

Merge branch 'tex-pr-1d1' into tex-pr-1d2

f829968

putting it all together

0dbad63

fix constructor

49f106e

progress

89ae9f0

progress

2c97dbf

ergo14 added 2 commits October 31, 2022 17:04

checkpoint

4c7fb19

more wiring

87ce54b

ergo14 changed the base branch from develop to tex-pr-1d2 November 1, 2022 11:13

ergo14 added 2 commits November 1, 2022 13:07

stuff

f08fd07

merge conflicts

7375fb4

ergo14 changed the base branch from tex-pr-1d2 to develop November 11, 2022 20:01

ergo14 added 4 commits November 11, 2022 20:02

merge conflict

7712ccd

removing obsolete code

5e2fc66

removing clock and impls

b51745b

removing metrics definition

c72a8f1

ergo14 changed the title ~~[TEX] data collection WIP~~ [TEX] Part 6: no-op wiring Nov 14, 2022

ergo14 marked this pull request as ready for review November 14, 2022 13:18

ergo14 added 2 commits November 14, 2022 14:33

removing getAgeMillisAndFreezeStopwatch

76c4e62

staging

86b882a

ergo14 changed the title ~~[TEX] Part 6: no-op wiring~~ [TEX] Part 7: no-op wiring Nov 15, 2022

ergo14 changed the base branch from develop to tex-pr-6 November 15, 2022 11:35

ergo14 added 2 commits November 15, 2022 13:04

moving ExpectationsAwareTransaction

0271726

merging

6b7e898

ergo14 commented Nov 23, 2022

View reviewed changes

ergo14 added 4 commits November 23, 2022 17:02

Merge branch 'develop' into tex-draft-1

18be54b

pairing

7df9eca

come one

11d4c43

yes

5390c75

mdaudali approved these changes Nov 24, 2022

View reviewed changes

svc-changelog and others added 20 commits November 24, 2022 17:46

Add generated changelog entries

dd465e3

Autorelease 0.773.0-rc1

ba63b23

revapi

15836f4

Autorelease 0.773.0-rc2

0c1e1f4

debugging

2f08a45

fix

dfab6fb

Autorelease 0.773.0-rc3

8c7fd1a

fix revapi

5680412

Autorelease 0.782.0-rc1

480dae7

[skip ci]

more logs

8fbe122

Autorelease 0.782.0-rc2

4238a7a

[skip ci]

Merge branch 'develop' into tex-draft-1

6f2fc36

fixes

1a1873f

Autorelease 0.784.0-rc1

2786dee

[skip ci]

Merge branch 'develop' into tex-draft-1

7aaf6c2

changing the metric namespace name

c06bee3

fix

642761c

fix

756e438

exclude metrics file from license

5598022

Autorelease 0.787.0-rc1

908da7d

[skip ci]

jeremyk-91 approved these changes Jan 30, 2023

View reviewed changes

ergo14 mentioned this pull request Mar 21, 2023

[TEX] Wrap KVS service reads and track bytes read (no metrics) #6475

Merged

ergo14 mentioned this pull request May 25, 2023

[TEX] Report transaction level metrics #6595

Merged

ergo14 closed this May 26, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TEX] Part 7: wiring data collection #6340

[TEX] Part 7: wiring data collection #6340

ergo14 commented Oct 31, 2022 •

edited

Loading

changelog-app bot commented Oct 31, 2022

changelog-app bot commented Nov 11, 2022 •

edited by ergo14

Loading

ergo14 Nov 23, 2022

mdaudali Nov 24, 2022

mdaudali left a comment •

edited

Loading

mdaudali Nov 24, 2022

mdaudali Nov 24, 2022 •

edited

Loading

jeremyk-91 left a comment •

edited

Loading

jeremyk-91 Jan 30, 2023

[TEX] Part 7: wiring data collection #6340

[TEX] Part 7: wiring data collection #6340

Conversation

ergo14 commented Oct 31, 2022 • edited Loading

General

Compatibility

Testing and Correctness

Execution

Scale

Development Process

changelog-app bot commented Oct 31, 2022

Generate changelog in changelog/@unreleased

changelog-app bot commented Nov 11, 2022 • edited by ergo14 Loading

Generate changelog in changelog/@unreleased

ergo14 Nov 23, 2022

Choose a reason for hiding this comment

mdaudali Nov 24, 2022

Choose a reason for hiding this comment

mdaudali left a comment • edited Loading

Choose a reason for hiding this comment

mdaudali Nov 24, 2022

Choose a reason for hiding this comment

mdaudali Nov 24, 2022 • edited Loading

Choose a reason for hiding this comment

jeremyk-91 left a comment • edited Loading

Choose a reason for hiding this comment

jeremyk-91 Jan 30, 2023

Choose a reason for hiding this comment

ergo14 commented Oct 31, 2022 •

edited

Loading

Generate changelog in `changelog/@unreleased`

changelog-app bot commented Nov 11, 2022 •

edited by ergo14

Loading

Generate changelog in `changelog/@unreleased`

mdaudali left a comment •

edited

Loading

mdaudali Nov 24, 2022 •

edited

Loading

jeremyk-91 left a comment •

edited

Loading