disable qos adjustment logic when feature apply_cost_tracker_during_replay is activated #31671

tao-stones · 2023-05-16T16:54:46Z

Problem

Enabling the mentioned feature without first removing the QoS Adjustment Logic (#29595 (comment)) breaks cluster test runs off head of Master, where all features are enabled by default.

Summary of Changes

disables the QoS Adjustment Logic when feature is activated (Will still remove the logic eventually Remove leader QoS adjustment logic #31379)

Fixes #31340

…eplay is activated

tao-stones · 2023-05-16T16:56:29Z

tag @KirillLykov @jeffwashington @sakridge
This should fix kin-sim and other cluster test without needing to specifically disable the feature.

Also worth mentioning: if the TXs used in test request higher compute-unit-limit, the number of tx included in block will reduce with this change, therefore reduce throughput, or the # account created in kin-sim. To avoid that, request accurate CU 😄

apfitzge · 2023-05-16T18:04:52Z

Made a comment on the feature-gate issue: #29595 (comment)

tao-stones · 2023-05-16T18:19:22Z

Made a comment on the feature-gate issue: #29595 (comment)

made comment there too

codecov · 2023-05-16T18:46:58Z

Codecov Report

Merging #31671 (b9cd201) into master (db4b76d) will decrease coverage by 0.1%.
The diff coverage is 100.0%.

@@            Coverage Diff            @@
##           master   #31671     +/-   ##
=========================================
- Coverage    81.9%    81.9%   -0.1%     
=========================================
  Files         737      737             
  Lines      205985   205989      +4     
=========================================
- Hits       168718   168712      -6     
- Misses      37267    37277     +10

KirillLykov · 2023-05-17T15:59:48Z

Cluster didn't panic any longer with this fix

apfitzge

ah, merged while I was reviewing 🐢

apfitzge · 2023-05-17T16:40:15Z

core/src/banking_stage/consumer.rs

@@ -1074,7 +1082,9 @@ mod tests {
            mint_keypair,
            ..
        } = create_slow_genesis_config(10_000);
-        let bank = Arc::new(Bank::new_no_wallclock_throttle_for_tests(&genesis_config));
+        let mut bank = Bank::new_no_wallclock_throttle_for_tests(&genesis_config);
+        bank.deactivate_feature(&feature_set::apply_cost_tracker_during_replay::id());


Any reason to explicitly disable the feature for this test? I don't see anything in it that stands out as failing immediately.
If that's the case, imo it's better to have this test run the scenario twice with the feature enabled and disabled.

But obviously let me know if I'm missing something!

per the intention of the pr, with the feature enabled, the adjustment will be bypassed, which breaks this test.

But a good point to test it with out both feature on/off. Will PR with added test.

But what part of the test breaks without this? All the asserts check, afaict, is that there is some non-zero cost, and the tx was committed.

It breaks here:

- assert_eq!(get_block_cost(), 2 * single_transfer_cost); - assert_eq!(get_tx_count(), 2); + assert_eq!(get_block_cost(), expected_block_cost); + assert_eq!(get_tx_count(), expected_tracked_tx_count);

the adjustment logic removes failed tx (the one fail due to AccountInUse) from cost tracker. #31708 for it.

apfitzge · 2023-05-17T16:46:09Z

core/src/banking_stage/consumer.rs

-        );
+        // once feature `apply_cost_tracker_during_replay` is activated, leader shall no longer
+        // adjust block with executed cost (a behavior more inline with bankless leader), instead
+        // will be exclusively using requested `compute_unit_limit` in cost tracking.


🤔

exclusively using requested compute_unit_limit in cost tracking

Sorry to nit-pick this comment, but I want to be really clear about the expectations and behaviors, since this kind of implies that we will require the request to be set (to me at least), and I don't beleive that is the case.

We still have a default CU-limit which is used if not specified.

If we have pure builtin txs, even if the request is set, we don't use that limit in the cost tracking, only fee calculations.

Are these 2 points still going to be true moving forward?

Might be better to just be a bit more vague in the comments, w/ details being in the cost model itself. Something along the lines of "instead calculated, requested, or default costs will be used in cost tracking".

yes for point 1, default will be used if compute_unit_limit is not explicitly set. I implicitly included default when mentioning requested cu. But good point, #31700 clarifies it.

Point 2 is also correct, but it probably should change. Since all builtins now (in 1.16+) also consume requested CUs, probably no need to distinguish builtin and bpf programs in cost model.

Since all builtins now (in 1.16+) also consume requested CUs, probably no need to distinguish builtin and bpf programs in cost model.

I suppose the difference is we know the static builtin costs up-front, compared to bpf. So we can accurately determine those costs, which are (if tx will succeed) strictly <= requested.

Actually, taking it a slight step further - if we found it to be a problem we could immediately reject (try taking fees only) if a builtin-only tx requested too few CUs.

…eplay is activated (solana-labs#31671) * disable qos adjustment logic when feature apply_cost_tracker_during_replay is activated

disable qos adjustment logic when feature apply_cost_tracker_during_r…

f3d2060

…eplay is activated

tao-stones requested review from apfitzge and KirillLykov May 16, 2023 16:56

update test

b9cd201

KirillLykov approved these changes May 17, 2023

View reviewed changes

tao-stones merged commit 692e1f2 into solana-labs:master May 17, 2023

tao-stones deleted the apply_feautre_gate_to_qos_adjustment_logic branch May 17, 2023 16:25

apfitzge reviewed May 17, 2023

View reviewed changes

apfitzge mentioned this pull request May 17, 2023

update comment for clarification #31700

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

disable qos adjustment logic when feature apply_cost_tracker_during_replay is activated #31671

disable qos adjustment logic when feature apply_cost_tracker_during_replay is activated #31671

tao-stones commented May 16, 2023

tao-stones commented May 16, 2023 •

edited

Loading

apfitzge commented May 16, 2023

tao-stones commented May 16, 2023

codecov bot commented May 16, 2023

KirillLykov commented May 17, 2023

apfitzge left a comment

apfitzge May 17, 2023

tao-stones May 17, 2023

apfitzge May 17, 2023

tao-stones May 18, 2023

apfitzge May 17, 2023

tao-stones May 17, 2023

apfitzge May 17, 2023

disable qos adjustment logic when feature apply_cost_tracker_during_replay is activated #31671

disable qos adjustment logic when feature apply_cost_tracker_during_replay is activated #31671

Conversation

tao-stones commented May 16, 2023

Problem

Summary of Changes

tao-stones commented May 16, 2023 • edited Loading

apfitzge commented May 16, 2023

tao-stones commented May 16, 2023

codecov bot commented May 16, 2023

Codecov Report

KirillLykov commented May 17, 2023

apfitzge left a comment

Choose a reason for hiding this comment

apfitzge May 17, 2023

Choose a reason for hiding this comment

tao-stones May 17, 2023

Choose a reason for hiding this comment

apfitzge May 17, 2023

Choose a reason for hiding this comment

tao-stones May 18, 2023

Choose a reason for hiding this comment

apfitzge May 17, 2023

Choose a reason for hiding this comment

tao-stones May 17, 2023

Choose a reason for hiding this comment

apfitzge May 17, 2023

Choose a reason for hiding this comment

tao-stones commented May 16, 2023 •

edited

Loading