Unify `code_at` logic between `CallExecutor` & `Client` #4618

bkchr · 2024-05-28T14:34:16Z

This unifies the logic between CallExecutor and Client when it comes to fetching the code for a given block. The actual code depends on potential overrides/substitutes.

Besides that it changes the logic in the lookahead collator on which ValidationCodeHash it sends to the validator alongside the POV. We are now sending the code hash as found on the relay chain. This is done as the local node could run with an override which is compatible to the validation code on the relay chain, but has a different hash.

This unifies the logic between `CallExecutor` and `Client` when it comes to fetching the `code` for a given block. The actual `code` depends on potential overrides/substitutes. Besides that it changes the logic in the lookahead collator on which `ValidationCodeHash` it sends to the validator alongside the `POV`. We are now sending the code hash as found on the relay chain. This is done as the local node could run with an override which is compatible to the validation code on the relay chain, but has a different hash.

skunert · 2024-05-28T15:12:20Z

cumulus/client/consensus/aura/src/collators/mod.rs

 			if state != *local_validation_code_hash {
 				tracing::warn!(
 					target: super::LOG_TARGET,
 					%para_id,
 					?relay_parent,
 					?local_validation_code_hash,
 					relay_validation_code_hash = ?state,
-					"Parachain code doesn't match validation code stored in the relay chain state",
+					"Parachain code doesn't match validation code stored in the relay chain state. This is expected if you are using an override for example.",


Is it expected actually? If we look at how lookahead collator is typically instantiated, we pass client.code_at as code_hash_provider like here:

polkadot-sdk/cumulus/polkadot-parachain/src/service.rs

Lines 830 to 832 in b9c81e1

code_hash_provider: move |block_hash| {

client.code_at(block_hash).ok().map(|c| ValidationCode::from(c).hash())

},

With your changes in client.rs this will already take substitutes and overrides into account. The hash should match then and we would not even need to return the code hash from the relay chain, right?

If you are using a override, you will never find the hash of the override in the relay chain state. However, the relay chain would reject your pov if you send it with the hash of you override, as it doesn't know this hash.

So what is not super clear to me is what is the point of this check here except the logging message.

We fetch the code hash from the relay chain here and then submit it to the relay chain along with the collation. Previously if there is some other reason for a code hash mismatch, it would be caught. Can we not check if there is an override or substitute in place and only in that case return the relay chain code hash here?

So what is not super clear to me is what is the point of this check here except the logging message.

The entire purpose of this function was always to just log. There is nothing we can do, besides informing the human that something is wrong.

Previously if there is some other reason for a code hash mismatch, it would be caught.

Not sure what other reason you mean, but we would still catch it. I mean even if the parachain is using a completely different code in its state, it will be printed. There is no real difference to before. The only difference is that we tell the relay chain what it expects from us (a little bit like cheating). However, the worst case would be that the relay chain rejects the Pov.

skunert · 2024-05-28T15:26:05Z

prdoc/pr_4618.prdoc

+
+crates:
+  - name: cumulus-client-consensus-aura
+    bump: patch


Since this changes behaviour I am in favor of minor.

paritytech-cicd-pr · 2024-06-17T19:59:11Z

The CI pipeline was cancelled due to failure one of the required jobs.
Job name: test-linux-stable 2/3
Logs: https://gitlab.parity.io/parity/mirrors/polkadot-sdk/-/jobs/6485817

koute · 2024-06-18T01:39:28Z

substrate/client/service/src/client/code_provider.rs

+	pub fn code_at(
+		&self,
+		block: Block::Hash,
+		ignore_overrides: bool,


The ignore_overrides seems to always be true here if I'm looking at it correctly, so maybe remove this argument and rename the function code_at_ignoring_overrides?

koute · 2024-06-18T01:50:04Z

substrate/client/service/src/client/code_provider.rs

+pub struct CodeProvider<Block: BlockT, Backend, Executor> {
+	backend: Arc<Backend>,
+	executor: Arc<Executor>,
+	wasm_override: Arc<Option<WasmOverride>>,
+	wasm_substitutes: WasmSubstitutes<Block, Executor, Backend>,
+}


Considering there are three Arcs in here it'd be nicer to maybe do something like this:

struct CodeProviderInner<...> { backend: Backend, executor: Executor, ... } struct CodeProvider<...>(Arc(CodeProviderImpl<...>));

koute · 2024-06-18T01:56:30Z

substrate/client/service/src/client/code_provider.rs

+	use substrate_test_runtime_client::{runtime, GenesisInit};
+
+	#[test]
+	fn should_get_override_if_exists() {


Maybe would be nice to also have a test if there's no override to trigger the passthrough?

) This unifies the logic between `CallExecutor` and `Client` when it comes to fetching the `code` for a given block. The actual `code` depends on potential overrides/substitutes. Besides that it changes the logic in the lookahead collator on which `ValidationCodeHash` it sends to the validator alongside the `POV`. We are now sending the code hash as found on the relay chain. This is done as the local node could run with an override which is compatible to the validation code on the relay chain, but has a different hash.

bkchr added T0-node This PR/Issue is related to the topic “node”. T9-cumulus This PR/Issue is related to cumulus. labels May 28, 2024

bkchr added 2 commits May 28, 2024 16:54

PRDOC

b9c81e1

Fix warnings

140389e

skunert reviewed May 28, 2024

View reviewed changes

bkchr added 4 commits June 17, 2024 15:42

Merge remote-tracking branch 'origin/master' into bkchr-code-provider

7757bb4

Small adjustments

03f610a

Adds missing license

765ca1f

Only respect the substitutes

764352e

skunert approved these changes Jun 17, 2024

View reviewed changes

Oops

98a9ab1

koute approved these changes Jun 18, 2024

View reviewed changes

bkchr and others added 2 commits June 18, 2024 13:13

Review feedback

c133a27

Merge branch 'master' into bkchr-code-provider

cfbc547

bkchr enabled auto-merge June 18, 2024 11:14

bkchr added this pull request to the merge queue Jun 18, 2024

Merged via the queue into master with commit 029a656 Jun 18, 2024
149 of 157 checks passed

bkchr deleted the bkchr-code-provider branch June 18, 2024 12:28

This was referenced Aug 21, 2024

Update polkadot-sdk from v1.11.0 to stable2407 moondance-labs/tanssi#659

Open

Update polkadot-sdk from v1.11.0 to stable2407 moonbeam-foundation/moonbeam#2912

Closed

RomarQ mentioned this pull request Sep 27, 2024

Expose types from sc-service #5855

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unify `code_at` logic between `CallExecutor` & `Client` #4618

Unify `code_at` logic between `CallExecutor` & `Client` #4618

bkchr commented May 28, 2024

skunert May 28, 2024

bkchr Jun 17, 2024

skunert Jun 17, 2024

bkchr Jun 17, 2024

skunert May 28, 2024

paritytech-cicd-pr commented Jun 17, 2024

koute Jun 18, 2024

koute Jun 18, 2024

koute Jun 18, 2024

	code_hash_provider: move \|block_hash\| {
	client.code_at(block_hash).ok().map(\|c\| ValidationCode::from(c).hash())
	},

Unify code_at logic between CallExecutor & Client #4618

Unify code_at logic between CallExecutor & Client #4618

Conversation

bkchr commented May 28, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

paritytech-cicd-pr commented Jun 17, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Unify `code_at` logic between `CallExecutor` & `Client` #4618

Unify `code_at` logic between `CallExecutor` & `Client` #4618