[EPIC] Rollout PlanResourceChange #1785

t0yv0 · 2024-03-22T22:25:50Z

🚧

PlanResourceChange flag enables a rewritten implementation of the lifecycle for SDKv2 based resources that more closely aligns with how TF CLI exercises the providers. Building out a plan to roll it out as the default behavior in the bridge. Currently it is rolled out selectively on a per-resource basis where it is shown to fix significant issues.

Introduce initial version in Integrate with PlanResourceChange and ApplyResourceChange #1614
aws:ec2:LaunchTemplate update_default_version Fails on AWS Provider pulumi-aws#1504
Inconsistent behavior and unable to update aws.ssm.Document pulumi-aws#2555
Swap CtyInstanceState with PlanResourceChange flag pulumi-aws#3359
ProviderMeta Issue enrolling GCP resource into PlanResourceChange #1822
Cross-test matching Diff behavior with property based testing #1791 (verify that discrepancies here are non-critical)
Test provider meta handling with GCP resources which use it #1827
PlanResourceChange UpgradeState panic #2034
~~Bridge does not run TF upgrades from version 0 -> 1 #2039~~ (not a PRC issue, fixed in Fix bridge not running state upgrades from 0 -> 1 #2081)
Pass PlanResourceChange through downstream tests #1962
PlanResourceChange undesirable updates #1967 (will be closed with Fix dirty collection attribute refresh #2065)
PlanResourceChange dirty refresh on empty maps #2047 (will be closed with Fix dirty collection attribute refresh #2065)
~~PlanResourceChange issue with empty strings and GCP labels pulumi-gcp#2078~~ (issue in GCP tests, fixed in Fix overfitting label tests pulumi-gcp#2072)
SDKv2 PlanResourceChange truncates big integers #2284

Technicalities

What PlanResourceChange flag does: it currently is a partial rewrite of the resource lifecycle methods for bridging the SDKv2-based resources. It ties into as much as possible into TF gRPC implementations to work "exactly-as" TF. Several known areas where this improves things:

RawState, RawConfig and RawPlan are computed as-in TF; when they differ this may lead to panics in providers
Does not call CustomizeDiff repeatedly, matching TF behavior more closely and solving complicated bugs

Cross-testing is essential to pin down corner cases and get full confidence before broader rollout.

Rollout

We will start the rollout as follows:

Instances of TF behaviour regressing against non-PRC:

PlanResourceChange: Empty resource labels keeps old label value pulumi-gcp#2083

The text was updated successfully, but these errors were encountered:

VenelinMartinov · 2024-04-01T15:55:03Z

#1822 is likely the GCP Meta handling issue - it does have a repro.

VenelinMartinov · 2024-04-24T17:55:40Z

Possible issue in GCP bucket: pulumi/pulumi-gcp#1952 - I attempted to enroll it in PlanResourceChange to check if it fixes an issue. Did not fix the original issue but caused a lot of test failures.

At least one was legitimate - the resource started refreshing dirty when no labels are specified.

Recent changes in hashicorp/terraform-provider-aws#37111 introduced a Diff customizer that leads to errors in the Pulumi version of the provider due to discrepancies in the order in which Diff customizer functions are applied between Pulumi and Terraform. This change fixes the problem by applying the experimental PlanResourceChange flag to the affected resource. A regression test is included. See also: pulumi/pulumi-terraform-bridge#1785

…3888) Recent changes in hashicorp/terraform-provider-aws#37111 introduced a Diff customizer that leads to errors in the Pulumi version of the provider due to discrepancies in the order in which Diff customizer functions are applied between Pulumi and Terraform. This change fixes the problem by applying the experimental PlanResourceChange flag to the affected resource. Fixes #3887 A regression test is included. See also: pulumi/pulumi-terraform-bridge#1785

t0yv0 · 2024-05-09T14:50:12Z

Some updates. There's still some very basic issues around unknown handling (#1943) causing customer P1s in AWS (pulumi/pulumi-aws@8c302b9) for resources where this was speculatively rolled out to fix other P1s or diff issues like in the case of LaunchTemplate.

Given this situation, after #1943 it would be great to open a bridge PR that runs all the downstream tests with this feature turned on to see if we can spot any more basic simple failures and proactively fix them.

The issue in question could have been located by #1858 but I would bet also regular integration tests would flag it somewhere.

t0yv0 · 2024-05-09T14:52:08Z

ON a positive note we landed #1927 which together with this flag is shown to close 4 resource cycling issues in AWS, and likely actually fixes a lot more.

VenelinMartinov · 2024-05-09T14:52:18Z

We could add an env var which default enables this perhaps?

Then run downstream tests with the env var enabled. Iterate until all downstream tests succeed?

EDIT: This was done in fb50112

PULUMI_ENABLE_PLAN_RESOURCE_CHANGE=1 enables PlanResourceChange for all resources in all bridged providers.

t0yv0 · 2024-05-09T14:58:56Z

Exactly. The issues from actual integration tests likely cover the most important basics.

VenelinMartinov · 2024-05-29T16:14:16Z

Should we finish #1858 and maybe #1864 before rolling this out?

t0yv0 · 2024-05-29T18:48:23Z

As discussed today:

Include PlanResourceChange panic: can't use ElementIterator on null value #1964 panic as requirement for release
Normalize block values before passing to TF provider #1971 is borderline but we wanted to include it as requirement
do not include Cross-test matching Diff behavior with property based testing #1791 as requirement for release
similarly do not include 1858 and 1864 as requirements

For failing downstream checks do not over-index on upgrade tests flagging Updates, this is not unexpected and we can weaken the tests to just weed out replacements.

part of #1785 This change adds a normalisation step for collections when recovering cty values to pass to terraform. This ensures we represent them similarly to terraform. In practice this means that all block collections need to be passed to TF providers as an empty collection instead of nil. This should get rid of quite a few subtle discrepancies in the data we pass to the TF provider code. These sometimes result in panics since we pass unexpected nils. This gets rid of all known input discrepancies discovered so far through cross-testing. The full rules for what is a block are [here](https://github.com/hashicorp/terraform-plugin-sdk/blob/1f499688ebd9420768f501d4ed622a51b2135ced/helper/schema/core_schema.go#L60). It is essentially properties with schema: typeList or typeSet with a Resource Elem. fixes #1970 fixes #1915 fixes #1964 fixes #1767 fixes #1762 TODO: [DONE] remove the MaxItemsOne default hacks introduced in #1725 (opened #2049) --------- Co-authored-by: Anton Tayanovskyy <[email protected]> Co-authored-by: Ian Wahbe <[email protected]>

VenelinMartinov · 2024-06-10T16:57:51Z

Discovered a new issue with GCP and PRC: pulumi/pulumi-gcp#2078

VenelinMartinov · 2024-06-11T14:50:27Z

#2039 reproes on non-PRC, removing it from the list.

VenelinMartinov · 2024-06-12T11:36:22Z

pulumi/pulumi-gcp#2078 is actually an upstream bug/behaviour uncovered by PRC

VenelinMartinov · 2024-06-12T11:38:44Z

1967 and 2047 are addressed by #2065 so we only need to run down all the failures in #1962

Enables PlanResourceChange by default in pulumi-azure. Part of pulumi/pulumi-terraform-bridge#1785 Also contains #2325 Also contains #2331 fixes #2322 fixes #2323 fixes #2330

VenelinMartinov · 2024-08-19T19:29:17Z

Azure is done: pulumi/pulumi-azure#2306

VenelinMartinov · 2024-08-27T11:07:31Z

Investigating PRC test failures in AWS here: pulumi/pulumi-aws#4410

VenelinMartinov · 2024-08-30T14:40:34Z

Started on the bridge cleanup #2380 so that we can base other work off of that.

VenelinMartinov · 2024-09-03T15:21:52Z

Possible issue with user-defined timeout overrides for Create under PRC: #2386

t0yv0 · 2024-09-06T13:55:15Z

Progress this week on fixing the emergent custom timeouts issue 2386; next up: roll out to AWS, cleanup and close. AWS rollout delayed as waiting for a bridge release to pass all downstream tests.

t0yv0 · 2024-09-14T00:42:21Z

pulumi/pulumi-aws#4457 is a consequence we believe, but it is quite tricky. The low-level behavior is more inline with matching TF (our goal) but at the higher level we have gaps to make the users be able to get unstuck.

t0yv0 · 2024-09-14T00:43:22Z

pulumi/pulumi-gcp#2372 is this a consequence of PRC rollout?

mjeffryes · 2024-09-14T00:56:37Z

Yes, the behavior is actually more consistent with the upstream behavior with PRC, but this results in unexpected diffs for when there are labels set as ""

VenelinMartinov · 2024-09-17T10:06:49Z

Another persistent diff fixed by PRC: pulumi/pulumi-newrelic#875

t0yv0 · 2024-09-18T20:08:39Z

One more fix possibly: pulumi/pulumi-aws#1784 per @corymhall

VenelinMartinov · 2024-09-19T11:02:28Z

New issue with detailed diff when a set contains an unknown element: #2427

pulumi-bot · 2024-09-20T09:50:28Z

This issue has been addressed in PR #2380 and shipped in release v3.91.0.

VenelinMartinov · 2024-09-23T14:41:08Z

possibly fixed with this: pulumi/pulumi-azuread#1338

This was referenced Mar 22, 2024

aws:ec2:LaunchTemplate update_default_version Fails on AWS Provider pulumi/pulumi-aws#1504

Closed

Rollout DiffStrategy=PlanState across providers #866

Closed

mjeffryes added the kind/task Work that's part of an ongoing epic label Mar 26, 2024

t0yv0 mentioned this issue Mar 26, 2024

Table-driven diff tests #1282

Closed

8 tasks

mjeffryes mentioned this issue Mar 26, 2024

Cross-testing (aka conformance testing) goals for Q4 #1796

Open

guineveresaenger mentioned this issue Apr 12, 2024

VM always changed in eptRviMode,hvMode pulumi/pulumi-vsphere#472

Open

t0yv0 mentioned this issue May 2, 2024

Fixes errors and resource cycling issues for aws.batch.JobDefinition pulumi/pulumi-aws#3888

Merged

VenelinMartinov mentioned this issue May 6, 2024

Consider rewriting makeDetailedDiff #1895

Closed

mjeffryes assigned VenelinMartinov May 7, 2024

VenelinMartinov mentioned this issue May 13, 2024

Pass PlanResourceChange through downstream tests #1962

Closed

VenelinMartinov mentioned this issue May 22, 2024

Normalize block values before passing to TF provider #1971

Merged

iwahbe added kind/epic Large new features or investments and removed kind/task Work that's part of an ongoing epic labels May 29, 2024

iwahbe changed the title ~~Rollout PlanResourceChange~~ [EPIC] Rollout PlanResourceChange May 29, 2024

VenelinMartinov mentioned this issue Jun 10, 2024

Allow disabling PRC via env var #2077

Closed

mjeffryes modified the milestones: 0.108, 0.109 Aug 19, 2024

VenelinMartinov mentioned this issue Aug 27, 2024

PRC failures in AWS pulumi/pulumi-aws#4410

Closed

6 tasks

This was referenced Aug 28, 2024

Enable PRC by default in AWS pulumi/pulumi-aws#4403

Closed

Enable PRC by default #2380

Merged

mjeffryes modified the milestones: 0.109, 0.110 Sep 12, 2024

mjeffryes mentioned this issue Sep 16, 2024

Announce pulumi-gcp v8 release pulumi/docs#12791

Merged

VenelinMartinov closed this as completed in #2380 Sep 19, 2024

VenelinMartinov closed this as completed in be3b5e6 Sep 19, 2024

pulumi-bot added the resolution/fixed This issue was fixed label Sep 19, 2024

VenelinMartinov mentioned this issue Sep 20, 2024

Set up CI for feature flags #2432

Closed

VenelinMartinov mentioned this issue Sep 30, 2024

No diff when adding packages to pypi_packages of gcp.composer.Environment pulumi/pulumi-gcp#1685

Closed

VenelinMartinov mentioned this issue Nov 6, 2024

Accurate bridge previews rollout plan #2598

Open

lukehoban mentioned this issue Nov 9, 2024

Pulumi reports vault.azure.BackendRole always has changes pulumi/pulumi-vault#231

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EPIC] Rollout PlanResourceChange #1785

[EPIC] Rollout PlanResourceChange #1785

t0yv0 commented Mar 22, 2024 •

edited by VenelinMartinov

Loading

VenelinMartinov commented Apr 1, 2024 •

edited

Loading

VenelinMartinov commented Apr 24, 2024

t0yv0 commented May 9, 2024

t0yv0 commented May 9, 2024

VenelinMartinov commented May 9, 2024 •

edited

Loading

t0yv0 commented May 9, 2024

VenelinMartinov commented May 29, 2024

t0yv0 commented May 29, 2024

VenelinMartinov commented Jun 10, 2024

VenelinMartinov commented Jun 11, 2024

VenelinMartinov commented Jun 12, 2024

VenelinMartinov commented Jun 12, 2024

VenelinMartinov commented Aug 19, 2024

VenelinMartinov commented Aug 27, 2024

VenelinMartinov commented Aug 30, 2024

VenelinMartinov commented Sep 3, 2024

t0yv0 commented Sep 6, 2024

t0yv0 commented Sep 14, 2024

t0yv0 commented Sep 14, 2024

mjeffryes commented Sep 14, 2024

VenelinMartinov commented Sep 17, 2024

t0yv0 commented Sep 18, 2024

VenelinMartinov commented Sep 19, 2024

pulumi-bot commented Sep 20, 2024

VenelinMartinov commented Sep 23, 2024

[EPIC] Rollout PlanResourceChange #1785

[EPIC] Rollout PlanResourceChange #1785

Comments

t0yv0 commented Mar 22, 2024 • edited by VenelinMartinov Loading

Technicalities

Rollout

Instances of TF behaviour regressing against non-PRC:

VenelinMartinov commented Apr 1, 2024 • edited Loading

VenelinMartinov commented Apr 24, 2024

t0yv0 commented May 9, 2024

t0yv0 commented May 9, 2024

VenelinMartinov commented May 9, 2024 • edited Loading

t0yv0 commented May 9, 2024

VenelinMartinov commented May 29, 2024

t0yv0 commented May 29, 2024

VenelinMartinov commented Jun 10, 2024

VenelinMartinov commented Jun 11, 2024

VenelinMartinov commented Jun 12, 2024

VenelinMartinov commented Jun 12, 2024

VenelinMartinov commented Aug 19, 2024

VenelinMartinov commented Aug 27, 2024

VenelinMartinov commented Aug 30, 2024

VenelinMartinov commented Sep 3, 2024

t0yv0 commented Sep 6, 2024

t0yv0 commented Sep 14, 2024

t0yv0 commented Sep 14, 2024

mjeffryes commented Sep 14, 2024

VenelinMartinov commented Sep 17, 2024

t0yv0 commented Sep 18, 2024

VenelinMartinov commented Sep 19, 2024

pulumi-bot commented Sep 20, 2024

VenelinMartinov commented Sep 23, 2024

t0yv0 commented Mar 22, 2024 •

edited by VenelinMartinov

Loading

VenelinMartinov commented Apr 1, 2024 •

edited

Loading

VenelinMartinov commented May 9, 2024 •

edited

Loading