New subcommand: `query` to support customized set of resources #231

magodo · 2022-09-21T12:36:28Z

Summary

This PR introduces a third mode alongside rg mode and res mode: query mode. The user specifies an Azure Resource Graph where predicate, and aztfy terrafies the matching resources. Optionally, when --recursive (newly introduced for this mode) is set, all child resources of the queried resources are included as well.

Additionally, to improve the performance of listing resources, we introduced a common option, --parallelism, which allows resources to be listed in parallel. This can be extended to other scenarios in the future, e.g. parallel import.
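A minimal sketch of what bounded parallel listing could look like, assuming a fixed pool of workers whose size plays the role of --parallelism. The names `listParallel` and `listChildren` are hypothetical and are not taken from aztfy or azlist:

```go
package main

import (
	"fmt"
	"sort"
	"sync"
)

// listChildren is a hypothetical stand-in for a per-resource list call.
func listChildren(id string) []string {
	return []string{id + "/child1", id + "/child2"}
}

// listParallel lists the children of every id with at most `parallelism`
// concurrent workers and returns the merged, sorted result.
func listParallel(ids []string, parallelism int, list func(string) []string) []string {
	if parallelism < 1 {
		parallelism = 1
	}
	jobs := make(chan string)
	var (
		mu  sync.Mutex
		out []string
		wg  sync.WaitGroup
	)
	for i := 0; i < parallelism; i++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for id := range jobs {
				children := list(id)
				mu.Lock()
				out = append(out, children...)
				mu.Unlock()
			}
		}()
	}
	for _, id := range ids {
		jobs <- id
	}
	close(jobs)
	wg.Wait()
	sort.Strings(out) // make the result independent of scheduling order
	return out
}

func main() {
	for _, id := range listParallel([]string{"/a", "/b", "/c"}, 2, listChildren) {
		fmt.Println(id)
	}
}
```

Sorting the merged result keeps the output deterministic regardless of which worker finishes first.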

The resource dependency is now resolved by HCL.

The VM now correctly populates the disk attachment resource (previously, only the managed data disk was populated). This is also the case for single resource mode, i.e. if you terrafy a single VM resource that has one data disk attached, it results in three TF resources: the VM, the managed disk and the disk attachment resource, with correct dependencies added. This will be extended to support other AzureRM provider association resources; the full list can be found at: https://github.com/magodo/aztft/blob/3dfe6a12ed5ee515b6f9281ed307c54211ee35b3/tool/aztft-import/main.go#L45.

Internal

Internally, we removed the dependency on ARM template export for rg mode. Instead, it uses the ARG query where resourceGroup =~ "my-rg" in recursive mode, with the rg itself added on. The logic for recursively listing child resources is implemented in a separate module that aztfy depends on: https://github.com/magodo/azlist. That is basically how ARM template export lists resources.
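As an illustration of the rg mode description above, the sketch below derives the where predicate and the resource group's own id from a subscription id and a resource group name. The helper name `rgModeInputs` is hypothetical, and how aztfy actually builds these values is not shown in this PR description:

```go
package main

import "fmt"

// rgModeInputs returns the ARG where predicate for a resource group and the
// resource group's own id, which is added on to the listed resources.
// This is an illustrative helper, not aztfy's implementation.
func rgModeInputs(subscriptionID, rg string) (predicate, rgID string) {
	predicate = fmt.Sprintf("resourceGroup =~ %q", rg)
	rgID = fmt.Sprintf("/subscriptions/%s/resourceGroups/%s", subscriptionID, rg)
	return predicate, rgID
}

func main() {
	predicate, rgID := rgModeInputs("00000000-0000-0000-0000-000000000000", "my-rg")
	fmt.Println("where", predicate)
	fmt.Println(rgID)
}
```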

Another benefit brought by ARM template export is cross-resource dependency discovery. ARM discovers dependencies by walking through the API model of each resource. aztfy now discovers dependencies along two dimensions:

- By analyzing the Azure resource ids of the terrafied resources
- By analyzing the generated HCL of each resource
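The first dimension can be sketched as follows: for each resource id, walk up its path segments until another id from the same set is found, and treat that nearest ancestor as a dependency. This is only an assumption about the approach; `parentDeps` is a hypothetical name, and the comparison is made case insensitive here as a precaution:

```go
package main

import (
	"fmt"
	"strings"
)

// parentDeps maps each resource id to its nearest ancestor that is also in
// ids. Resources without an ancestor in the set are absent from the result.
func parentDeps(ids []string) map[string]string {
	byLower := make(map[string]string, len(ids))
	for _, id := range ids {
		byLower[strings.ToLower(id)] = id
	}
	deps := map[string]string{}
	for _, id := range ids {
		cur := strings.ToLower(id)
		for {
			i := strings.LastIndex(cur, "/")
			if i <= 0 {
				break // reached the root without finding an ancestor
			}
			cur = cur[:i] // drop the last path segment
			if parent, ok := byLower[cur]; ok {
				deps[id] = parent
				break
			}
		}
	}
	return deps
}

func main() {
	rg := "/subscriptions/sub/resourceGroups/rg"
	vnet := rg + "/providers/Microsoft.Network/virtualNetworks/vnet"
	subnet := vnet + "/subnets/default"
	ids := []string{rg, vnet, subnet}
	deps := parentDeps(ids)
	for _, id := range ids {
		fmt.Printf("%s -> %q\n", id, deps[id])
	}
}
```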

"Discover by API model" vs "Discover by HCL model"

Discovering by API model might recover dependencies for cases like the VM disk depending on the VM, which is missed when discovering by HCL model because that dependency is introduced by the attachment resource. However, that only covers a very small number of cases; a large share of dependencies cannot be discovered by simply looking into API properties, because they depend heavily on the business logic (e.g. sentinel resources depend on an operational insights resource).

To close the gap left by the previous VM-depends-on-disk dependency, we introduced the association resources in the Azure resource set hack code. This is a good place for such code, as we already use it to populate the missing managed disk in this case. Going in this direction, discovering reference dependencies based on HCL is the right choice, as it can then add the dependencies of the association resource on the associated resources (the dependency of the VM on the managed disk in the previous version was actually not quite correct).
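A rough sketch of the "discover by HCL model" idea, assuming it amounts to finding another resource's Azure id as a string literal in the generated HCL and replacing it with a reference to that resource. The `tfResource` type, the `resolveRefs` function and the sample addresses are hypothetical; the real implementation presumably works on parsed HCL rather than raw strings:

```go
package main

import (
	"fmt"
	"strings"
)

// tfResource is a hypothetical, simplified view of a terrafied resource.
type tfResource struct {
	Addr    string // TF address, e.g. "azurerm_managed_disk.res-1"
	AzureID string // Azure resource id
	HCL     string // generated HCL body
}

// resolveRefs replaces quoted Azure ids of other resources with references
// like "<addr>.id" and returns the discovered dependencies per address.
func resolveRefs(resources []tfResource) map[string][]string {
	deps := map[string][]string{}
	for i := range resources {
		for j := range resources {
			if i == j {
				continue
			}
			lit := `"` + resources[j].AzureID + `"`
			if strings.Contains(resources[i].HCL, lit) {
				resources[i].HCL = strings.ReplaceAll(resources[i].HCL, lit, resources[j].Addr+".id")
				deps[resources[i].Addr] = append(deps[resources[i].Addr], resources[j].Addr)
			}
		}
	}
	return deps
}

func main() {
	vmID := "/subscriptions/sub/resourceGroups/rg/providers/Microsoft.Compute/virtualMachines/vm1"
	diskID := "/subscriptions/sub/resourceGroups/rg/providers/Microsoft.Compute/disks/d1"
	rs := []tfResource{
		{Addr: "azurerm_linux_virtual_machine.res-0", AzureID: vmID, HCL: `name = "vm1"`},
		{Addr: "azurerm_managed_disk.res-1", AzureID: diskID, HCL: `name = "d1"`},
		{
			Addr:    "azurerm_virtual_machine_data_disk_attachment.res-2",
			AzureID: vmID + "/dataDisks/d1",
			HCL:     "managed_disk_id = \"" + diskID + "\"\nvirtual_machine_id = \"" + vmID + "\"",
		},
	}
	deps := resolveRefs(rs)
	fmt.Println(rs[2].HCL)
	fmt.Println(deps[rs[2].Addr])
}
```

In this sketch the dependencies land on the attachment resource, which depends on both the VM and the managed disk, rather than on the VM itself.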

Closed Issues


…id is used for import

Previously, entries in the resource mapping file were matched case sensitively. Matching is now case insensitive, to mitigate a bug where ARG returns the first resource id with the RG name uppercased, which causes a mismatch under case-sensitive comparison. Additionally, the resource id recorded in the mapping file is used as the TF id for importing (case sensitively). This is useful when aztft has bugs that return incorrect casing for certain resources: users can tune the resource mapping file to get the correct casing. There might still be cases where the resource id differs from the TF id in more than casing. In that case, we will need to further modify the format of the resource mapping file to accommodate it.
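The matching behavior described in this commit message can be sketched like this: entries are looked up case insensitively, while the id stored in the mapping entry is returned with its own casing so it can be used as the import id. The `mappingEntry` type and its fields are hypothetical and do not reflect the actual mapping file format:

```go
package main

import (
	"fmt"
	"strings"
)

// mappingEntry is a hypothetical, simplified resource mapping entry.
type mappingEntry struct {
	ResourceID string // id as recorded in the mapping file; used as the TF import id
	TFType     string
	TFName     string
}

// lookup finds the entry whose id matches azureID, ignoring case.
func lookup(entries []mappingEntry, azureID string) (mappingEntry, bool) {
	for _, e := range entries {
		if strings.EqualFold(e.ResourceID, azureID) {
			return e, true
		}
	}
	return mappingEntry{}, false
}

func main() {
	entries := []mappingEntry{{
		ResourceID: "/subscriptions/sub/resourceGroups/my-rg",
		TFType:     "azurerm_resource_group",
		TFName:     "res-0",
	}}
	// An id with the RG name uppercased, as described for the ARG bug.
	e, ok := lookup(entries, "/subscriptions/sub/resourceGroups/MY-RG")
	fmt.Println(ok, e.ResourceID)
}
```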



There is another option: discovering the dependencies from API models, just like ARM does. This might recover dependencies for cases like the VM disk depending on the VM, which is now missing because that dependency is introduced by the attachment resource. However, that only covers a very small number of cases; a large share of dependencies cannot be discovered by simply looking into API properties, because they depend heavily on the business logic (e.g. sentinel resources depend on an operational insights resource). To close the gap left by the previous VM-depends-on-disk dependency, we shall consider introducing the association resources in the Azure resource set hack code. This is a good place for such code, as we already use it to populate the missing managed disk in this case. Going in this direction, discovering reference dependencies based on HCL is the right choice, as it can then add the dependencies of the association resource on the associated resources.

…ent` This commit introduces a new field `PesudoResourceInfo` in `AzureResource` to record TF pseudo resources, which is later used when converting into `TFResource`. In this case, rather than using aztft to look up the TF resource type based on the concrete Azure resource, it directly consumes the info recorded in `PesudoResourceInfo`. With the above, the resourceset hack now populates not only the managed disk, but also the disk attachment resource.


stemaMSFT

Great work Zhaoting! Can't understate how much benefit this is going to bring to our users. Customers will be very excited to try this out :) I'll leave it to @ms-henglu to check the technical coding aspects of this PR and do the final approval, but in terms of features/functionality, I have no problems with it.

ms-henglu

LGTM！


magodo added 17 commits September 14, 2022 11:11

CLI changes to add new subcommand "query"

6c1c071

Partial finish; TODO: dependency + remove armtemplate

8062945

Remove arm template

1f114d4

Add simple dependency

76abf96

TODO:
- Cross resource dependencies of non-parent-child relationship
- Resources that need to be ignored but are now listed (e.g. OS disk of vm)
- Resources that need to be listed but are ignored (e.g. nsr, subnet, etc.)

revert back the armid change

197ef2c

Handle large scale of resources when listing

56f17f5

Tweak ARG result by mimicing how ARM template does

d2d1cca

Integrate with github.com/magodo/azlist & Introduce --parallelism

aa7bd75

Record azure resource id in the resourceset.TFResource so to move t…

9058d18

…he parent-child dependency late prior to HCL generation

Merge branch 'main' into query_mode

9ac5163

Improve test cases

465b83e

- Introduce a hidden flag `--plain-ui`, which is used for e2e test to stream the test output
- Add more stage related log during e2e test
- Introduce delay in e2e test right before apply, to wait for the created resources to be recorded in ARG

ARG query changes to where predicate && Introduce --recursive in qu…

e25c50a

…ery mode to support both scope/specific predicates

Adding e2e test for query mode

3dd3dd6

README: include query mode

7693b6b

magodo added the enhancement New feature or request label Sep 21, 2022

magodo requested review from ms-henglu and stemaMSFT September 21, 2022 12:36

stemaMSFT reviewed Sep 21, 2022


magodo added 3 commits September 22, 2022 17:24

Update readme to include ARG limiation

2344c98

--parallelism: remove mentioning import

00bbc36

Add short name -r for --recursive

9cd67a8

ms-henglu approved these changes Sep 23, 2022


Single resource mode supports resource population and depedency

2f5c5cf

E.g. `aztfy res <vm id>` might now terrafy three resources: the VM, the managed disk and the disk attachment.

magodo merged commit 027017a into Azure:main Sep 23, 2022

magodo mentioned this pull request Sep 23, 2022

Hooks for certain resources to handle cases like the "association" resources #3

Closed
