
Add ignition support in bootstrap provider #3430

Closed
dongsupark opened this issue Jul 30, 2020 · 25 comments · Fixed by #4172
Assignees
Labels
area/bootstrap Issues or PRs related to bootstrap providers kind/design Categorizes issue or PR as related to design. kind/feature Categorizes issue or PR as related to a new feature. lifecycle/active Indicates that an issue or PR is actively being worked on by a contributor.

Comments

@dongsupark
Member

Detailed Description

In CABPK, KubeadmConfig.Spec already has a Format field, intended to specify the output format of the bootstrap data the controller generates. At the moment the field is not used anywhere, so CABPK always relies on cloud-init.

We can use this field to support bootstrap config formats other than the default cloud-init. A good use case would be adding Ignition, which is used by Flatcar Container Linux and Fedora CoreOS.
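As a sketch, selecting the output format could look like the following (the ignition value is what this issue proposes, not something CABPK supports yet; the apiVersion matches the contemporary v1alpha3 types):

```yaml
apiVersion: bootstrap.cluster.x-k8s.io/v1alpha3
kind: KubeadmConfig
metadata:
  name: my-machine-bootstrap
spec:
  # Proposed: select the bootstrap data output format.
  # Today this field exists but is ignored; cloud-init is always used.
  format: ignition
  joinConfiguration:
    nodeRegistration:
      name: '{{ ds.meta_data.hostname }}'
```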

Previous attempts:

There was a project, cluster-api-bootstrap-provider-kubeadm-ignition, which is actually a fork of another repo. The original repo from minsheng-fintech was recently removed; I am not sure why. I tried to contact the original author, but so far have not heard back.

Based on that code, I have created a PoC branch on top of the current cluster-api code base. Of course it is still up for discussion.

Related issues:

#1576
#1582
#3064

/kind feature
/area bootstrap

/cc @vbatts @t-lo @ncdc @vincepri @detiber

@k8s-ci-robot k8s-ci-robot added kind/feature Categorizes issue or PR as related to a new feature. area/bootstrap Issues or PRs related to bootstrap providers labels Jul 30, 2020
@vincepri
Member

/kind design
/milestone v0.4.0

@k8s-ci-robot k8s-ci-robot added this to the v0.4.0 milestone Jul 30, 2020
@k8s-ci-robot k8s-ci-robot added the kind/design Categorizes issue or PR as related to design. label Jul 30, 2020
@rudoi
Contributor

rudoi commented Jul 30, 2020

@dongsupark thanks for opening this!

We use quite a bit of Flatcar and were looking into adding this support a while back, but we struggled to find the right way to handle the "immutability" of it in the AWS provider, for example.

CAPA kind of assumes the AMI has the correct version of kubeadm on it, so I was curious if you'd thought about this at all. If I understand CoreOS/Flatcar correctly, it's not best practice to publish a bunch of different machine images - you'd use the official release image and then use systemd units, etc to make sure your dependencies were installed.

Would love to hear your thoughts. I acknowledge that this isn't strictly related to this issue, though 😅.

@dongsupark
Member Author

CAPA kind of assumes the AMI has the correct version of kubeadm on it, so I was curious if you'd thought about this at all. If I understand CoreOS/Flatcar correctly, it's not best practice to publish a bunch of different machine images - you'd use the official release image and then use systemd units, etc to make sure your dependencies were installed.

You are right.
That is the main reason why we need to create a CAPA AMI for Flatcar, making use of a pending PR. It basically downloads the necessary binaries under /opt, a read-write partition inside Flatcar, so the final AMI includes everything we need, like kubeadm, containerd, crictl, etc.
Once the PR is merged, we will publish the CAPA AMI and support other providers as well.

@vincepri
Member

/milestone v0.3.x

Synced up with @dongsupark on slack, we're going to copy and adapt the CABPK bootstrapper and provide new types and controllers for the kubeadm-ignition based one. Everything will live under the same CABPK group as new types, but under the experimental folder for now.

@vincepri
Member

/milestone Next

@k8s-ci-robot k8s-ci-robot modified the milestones: v0.3.9, Next Aug 25, 2020
@vbatts

vbatts commented Oct 7, 2020

@vincepri curious, would this support need to be included in the v1alpha1 roadmap?

@vbatts

vbatts commented Oct 7, 2020

(asking as I joined the call today and was looking over #3754)

@vincepri
Member

vincepri commented Oct 7, 2020

(Assuming you meant v1alpha4) We can just add it to the roadmap, we just need someone assigned to push it forward.

There are also the node agent talks that came up from #2554, which might be of interest to you all.

cc @randomvariable

@vbatts

vbatts commented Oct 14, 2020

@vincepri yes, sorry, v1alpha4. And yes, that issue is very related. The secrets access is a part of cloud-init on AWS that we cannot support currently, and upstream Ignition has rejected the multi-part MIME support needed for handling these secrets on AWS. It's not an ideal spot.

@vbatts

vbatts commented Oct 14, 2020

I was looking at #3761; it's the basis of my comment above.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 12, 2021
@fabriziopandini
Member

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jan 12, 2021
@invidian
Member

Hey all, I had a look into #3437 and the comments there, and I plan to open a replacement PR to solve this issue, but I wanted to share my idea for solving it before completing this work.

I would like to propose the following to move things forward:

  1. Add a format field to the bootstrap Secret object, next to the value field, so infrastructure providers can automatically identify the bootstrap data format and act accordingly if needed. Different behavior is required for AWS, for example, as Ignition does not support AWS Secrets Manager, meaning CAPA must upload the Ignition data to S3 so nodes can read it on first boot.
    Such an approach also keeps CABPK cloud-agnostic, as it should be, so each infrastructure provider can use its own technology to securely deliver the bootstrap data
    to the instances.

  2. Add ignition as a valid value for the KubeadmControlPlane.spec.kubeadmConfigSpec.format field. If this field is set to ignition, CABPK will generate bootstrap data in Ignition format rather than cloud-init format.
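To make the first point concrete, a hedged sketch of what a bootstrap Secret carrying such a format hint could look like (key names and contents are illustrative, not the final API):

```yaml
# Sketch: the bootstrap Secret gains a "format" key next to "value",
# letting an infrastructure provider choose a delivery mechanism
# (e.g. CAPA uploading Ignition data to S3 instead of relying on
# cloud-init's user-data handling).
apiVersion: v1
kind: Secret
metadata:
  name: my-machine-bootstrap-data
type: cluster.x-k8s.io/secret
stringData:
  format: ignition
  value: |
    {"ignition": {"version": "3.0.0"}}
```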

Now, the cloud-init config generated by CABPK, together with fields exposed by KubeadmControlPlane.spec.kubeadmConfigSpec like users or ntp, cannot be mapped 1:1 to Ignition, but the same result can be achieved in a different way.

CABPK core uses two features of cloud-init for bootstrapping: runcmd and write_files.

For runcmd, CABPK will generate a kubeadm.service systemd unit and a kubeadm.sh script file, which will include:

  • preKubeadmCommands
  • actual kubeadm join/init command
  • postKubeadmCommands
  • Removal of the kubeadm configuration file, since with Ignition files cannot be written to the /tmp directory the way it is done with cloud-init.
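A rough sketch of how those pieces could be packaged (Container Linux Config form; unit name and paths follow the description above, script contents are illustrative):

```yaml
# Hypothetical sketch: replacing cloud-init's runcmd with a oneshot
# systemd unit that runs a generated bootstrap script once.
systemd:
  units:
    - name: kubeadm.service
      enabled: true
      contents: |
        [Unit]
        Description=kubeadm bootstrap
        ConditionPathExists=!/etc/kubeadm.done
        [Service]
        Type=oneshot
        ExecStart=/etc/kubeadm.sh
        [Install]
        WantedBy=multi-user.target
storage:
  files:
    - path: /etc/kubeadm.sh
      filesystem: root
      mode: 0700
      contents:
        inline: |
          #!/bin/bash
          set -e
          # preKubeadmCommands would be injected here
          kubeadm join --config /etc/kubeadm.yml
          # postKubeadmCommands would be injected here
          rm /etc/kubeadm.yml      # cleanup; /tmp is not usable here
          touch /etc/kubeadm.done  # guard against re-running on reboot
```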

write_files can easily be replaced with storage.files. The code also makes it easy to convert from the internally used format to the other.

However, cloud-init additionally offers Jinja templating for all files to be written, which is used for example by CAPA (example). Such a feature is not available with Ignition.

Fortunately, CABPK does not use templating for bootstrap files, except for settings which might be controlled by the user (like in the CAPA example), which makes things a bit simpler, as templating can be moved to the infrastructure provider (cluster configuration template).

If a user needs to template some parts of the configuration, they can use preKubeadmCommands OR create their own systemd service that runs before the mentioned kubeadm.service unit.

Also, as far as I saw, no major infrastructure provider uses templating extensively or in options other than writing files.

To break down "additional" fields in KubeadmControlPlane.spec.kubeadmConfigSpec:

  • diskSetup - It should be possible to map it to storage.disks.
  • files - Can be mapped 1:1 to storage.files.
  • mounts - It should be possible to map it to storage.filesystems?
  • ntp - Could perhaps be mapped to configuring systemd-timesyncd.
  • users - Can be mapped to passwd.users.
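To illustrate the most direct of these mappings, here is a cloud-init write_files entry and a rough storage.files equivalent (CLC form; the exact schema depends on the Ignition/CLC version, and the file shown is just an example):

```yaml
# cloud-init form (write_files)
write_files:
  - path: /etc/example.conf
    permissions: "0640"
    owner: root:root
    content: |
      key=value
---
# Rough Ignition equivalent in Container Linux Config form (storage.files)
storage:
  files:
    - path: /etc/example.conf
      filesystem: root
      mode: 0640
      contents:
        inline: |
          key=value
```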

  3. Add an ignition field to KubeadmControlPlane.spec.kubeadmConfigSpec with the following structure:
kind: KubeadmControlPlane
spec:
  kubeadmConfigSpec:
    ignition:
      containerLinuxConfig:
        additionalConfig: |
          ---
          systemd:
            units: ...

Right now Fedora CoreOS uses Ignition version 3.0+, while Flatcar is still using 2.3. As suggested in #3437 (comment), only 3.0+ should be used, which can optionally be downgraded to a 2.3-compatible format.

Having the structure above leaves enough room to extend it in the future to something like:

kind: KubeadmControlPlane
spec:
  kubeadmConfigSpec:
    ignition:
      fedoraCoreOSConfig:

Or:

kind: KubeadmControlPlane
spec:
  kubeadmConfigSpec:
    ignition:
      containerLinuxConfig:
        version: 3.0
        additionalConfig: |
          ---
          systemd:
            units: ...

The additionalConfig field will allow users to specify their own configuration in an Ignition-native (or CLC/FCCT) way and to access features not available in kubeadmConfigSpec, like adding systemd units.

The additionalConfig field will be of type string to avoid pulling the entire Ignition config schema into the KubeadmControlPlane CRD, and also to allow using different transpiler versions in the future if desired.

Please let me know what you think and if such addition would be acceptable :)

BTW I've already created a PoC for it, available at https://github.com/kinvolk/cluster-api/commits/invidian/ignition-support. It requires more work though.

@MarcelMue
Contributor

I personally think that the suggestion made by @invidian is quite sound. It would make life for Ignition users a lot simpler and offers a nice basis for integration.
I am not very opinionated about the proposed extensions of kubeadmConfigSpec.

@invidian
Member

Opened PR with changes proposed above: #4172.

@fejta-bot

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 11, 2021
@invidian
Member

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label May 11, 2021
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Aug 9, 2021
@randomvariable
Member

/lifecycle frozen

@k8s-ci-robot k8s-ci-robot added lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Aug 9, 2021
@randomvariable
Member

/lifecycle active

@k8s-ci-robot k8s-ci-robot added lifecycle/active Indicates that an issue or PR is actively being worked on by a contributor. and removed lifecycle/frozen Indicates that an issue or PR should not be auto-closed due to staleness. labels Aug 9, 2021
@randomvariable
Member

/assign @invidian

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. and removed lifecycle/active Indicates that an issue or PR is actively being worked on by a contributor. labels Nov 7, 2021
@invidian
Member

invidian commented Nov 7, 2021

/remove-lifecycle stale

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 7, 2021
@randomvariable
Member

/lifecycle active

@k8s-ci-robot k8s-ci-robot added the lifecycle/active Indicates that an issue or PR is actively being worked on by a contributor. label Nov 8, 2021
@omniproc

omniproc commented Nov 16, 2021

I just wanted to drop in that Jinja templating is not exclusive to cloud-init. You can use it to template whatever you like; all you need is a working Jinja install and a Jinja-formatted template.
So if you want to unify the bootstrap process but still allow two different bootstrap techs, cloud-init and Ignition, using a single templating engine unrelated to those bootstrappers might be a way to deal with that.
You could offer two default bootstrap templates, one based on cloud-init and one based on Ignition. Both need about the same variables passed from CAPI. So you could just have Jinja templates for both bootstrap flavours, run Jinja, and pass it the common variables. The output is the ready-to-use bootstrap YAML/JSON file.
And if the user wants more power to customize the bootstrap process: just replace the default Jinja template.
Even more: all the custom var generation that's currently done in Go in CAPV, e.g. GenerateName, could be offloaded as a Jinja filter, so a user may access that filter at any time or write their own filters if further customization is needed.
It's envsubst on steroids really.
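A minimal sketch of that idea (all template contents and variable names here are invented for illustration, not CAPI's actual templates): one Jinja environment renders both bootstrap flavours from the same variables.

```python
# Sketch: a single templating engine rendering two bootstrap "flavours"
# (cloud-init and Ignition) from one common set of variables.
from jinja2 import Template

common_vars = {
    "hostname": "node-0",
    "join_cmd": "kubeadm join 10.0.0.1:6443",
}

# Illustrative cloud-init template.
cloud_init_tpl = Template(
    "#cloud-config\n"
    "hostname: {{ hostname }}\n"
    "runcmd:\n"
    "  - {{ join_cmd }}\n"
)

# Illustrative Ignition template producing JSON.
ignition_tpl = Template(
    '{"ignition": {"version": "3.0.0"}, '
    '"storage": {"files": [{"path": "/etc/hostname", '
    '"contents": {"source": "data:,{{ hostname }}"}}]}}'
)

print(cloud_init_tpl.render(**common_vars))
print(ignition_tpl.render(**common_vars))
```

Swapping flavours is then just a matter of picking a different template; the variable plumbing on the CAPI side stays the same.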
