helm: better support parallel deployments #650

marquiz · 2021-11-16T11:41:26Z

What would you like to be added:

Other Helm apps may also want to add NFD as a dependency. Actually, custom/local NFD sources look like they were designed for that reason. What needs to be done here is to make sure that there is a configuration for adding NFD as a Helm chart dependency in a way that it won't overlap with other NFD Helm chart dependencies and/or other NFD instances that may be present in the cluster (e.g. the volume mounts that configure the local source is an example of such an overlap, and this PR is another).

We should properly support multiple parallel Helm deployments. That is, sufficiently isolate them to avoid races/clashes

Why is this needed:

Helm makes it possible to have multiple parallel deployments and we should try our best to support this

k8s-triage-robot · 2022-02-14T12:34:24Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot · 2022-03-16T13:27:38Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

marquiz · 2022-03-16T16:29:08Z

Still a valid issue, helping hands would be welcome
/remove-lifecycle rotten

k8s-triage-robot · 2022-06-14T17:14:48Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

marquiz · 2022-07-08T12:29:20Z

#831 certainly is one step in solving this issue.

@jasine do you have any comments on this issue? Have you tried multiple parallel Helm-based NFD deployments?

/remove-lifecycle stale

jasine · 2022-07-09T05:55:09Z

#831 certainly is one step in solving this issue.

@jasine do you have any comments on this issue? Have you tried multiple parallel Helm-based NFD deployments?

/remove-lifecycle stale

@marquiz I just tried to make a second deployment on cluster and failed with blowing error

Error: INSTALLATION FAILED: rendered manifests contain a resource that already exists. Unable to continue with install: CustomResourceDefinition "nodefeaturerules.nfd.k8s-sigs.io" in namespace "" exists and cannot be imported into the current release: invalid ownership metadata; annotation validation error: key "meta.helm.sh/release-name" must equal "nfd-2": current value is "nfd-1"

and the reason is crd nodefeaturerules.nfd.k8s-sigs.io placed under manifests folder, while helm encourage crds placed at crds folder

when I renamed manifests to crds, multiple parallel Helm-based NFD deployed succeed.

eliaskoromilas · 2022-07-11T09:05:35Z

I think that, as of NFD v0.11.1, there is no "real" need for parallel deployments. Even custom feature sources can be applied with a simple NodeFeatureRule. A cluster-scoped NFD deployment is just enough to address the needs of every app that either uses the out-of-the-box feature labels, dynamically specifies new features using the local filesystem, or registers custom rules through the operator.

Having said that, I wouldn't suggest using NFD as a direct Helm dependency, but instead as a requirement in a higher layer (e.g. Helmfile).

marquiz · 2022-08-09T06:48:58Z

I think that, as of NFD v0.11.1, there is no "real" need for parallel deployments. Even custom feature sources can be applied with a simple NodeFeatureRule. A cluster-scoped NFD deployment is just enough to address the needs of every app that either uses the out-of-the-box feature labels, dynamically specifies new features using the local filesystem, or registers custom rules through the operator.

Yeah, I fully agree on this 👍 We're really trying to make it unnecessary to have multiple parallel NFD deployments. The ´NodeFeatureRule` already causes some fuss with parallel deployments as it's a cluster-scoped (non-namespaced) resource and only the default instance (by default) is processing those resources. #828 will complicate matters further, probably making parallel NFD installations unsupported when/if gRPC communication is dropped from NFD. So, I think I'm not going to invest my time on this issue anymore.

Having said that, I'm still open to contributions if somebody wants to work on this. Currently I see two shortcomings in parallel installs: CRDs (thanks @jasine) and the hostPath mounts for hooks and feature files (/etc/kubernetes/node-feature-discovery/{source.d/ | features.d/}). CRD would be an easy fix, just rename. For mounts we could think about "namespacing" the host dirs e.g. /etc/kubernetes/node-feature-discovery/source.d.{INSTANCE}/

k8s-triage-robot · 2022-11-07T07:07:29Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue or PR as fresh with /remove-lifecycle stale
Mark this issue or PR as rotten with /lifecycle rotten
Close this issue or PR with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

vaibhav2107 · 2022-11-14T07:40:21Z

/remove-lifecycle stale

k8s-triage-robot · 2023-02-12T08:26:45Z

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle stale
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-triage-robot · 2023-03-14T08:59:28Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Mark this issue as fresh with /remove-lifecycle rotten
Close this issue with /close
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-triage-robot · 2023-04-13T09:29:22Z

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied
After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue with /reopen
Mark this issue as fresh with /remove-lifecycle rotten
Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

k8s-ci-robot · 2023-04-13T09:29:26Z

@k8s-triage-robot: Closing this issue, marking it as "Not Planned".

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

After 90d of inactivity, lifecycle/stale is applied

After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied

After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

Reopen this issue with /reopen

Mark this issue as fresh with /remove-lifecycle rotten

Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

marquiz added the kind/feature Categorizes issue or PR as related to a new feature. label Nov 16, 2021

marquiz mentioned this issue Nov 16, 2021

deployment: Implicitly generate the worker ConfigMap name #640

Merged

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 14, 2022

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Mar 16, 2022

k8s-ci-robot removed the lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. label Mar 16, 2022

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jun 14, 2022

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Jul 8, 2022

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 7, 2022

k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Nov 14, 2022

k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Feb 12, 2023

k8s-ci-robot added lifecycle/rotten Denotes an issue or PR that has aged beyond stale and will be auto-closed. and removed lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. labels Mar 14, 2023

k8s-ci-robot closed this as not planned Won't fix, can't repro, duplicate, stale Apr 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

helm: better support parallel deployments #650

helm: better support parallel deployments #650

marquiz commented Nov 16, 2021

k8s-triage-robot commented Feb 14, 2022

k8s-triage-robot commented Mar 16, 2022

marquiz commented Mar 16, 2022

k8s-triage-robot commented Jun 14, 2022

marquiz commented Jul 8, 2022

jasine commented Jul 9, 2022

eliaskoromilas commented Jul 11, 2022

marquiz commented Aug 9, 2022

k8s-triage-robot commented Nov 7, 2022

vaibhav2107 commented Nov 14, 2022

k8s-triage-robot commented Feb 12, 2023

k8s-triage-robot commented Mar 14, 2023

k8s-triage-robot commented Apr 13, 2023

k8s-ci-robot commented Apr 13, 2023

helm: better support parallel deployments #650

helm: better support parallel deployments #650

Comments

marquiz commented Nov 16, 2021

k8s-triage-robot commented Feb 14, 2022

k8s-triage-robot commented Mar 16, 2022

marquiz commented Mar 16, 2022

k8s-triage-robot commented Jun 14, 2022

marquiz commented Jul 8, 2022

jasine commented Jul 9, 2022

eliaskoromilas commented Jul 11, 2022

marquiz commented Aug 9, 2022

k8s-triage-robot commented Nov 7, 2022

vaibhav2107 commented Nov 14, 2022

k8s-triage-robot commented Feb 12, 2023

k8s-triage-robot commented Mar 14, 2023

k8s-triage-robot commented Apr 13, 2023

k8s-ci-robot commented Apr 13, 2023