
Add inter process communication API for external features #858

Closed
eero-t opened this issue Aug 15, 2022 · 14 comments
Labels
kind/feature: Categorizes issue or PR as related to a new feature.
lifecycle/rotten: Denotes an issue or PR that has aged beyond stale and will be auto-closed.

Comments


eero-t commented Aug 15, 2022

What would you like to be added:

A local socket API to specify and update NFD external feature labels. This is an extension of #857.

Why is this needed:

Hooks are going away for the reasons explained in #855 / #856, whereas life-cycle management (#857) and updating [1] of feature files are somewhat annoying.

[1] Because one needs to use an atomic file write to avoid races between writing the feature file and NFD reading it:

#!/bin/sh
# Generate labels into a temporary file first, then move it into place.
# The mv within the same filesystem is an atomic rename, so NFD never
# sees a partially written feature file.
set -e
cd /etc/kubernetes/node-feature-discovery/features.d/
mkdir -p tmp
/gen-labels > tmp/my-labels
mv tmp/my-labels .

Proposed solution:

  • At startup, NFD creates a node-local socket to which clients can connect:
    • Either a Unix socket in a host directory clients can access (= requires FS access handling), or
    • A network socket / service limited to the node via the worker Service's spec.internalTrafficPolicy: Local setting
  • External feature clients connect to this socket on startup and communicate using a few simple JSON structs (see the sketch after this list)
  • The NFD startup struct contains:
    • The default feature check interval (in seconds) which the client should use (for wakeup alignment), unless the client's (CLI) options tell it to use some other interval
  • The client connection startup struct contains:
    • The client origin ID (name), so that NFD can log it and reject duplicate clients
    • A timeout value (in seconds) after which NFD should drop the labels provided by the client
  • Whenever updates are needed, the client sends a new label/value list, which NFD stores under the client's origin ID
  • NFD logs an error and drops the connection if the client either:
    • Gives invalid JSON content, or
    • Provides a label which already exists under a different origin ID
  • When a client connection drops, NFD sets a timer for removing its labels. If the client re-connects within the timeout, the timer is canceled; otherwise its labels are removed
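
A rough sketch of what these JSON messages could look like, expressed as Go structs. The type and field names are purely illustrative assumptions; nothing like this exists in NFD today:

// Hypothetical message types for the proposed worker <-> client protocol.
// Names and fields are illustrative only, not an existing NFD API.
package ipc

// ServerHello is sent by the NFD worker right after a client connects.
type ServerHello struct {
	// Default feature check interval (seconds) that clients should align
	// their wakeups to, unless their own command line overrides it.
	DefaultIntervalSec int `json:"defaultIntervalSec"`
}

// ClientHello is the first message a connecting external feature client sends.
type ClientHello struct {
	// Origin ID (name) of the client; logged by NFD, used to reject
	// duplicate clients and to detect label conflicts between origins.
	OriginID string `json:"originID"`
	// Timeout (seconds) after which NFD should drop this client's labels
	// if the connection is not re-established.
	TimeoutSec int `json:"timeoutSec"`
}

// LabelUpdate carries the client's label/value list, stored by NFD under
// the client's origin ID.
type LabelUpdate struct {
	Labels map[string]string `json:"labels"`
}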

Optimizations:

  • While a simple client may send all labels at a regular interval, better ones should send updates only for new labels and new label values (see the diff sketch after this section)
  • The NFD master needs to be informed only when there are new labels or label values, so the NFD worker should check whether any were added or updated
  • Labels under other origin IDs need to be checked for conflicts only when a client provides a new label

(Feature updates are expected to be very rare, so spending a bit more memory + CPU on the client & worker side to check for updates is worthwhile.)
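
For the delta optimization above, the client-side check could be as simple as diffing the freshly generated label set against the last one sent. A minimal sketch (hypothetical helper, not existing NFD code); removed labels would additionally need an explicit delete message or a full resend, which is omitted here:

// changedLabels returns only the labels whose value is new or different
// compared to what was last sent; nil means there is nothing to send.
func changedLabels(last, current map[string]string) map[string]string {
	var diff map[string]string
	for k, v := range current {
		if old, ok := last[k]; !ok || old != v {
			if diff == nil {
				diff = map[string]string{}
			}
			diff[k] = v
		}
	}
	return diff
}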

Life-time management (#857) for socket-based external features is implemented with:

  • a timeout for removing a client's labels after it disconnects, and
  • label origin ID checking
eero-t added the kind/feature label on Aug 15, 2022

eero-t commented Aug 15, 2022

FIFOs in non-blocking mode could also be used for this instead of (or in addition to) sockets. This would allow clients to be shell scripts in addition to real programs.
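
On the NFD side, a non-blocking FIFO reader could look roughly like the sketch below. The path, file mode and "one message per line" framing are assumptions for illustration:

package main

import (
	"bufio"
	"fmt"
	"os"
	"syscall"
)

func main() {
	// Assumed location; any host path both NFD and the client can access would do.
	const fifoPath = "/etc/kubernetes/node-feature-discovery/features.d/my-client.fifo"

	// Create the FIFO if it does not exist yet.
	if err := syscall.Mkfifo(fifoPath, 0o600); err != nil && !os.IsExist(err) {
		panic(err)
	}

	// O_NONBLOCK lets the open succeed even when no writer is connected yet.
	// With no writer, reads return EOF, so a real implementation would poll
	// or re-open instead of exiting after the first EOF as this sketch does.
	f, err := os.OpenFile(fifoPath, os.O_RDONLY|syscall.O_NONBLOCK, 0)
	if err != nil {
		panic(err)
	}
	defer f.Close()

	// Assume newline-delimited messages (e.g. one JSON document per line).
	scanner := bufio.NewScanner(f)
	for scanner.Scan() {
		fmt.Println("received:", scanner.Text())
	}
}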


eero-t commented Aug 15, 2022

Performance impact

Compared to hooks and feature files, a dummy implementation of this could increase resource usage somewhat, but with additional effort it will be lower.

Wakeups / latency

One difference of a socket-based API compared to hooks is that the client decides how often it needs to check the system for updates. My recommendation would be for NFD to send clients a default update interval (given on the NFD command line) when they connect.

Clients can choose to ignore it if their command line specifies an override, but by default they should heed it. This way it's easier for k8s admins to centrally lower the idle-node wakeup frequency, and to do per-client overrides when lower latency is needed.
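
In the client this boils down to a trivial precedence rule; a minimal sketch with an assumed CLI flag value and the server-provided default:

import "time"

// effectiveInterval returns the scan interval the client should use: the
// value from its own command line wins if set, otherwise the default
// interval that NFD announced at connect time is used.
func effectiveInterval(flagIntervalSec, serverDefaultSec int) time.Duration {
	if flagIntervalSec > 0 { // explicit CLI override
		return time.Duration(flagIntervalSec) * time.Second
	}
	return time.Duration(serverDefaultSec) * time.Second // NFD-provided default
}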

Memory and CPU usage

If the label value update functionality is in a separate/dedicated NFD client process, that will use more memory than hook programs, which are invoked by NFD only at some interval and not all at the same time / in parallel. Hook invocations can use more CPU than idling NFD clients, though.

E.g. in the device plugins case, I would recommend folding the NFD hook functionality into the plugin itself, as device plugins are always running and need to scan sysfs occasionally anyway. This way memory usage would be lower than with separate hook programs.

@mythi Any comments from the device plugins side on this proposal?


mythi commented Aug 17, 2022

@mythi Any comments from the device plugins side on this proposal?

not from device plugins perspective but a generic comment: worker API for this sounds unnecessary since the external features could also be pushed to master directly, right?


eero-t commented Aug 17, 2022

This ticket could be pivoted from a local NFD worker API to a master remote service API instead, assuming that is suitable for updating features directly.

Looking at NFD master code:

The impact of using the current master remote API directly seems to be the following...

Clients:

  • Could provide an incorrect node name
  • Would need to authenticate to the master to access the remote labeling API
  • Having authorization for that, they could probably also access other master APIs

Besides that security impact, this would also complicate the duplicate detection for the life-cycle management proposed in #857 (duplicate feature detection is intended to catch out obsolete clients and their feature files).

If some labels went directly to master, the master would need to add an origin ID to all labels (= a remote API change) and do per-node label origin ID checks to detect conflicts, instead of the worker. IMHO doing that in the NFD worker would be easier and more robust (when the worker creates the origin ID, a faulty client can't fake it).

Moving work from workers to the master would also load it more, which might be a problem in larger clouds if NFD does not support master replication yet.


marquiz commented Aug 18, 2022

not from device plugins perspective but a generic comment: worker API for this sounds unnecessary since the external features could also be pushed to master directly, right?

Nope, it can't atm. Sending a labeling request removes all other labels from the node, i.e. in practice each node can only have one client. One inconvenience from the deployment pov would also be setting up TLS authentication.

Moreover, there is a plan to ditch the gRPC communication completely and replace it with a CRD API (#828). Maybe that could provide a meaningful way for extensions to create features/labels. Finalizers could be used to delete features when an extension is undeployed.


eero-t commented Sep 5, 2022

If one wants to get rid of a host volume too, k8s has a (v1.23) Beta feature (enabled by default) which can help...

When a Service's spec.internalTrafficPolicy is set to Local, only node-local endpoints are considered for the service: https://github.com/kubernetes/enhancements/blob/master/keps/sig-network/2086-service-internal-traffic-policy/README.md

I.e. one would use a network socket instead of a Unix one, but tell k8s to limit it to within the node.
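
As a sketch of what that could mean in practice, using Kubernetes API types (the Service name, namespace, selector and port below are assumptions, not part of any actual NFD manifest):

import (
	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/util/intstr"
)

// nfdWorkerIPCService builds a Service whose internalTrafficPolicy is Local,
// so client pods only ever reach the nfd-worker endpoint on their own node.
func nfdWorkerIPCService() *corev1.Service {
	local := corev1.ServiceInternalTrafficPolicyLocal
	return &corev1.Service{
		ObjectMeta: metav1.ObjectMeta{
			Name:      "nfd-worker-ipc",
			Namespace: "node-feature-discovery",
		},
		Spec: corev1.ServiceSpec{
			Selector:              map[string]string{"app": "nfd-worker"},
			InternalTrafficPolicy: &local,
			Ports: []corev1.ServicePort{{
				Name:       "ipc",
				Port:       8080,
				TargetPort: intstr.FromInt(8080),
			}},
		},
	}
}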

This would imply the following changes to the life-cycle management I originally suggested:

  • No need for socket file monitoring / removal; NFD listens on the socket and clients connect to it, not the other way round
  • The client needs to provide the feature origin ID, instead of NFD using the Unix socket file name for that

=> I'll change the original proposal so that it works both for Unix and network sockets.

eero-t changed the title from "Add local socket API for external features" to "Add inter process communication API for external features" on Sep 22, 2022
@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-ci-robot added the lifecycle/stale label on Dec 21, 2022

eero-t commented Dec 21, 2022

/remove-lifecycle stale

k8s-ci-robot removed the lifecycle/stale label on Dec 21, 2022

marquiz commented Dec 23, 2022

TBH, this sounds overly complicated and non-cloud-native-style to me

I suggest trying to use the new NodeFeature CRD API
https://kubernetes-sigs.github.io/node-feature-discovery/v0.12/usage/customization-guide#nodefeature-custom-resource


eero-t commented Dec 23, 2022

TBH, this sounds overly complicated and non-cloud-native-style to me

That kind of communication is only a few lines of code, but I don't really care how it's implemented.

The main point is feature life-cycle management (see #857).

I suggest trying to use the new NodeFeature CRD API https://kubernetes-sigs.github.io/node-feature-discovery/v0.12/usage/customization-guide#nodefeature-custom-resource

This doesn't even mention feature life-cycle.

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle stale
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

k8s-ci-robot added the lifecycle/stale label on Mar 23, 2023
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues.

This bot triages un-triaged issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue as fresh with /remove-lifecycle rotten
  • Close this issue with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle rotten

k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label on Apr 22, 2023
@k8s-triage-robot

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Reopen this issue with /reopen
  • Mark this issue as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

k8s-ci-robot closed this as not planned (Won't fix, can't repro, duplicate, stale) on May 22, 2023
@k8s-ci-robot

@k8s-triage-robot: Closing this issue, marking it as "Not Planned".

In response to this:

The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs.

This bot triages issues according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Reopen this issue with /reopen
  • Mark this issue as fresh with /remove-lifecycle rotten
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/close not-planned

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
