Internal feature: Stateless scheduler #995

Omrigan · 2024-06-26T10:14:26Z

Problem description / Motivation

Currently, the scheduler acts as the source of truth for the cluster state. We'd like to change that for the following reasons:

1. It makes scheduler replication hard, as it is unclear how to reliably replicate that state.
2. Every scheduler restart resets the state, so a warm-up period is needed for the autoscaler-agent to repopulate it. This mechanism complicates the components' logic and protocol, namely:
   a. The existence of the Buffer values.
   b. The notion of lastPermit.
3. Potentially, the upscaling communication diagram could be simplified to bypass the autoscaler-agent once the scheduler has approved the upscaling.

Feature idea(s) / DoD

A scheduler can be replicated.
A scheduler restart doesn't cause sub-optimal scheduling decisions.
We no longer have the concepts of Buffered resources and lastPermit.

Implementation ideas

The scheduler's state and autoscaler-agent ↔ scheduler protocol can be moved entirely to annotations on the associated VirtualMachine object.
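
As a rough illustration of the idea, the sketch below models a VirtualMachine object as a plain Python dataclass and keeps both the agent's request and the scheduler's approval in its annotations. Everything here is an assumption made for illustration: the annotation keys, the JSON payload shape, and the function names are hypothetical and do not come from the RFC or the codebase. The point it tries to show is that a freshly started scheduler can recompute node usage from the objects alone, without a warm-up period.

```python
import json
from dataclasses import dataclass, field

# Hypothetical annotation keys; the issue does not name any.
REQUESTED_KEY = "example.invalid/requested-resources"
APPROVED_KEY = "example.invalid/approved-resources"


@dataclass
class VirtualMachine:
    """Stand-in for the VirtualMachine object; only the metadata is modeled."""
    name: str
    node: str
    annotations: dict[str, str] = field(default_factory=dict)


def request_resources(vm, cpu, mem_gib):
    """Agent side: record the desired resources on the VM object."""
    vm.annotations[REQUESTED_KEY] = json.dumps({"cpu": cpu, "mem_gib": mem_gib})


def approved_on_node(vms, node):
    """Scheduler side: rebuild a node's usage purely from annotations."""
    total = {"cpu": 0, "mem_gib": 0}
    for vm in vms:
        if vm.node != node or APPROVED_KEY not in vm.annotations:
            continue
        approved = json.loads(vm.annotations[APPROVED_KEY])
        total["cpu"] += approved["cpu"]
        total["mem_gib"] += approved["mem_gib"]
    return total


def reconcile(vm, vms, capacity):
    """Scheduler side: approve the pending request only if it fits on the node."""
    if REQUESTED_KEY not in vm.annotations:
        return
    requested = json.loads(vm.annotations[REQUESTED_KEY])
    current = json.loads(
        vm.annotations.get(APPROVED_KEY, '{"cpu": 0, "mem_gib": 0}')
    )
    used = approved_on_node(vms, vm.node)
    # Replace this VM's current approval with the new request and check capacity.
    fits = all(used[k] - current[k] + requested[k] <= capacity[k] for k in capacity)
    if fits:
        vm.annotations[APPROVED_KEY] = json.dumps(requested)
```

In this sketch the scheduler holds no state between calls to `reconcile`, so any replica (or a restarted instance) would reach the same decision from the same set of objects. A real implementation would also need to handle concurrent updates to the annotations, which is not modeled here.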

Related work

The implementation of neondatabase/neon#8111 might allow us to significantly simplify autoscaler-agent (or even merge its remaining functionality into other components).

Further information

There are more details in the internal RFC.

I think the comments were just never updated when the autoscaler-agent was moved from a per-pod sidecar to a per-node daemonset. Found this while working on some background work for #995.

Similar to what was done in #1055, we need to explicitly add tolerations to the scheduler to get it to be recreated more quickly on node failure. This is particularly necessary because we don't have #995. We could wait for that, but it's a lot of work, and this is a small thing we can do in the meantime. Fixes neondatabase/cloud#17298, part of neondatabase/cloud#14114.

This was referenced Nov 18, 2024

api: Touch-up comments on AgentRequest #1143

Merged

agent,plugin: Remove metrics from AgentRequests #1145

Draft

scheduler: Shorten tolerations for node failure #1146

Merged

sharnoff self-assigned this Nov 18, 2024
