Add Eager mode for ScaledJobs #5114
Comments
Which scaler do you use? I think setting the queueLength parameter on the queue scaler can shorten the time.
@SpiritZhou Thanks for your reply. We configured the scaler as above.
What is your ScaledJob config? What do you set as maxReplicaCount?
```yaml
---
apiVersion: keda.sh/v1alpha1
kind: ScaledJob
metadata:
  name: worker
spec:
  jobTargetRef:
    template:
      metadata:
        labels:
          app: worker
      spec:
        containers:
          - name: worker
            image: worker-image
            imagePullPolicy: Always
            resources:
              requests:
                memory: "4Gi"
                cpu: "1"
            env:
              - name: WORKER_TYPE
                value: pilot
              - name: RB_USERNAME
                valueFrom:
                  secretKeyRef:
                    name: rabbitmq-instance-default-user
                    key: username
              - name: RB_PASSWORD
                valueFrom:
                  secretKeyRef:
                    name: rabbitmq-instance-default-user
                    key: password
              - name: RABBITMQ_URL
                value: amqp://$(RB_USERNAME):$(RB_PASSWORD)@rabbitmq-instance.arc:5672
        restartPolicy: Never
    backoffLimit: 1
  pollingInterval: 10            # Optional. Default: 30 seconds
  maxReplicaCount: 10            # Optional. Default: 100
  successfulJobsHistoryLimit: 0  # Optional. Default: 100. How many completed jobs should be kept.
  failedJobsHistoryLimit: 5      # Optional. Default: 100. How many failed jobs should be kept.
  triggers:
    - type: rabbitmq
      metadata:
        queueName: woker_queue
        hostFromEnv: RABBITMQ_URL
        mode: QueueLength
        value: "1"
```

@zroubalik Basically like this.
I have a question: if you want to process as much as possible and then finish, doesn't ScaledObject fit your use case better? You can set queueLength: 1, and all the instances will be removed at the end if you set …
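Roughly something like this, as a minimal sketch of that suggestion (assumes scale-to-zero via minReplicaCount: 0 and a Deployment named worker running the same image as the config above; the names are illustrative, not taken from this thread):

```yaml
# Sketch only: a ScaledObject alternative, assuming the workload runs as a
# Deployment named "worker" and scales back to zero when the queue is empty.
apiVersion: keda.sh/v1alpha1
kind: ScaledObject
metadata:
  name: worker
spec:
  scaleTargetRef:
    name: worker            # illustrative: a Deployment running the worker image
  minReplicaCount: 0        # remove all instances once the queue is drained
  maxReplicaCount: 10
  triggers:
    - type: rabbitmq
      metadata:
        queueName: woker_queue
        hostFromEnv: RABBITMQ_URL
        mode: QueueLength
        value: "1"          # one replica per pending message, up to maxReplicaCount
```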
@JorTurFer Because these workers are inherently long-running, I naturally preferred ScaledJob over ScaledObject. This section concerned me, since with a ScaledObject pods can be terminated unexpectedly during scale-in. Getting back to ScaledJob: imagine a case with 3 running pods and another 3 messages waiting in the queue, where each message takes 3 hours or even longer to process. Wouldn't it be better to drain the queue and run 6 pods in parallel, well within our affordable limit of 10 replicas?
You are right, ScaledJob is the best fit for your use case.
@JorTurFer You are correct. Thanks for the solution. I will try it out.
We should revisit the scaling behavior for ScaledJob, as that ^ is something that should be doable.
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.
This issue has been automatically closed due to inactivity.
Proposal
According to this piece of the documentation and the surrounding information, KEDA doesn't scale the number of jobs up to the maximum. In our case, however, we want to launch as many jobs as possible and keep the queue as empty as it can be. So, could we have an additional Eager mode so that users like me can shorten the overall waiting time?
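A rough sketch of how this could look in the ScaledJob spec. The scalingStrategy field already exists (with "default", "custom" and "accurate"); the "eager" value shown here is the hypothetical addition being proposed, not an existing option at the time of writing:

```yaml
# Hypothetical configuration under this proposal: strategy "eager" would start
# jobs for all pending messages up to maxReplicaCount instead of stopping at
# the default calculation.
apiVersion: keda.sh/v1alpha1
kind: ScaledJob
metadata:
  name: worker
spec:
  jobTargetRef:
    template:
      spec:
        containers:
          - name: worker
            image: worker-image
        restartPolicy: Never
  scalingStrategy:
    strategy: "eager"     # proposed value; existing options are default, custom, accurate
  maxReplicaCount: 10
  triggers:
    - type: rabbitmq
      metadata:
        queueName: woker_queue
        hostFromEnv: RABBITMQ_URL
        mode: QueueLength
        value: "1"
```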
Use-Case
We want to launch as many jobs as possible and keep the queue as empty as possible.
Is this a feature you are interested in implementing yourself?
Maybe
Anything else?
No response