Excessive resrc tree search can be a scalability bottleneck? #184

dongahn · 2016-08-23T20:15:13Z

Hi @lipari: I am looking more closely at the scheduler code that may be on the critical path for job throughput as part of #183, and I came across this code.

I may have read the code wrong, but it seems our backfill plugin can perform a full resource tree search many times while trying to reserve the nodes that will be deallocated soonest. My guess is that this may not scale as well as we would like, particularly if the reservation is large and a relatively large number of small jobs are currently running.
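To make the concern concrete, here is a minimal sketch of the cost pattern described above. It is not the flux-sched code: `Resource`, `leaves_free_by`, `reserve_repeated_search`, and `reserve_single_pass` are hypothetical names, and the model assumes one small job per node with a distinct completion time. It compares one full tree search per job completion time against a single traversal that collects release times and sorts them.

```python
from dataclasses import dataclass, field


@dataclass
class Resource:
    """Hypothetical tree vertex; not the flux-sched resrc type."""
    name: str
    free_at: int = 0  # time at which this node becomes free (0 = idle now)
    children: list = field(default_factory=list)


def build_cluster(racks, nodes_per_rack):
    """Build a cluster -> rack -> node tree."""
    root = Resource("cluster")
    for r in range(racks):
        rack = Resource(f"rack{r}")
        for n in range(nodes_per_rack):
            rack.children.append(Resource(f"node{r}-{n}"))
        root.children.append(rack)
    return root


def leaves_free_by(root, t, counter):
    """Full depth-first search: return the leaves that are free at time t."""
    found = []
    stack = [root]
    while stack:
        v = stack.pop()
        counter[0] += 1  # count every vertex visited
        if v.children:
            stack.extend(v.children)
        elif v.free_at <= t:
            found.append(v)
    return found


def reserve_repeated_search(root, end_times, needed):
    """One full tree search per completion time until enough nodes are free."""
    visits = [0]
    for t in sorted(set(end_times)):
        if len(leaves_free_by(root, t, visits)) >= needed:
            return t, visits[0]
    return None, visits[0]


def reserve_single_pass(root, needed):
    """One traversal: collect release times, sort, take the needed-th smallest."""
    visits = [0]
    leaves = leaves_free_by(root, float("inf"), visits)
    times = sorted(v.free_at for v in leaves)
    if len(times) < needed:
        return None, visits[0]
    return times[needed - 1], visits[0]


# Assumed workload: 32 nodes, each running a small job ending at a distinct time.
root = build_cluster(4, 8)
end_times = []
i = 0
for rack in root.children:
    for node in rack.children:
        i += 1
        node.free_at = i
        end_times.append(i)

t_repeat, visits_repeat = reserve_repeated_search(root, end_times, needed=24)
t_single, visits_single = reserve_single_pass(root, needed=24)
print(t_repeat, visits_repeat)
print(t_single, visits_single)
```

Both functions report the same earliest start time, but in this model the repeated search visits the whole tree once per completed job it has to consider, so its cost grows with both the tree size and the number of running jobs.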

You and I also discussed other ways to improve tree traversal and lookup. Please regard this issue as a handle to discuss issues relevant to resrc performance and scalability.

To be clear, I haven't quantified the impact of this on overall job throughput, and I am not asking you to take on any immediate work.

dongahn · 2016-12-28T22:21:02Z

FYI -- this is one of the issues that PR #184 will ultimately address. Leaving it open for now.

dongahn · 2017-10-23T19:39:38Z

FYI -- PR #274 landed, which will be one of the core performance-guaranteeing layers for this problem. The early findings for the scheduler-driven aggregate update scheme are documented in #269.
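As a rough illustration of why aggregates can help here, the sketch below keeps a count of free nodes at every vertex, updates it along the path to the root when the scheduler allocates a node, and skips any subtree whose count is zero during a search. This is an assumption about the general idea only, not the design in #269 or #274; `Vertex`, `allocate`, and `find_free` are hypothetical names.

```python
class Vertex:
    """Hypothetical tree vertex carrying an aggregate free-node count."""

    def __init__(self, name, parent=None):
        self.name = name
        self.parent = parent
        self.children = []
        self.free_count = 0  # number of free leaf nodes in this subtree


def build_cluster(racks, nodes_per_rack):
    """Build a cluster -> rack -> node tree with all nodes initially free."""
    root = Vertex("cluster")
    for r in range(racks):
        rack = Vertex(f"rack{r}", parent=root)
        root.children.append(rack)
        for n in range(nodes_per_rack):
            node = Vertex(f"node{r}-{n}", parent=rack)
            node.free_count = 1
            rack.children.append(node)
        rack.free_count = nodes_per_rack
    root.free_count = racks * nodes_per_rack
    return root


def allocate(leaf):
    """Scheduler-side update: decrement the aggregate on the leaf and its ancestors."""
    v = leaf
    while v is not None:
        v.free_count -= 1
        v = v.parent


def find_free(v, needed, found, stats):
    """Depth-first search that prunes subtrees whose aggregate is zero."""
    stats["visited"] += 1
    if not v.children:
        found.append(v)  # only reached when the leaf's free_count is 1
        return
    for child in v.children:
        if len(found) >= needed:
            return  # enough nodes found; stop early
        if child.free_count == 0:
            stats["pruned"] += 1  # skip the whole subtree without descending
            continue
        find_free(child, needed, found, stats)


# Assumed state: racks 0-2 fully allocated, rack 3 idle.
root = build_cluster(4, 8)
for rack in root.children[:3]:
    for node in rack.children:
        allocate(node)

found = []
stats = {"visited": 0, "pruned": 0}
if root.free_count >= 4:  # the root aggregate answers feasibility in O(1)
    find_free(root, 4, found, stats)
names = [v.name for v in found]
print(names, stats)
```

The trade-off in this sketch is that every allocation pays an update proportional to the tree depth, in exchange for searches that avoid descending into fully busy subtrees.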

dongahn · 2018-03-31T04:17:36Z

Closing this, as such an effort should go into the new resource layer.

dongahn mentioned this issue Sep 16, 2016

Revise Sched's find() and select() operations to leverage visitor/matcher design patterns #193

Closed

dongahn mentioned this issue Oct 29, 2016

planner class: progress towards scheduler-driven aggregate updates #223

Closed

dongahn mentioned this issue Oct 30, 2017

PR to integrate the initial resource-query work into flux-sched #277

Closed

dongahn closed this as completed Mar 31, 2018
