Memory based loadbalancing #3747
Conversation
Would love to see this get pushed through. There are some compensating changes we will need in kube-deploy; I will be happy to take care of them when it is time.
Codecov Report

@@           Coverage Diff            @@
##           master    #3747    +/-  ##
========================================
- Coverage   85.41%   80.92%   -4.5%
========================================
  Files         147      147
  Lines        7070     7093     +23
  Branches      423      408     -15
========================================
- Hits         6039     5740    -299
- Misses       1031     1353    +322

Continue to review the full report at Codecov.
Will need a pass over the tests, great stuff 🎉
@@ -339,6 +345,7 @@ object ShardingContainerPoolBalancer extends LoadBalancerProvider {
@tailrec
def schedule(invokers: IndexedSeq[InvokerHealth],
             dispatched: IndexedSeq[ForcableSemaphore],
             memory: ByteSize,
Shall we make this slotsNeeded and keep it as an integer?
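Applied to the signature quoted above, that would presumably read (remaining parameters unchanged):
@tailrec
def schedule(invokers: IndexedSeq[InvokerHealth],
             dispatched: IndexedSeq[ForcableSemaphore],
             slotsNeeded: Int,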
*/
case class ShardingContainerPoolBalancerConfig(blackboxFraction: Double, invokerBusyThreshold: Int)
case class ShardingContainerPoolBalancerConfig(blackboxFraction: Double, invokerBusyThreshold: ByteSize)
Rename invokerBusyThreshold to something more meaningful?
.getOrElse {
(createContainer(), "recreated")
// Only process request, if there are no other requests waiting for free slots, or if the current request is the next request to process
if (runBuffer.size == 0 || runBuffer.headOption.map(_.msg == r.msg).getOrElse(false)) {
runBuffer.headOption.map(_.msg == r.msg).getOrElse(true)
} else {
r.retryLogDeadline
}
if (!runBuffer.map(_.msg).contains(r.msg)) {
if(!runBuffer.exists(_.msg == r.msg))
.schedule(r.action, r.msg.user.namespace.name, freePool)
.map(container => {
(container, "warm")
})
.map(container => (container, "warm"))
Some(ref)
} else None
if (freeContainers.nonEmpty && freeContainers.map(_._2.memoryLimit.toMB).sum >= memory.toMB) {
if (memory > 0.B) {
You can collapse these ifs to get rid of one level of nesting.
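As an illustration, a collapsed form of the two conditions quoted above could look like this (sketch only):
if (memory > 0.B && freeContainers.nonEmpty && freeContainers.map(_._2.memoryLimit.toMB).sum >= memory.toMB) {
  // decide which free containers to remove
}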
@@ -91,62 +95,89 @@ class ContainerPool(childFactory: ActorRefFactory => ActorRef,
// their requests and send them back to the pool for rescheduling (this may happen if "docker" operations
// fail for example, or a container has aged and was destroying itself when a new request was assigned)
case r: Run =>
val createdContainer = if (busyPool.size < poolConfig.maxActiveContainers) {
// Only process request, if there are no other requests waiting for free slots, or if the current request is the next request to process
if (runBuffer.isEmpty || runBuffer.dequeueOption.exists(_._1.msg == r.msg)) {
Could be collapsed to only one condition
runBuffer.dequeueOption.map(_._1.msg == r.msg).getOrElse(true)
Thinking about it, it makes sense to keep it separate for readability!
On another note: we could extract this into a value isResentFromBuffer and use it to branch the execution later. There are checks further down which implicitly check for this condition; naming it would make that clearer.
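A small sketch of that extraction, reusing the condition from the diff above:
// name the buffer check so that later branches can reuse it
val isResentFromBuffer = runBuffer.dequeueOption.exists(_._1.msg == r.msg)
if (runBuffer.isEmpty || isResentFromBuffer) {
  // schedule the run message as before
}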
.orElse {
if (busyPool
.map(_._2.memoryLimit.toMB)
.sum + freePool.map(_._2.memoryLimit.toMB).sum < poolConfig.userMemory.toMB) {
I think we need to add the action's memory limit to this condition as well.
On a broader note: should we make this a method?
def hasCapacityFor(pool: Map[ActorRef, ContainerData], memory: ByteSize): Boolean =
  pool.map(_._2.memoryLimit.toMB).sum + memory.toMB <= poolConfig.userMemory.toMB
It would then be usable like hasCapacityFor(busyPool ++ freePool, r.action.limits.memory.megabytes.MB).
})
.getOrElse {
(createContainer(r.action.limits.memory.megabytes.MB), "recreated")
}
Can we collapse the map, orElse etc. statements a bit by using () instead of {}?
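For example, the getOrElse quoted above could become a single expression (hypothetical rewrite):
.getOrElse((createContainer(r.action.limits.memory.megabytes.MB), "recreated"))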
busyPool = busyPool + (actor -> data)
freePool = freePool - actor
// Remove the action that gets executed now from the buffer and execute the next one afterwards.
runBuffer = runBuffer.dequeueOption.map(_._2).getOrElse(runBuffer)
Some more information in the comment would be nice, like:
// It is guaranteed that the currently executed message is == the head of the queue, if the queue has any entries
Love the tests ❤️! One should be added to cover an edge case of ContainerPool.remove. Great job!
}

it should "not provide a container from busy pool with non-warm containers" in {
val pool = Map('none -> noData(), 'pre -> preWarmedData())
ContainerPool.remove(pool) shouldBe None
ContainerPool.remove(pool, MemoryLimit.stdMemory) shouldBe List.empty
Would it make sense to add a test verifying that the list is empty if enough capacity cannot be freed?
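A sketch of what such a test could look like, assuming a warmedData() helper that creates a warm container of stdMemory and assuming remove returns an empty list when it cannot free the requested amount:
it should "not provide any containers if not enough capacity can be freed" in {
  val pool = Map('warm -> warmedData())
  ContainerPool.remove(pool, MemoryLimit.stdMemory * 2) shouldBe List.empty
}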
@@ -124,7 +129,7 @@ class ContainerPoolTests
it should "reuse a warm container" in within(timeout) {
val (containers, factory) = testContainers(2)
val feed = TestProbe()
val pool = system.actorOf(ContainerPool.props(factory, ContainerPoolConfig(2, 2), feed.ref))
val pool = system.actorOf(ContainerPool.props(factory, ContainerPoolConfig(MemoryLimit.stdMemory * 4), feed.ref))
Please add a comment somewhere that an action is created with stdMemory by default, so this preserves the previous behavior (4 actions can be scheduled).
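For example, a hypothetical comment next to the pool config:
// actions are created with stdMemory by default, so a pool of 4 * stdMemory preserves the previous behavior (4 actions can be scheduled)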
val feed = TestProbe()

// a pool with slots for 512MB
val pool = system.actorOf(ContainerPool.props(factory, ContainerPoolConfig(512.MB), feed.ref))
Should be 2 * stdMemory
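Applied to the line quoted above, that would presumably read:
// a pool with slots for 512MB (2 actions of stdMemory each)
val pool = system.actorOf(ContainerPool.props(factory, ContainerPoolConfig(MemoryLimit.stdMemory * 2), feed.ref))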
pool ! runMessage
containers(0).expectMsg(runMessage)
pool ! runMessageDifferentAction
containers(1).expectMsg(runMessageDifferentAction)
Comments in here would be nice, like
containers(0).expectMsg(runMessage) // 1 * stdMemory taken
pool ! runMessageDifferentAction
containers(1).expectMsg(runMessageDifferentAction) // 2 * stdMemory taken -> full
...
pool ! runMessageLarge
// need to remove both actions to make space for the large action (needs 2 * stdMemory)
containers(0).expectMsg(Remove)
containers(1).expectMsg(Remove)
containers(2).expectMsg(runMessageLarge)
}
// Action 2 should start immediately as well (without any retries, as there is already enough space in the pool)
containers(1).expectMsg(runMessageDifferentAction)
}
Very nice test! 🎉
The gatling-tests
I've checked the code part only, looks good given my experience in this corner.
The description in ShardingContainerPoolBalancer.scala should be updated to describe the algorithm.
}

def /(other: ByteSize): Double = {
// Without throwing the exception the result would be Infinity here
We could consider making this return a Try instead.
What are the reasons for using a Try here? When dividing Ints, you also get the result directly instead of a Try, don't you?
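For reference, a sketch of the two variants being discussed (assuming ByteSize.toBytes returns the size as a Long, and with scala.util.Try imported for the second variant):
// current approach: guard explicitly, otherwise the Double division would yield Infinity
def /(other: ByteSize): Double = {
  if (other.toBytes == 0) throw new ArithmeticException("Divide by zero")
  toBytes.toDouble / other.toBytes.toDouble
}

// alternative: wrap the computation in a Try and let the caller handle the failure case
def /(other: ByteSize): Try[Double] = Try {
  if (other.toBytes == 0) throw new ArithmeticException("Divide by zero")
  toBytes.toDouble / other.toBytes.toDouble
}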
@@ -456,13 +461,15 @@ object ShardingContainerPoolBalancer extends LoadBalancerProvider {
*
* @param invokers a list of available invokers to search in, including their state
* @param dispatched semaphores for each invoker to give the slots away from
* @param slots Number of slots, that need to be aquired (e.g. memory in MB)
acquired (typo)
Adapt invoker-tests.
…tions will keep the same.
PG2#3520 🔵
@@ -6,7 +6,7 @@ whisk {
use-cluster-bootstrap: false
}
loadbalancer {
invoker-busy-threshold: 4
user-memory: 1024 m
@cbickel Should this be named invoker-user-memory? I see the following exception on startup, as in my case CONFIG_whisk_loadbalancer_invokerUserMemory was not defined:
Exception in thread "main" pureconfig.error.ConfigReaderException: Cannot convert configuration to a whisk.core.loadBalancer.ShardingContainerPoolBalancerConfig. Failures are:
at 'whisk.loadbalancer':
- Key not found: 'invoker-user-memory'.
at pureconfig.package$.getResultOrThrow(package.scala:138)
at pureconfig.package$.loadConfigOrThrow(package.scala:160)
at whisk.core.loadBalancer.ShardingContainerPoolBalancer.<init>(ShardingContainerPoolBalancer.scala:159)
at whisk.core.loadBalancer.ShardingContainerPoolBalancer$.instance(ShardingContainerPoolBalancer.scala:437)
at whisk.core.controller.Controller.<init>(Controller.scala:117)
at whisk.core.controller.Controller$.main(Controller.scala:258)
at whisk.core.controller.Controller.main(Controller.scala)
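Assuming the Scala-side field name stays invokerUserMemory, the config key would presumably need to be renamed along these lines (hypothetical snippet):
loadbalancer {
  # renamed from user-memory so that pureconfig finds the expected key
  invoker-user-memory: 1024 m
}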
You are right.
I'll open a PR to correct this.
Thank you.
#3993 has the fix.
This PR implements the idea that has been discussed in the following mail thread:
https://lists.apache.org/thread.html/dfccf972bc1419fe48dbc23119441108c45f85d53625fd6f8fc04fcb@%3Cdev.openwhisk.apache.org%3E
It bases the number of containers an invoker can spawn on the available memory rather than on the CPU.
In addition, it makes the loadbalancer aware of the amount of available memory on each invoker, and it allows the invoker to create user containers only if there is enough free memory.
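As a rough sketch of the idea (names taken from the diffs discussed above; ForcableSemaphore.tryAcquire(n) is assumed to acquire n permits at once): each invoker's semaphore is sized to its available user memory in MB, and an activation has to acquire as many permits as its memory limit before it may be scheduled on that invoker.
val slotsNeeded = action.limits.memory.megabytes
if (dispatched(invokerIndex).tryAcquire(slotsNeeded)) {
  // enough free memory on this invoker: schedule the activation here
} else {
  // not enough memory left: try the next invoker in the ring
}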