scheduler: Fix accounting when task ends up in multiple groups #2031
Conversation
LGTM
Codecov Report
@@            Coverage Diff             @@
##           master    #2031      +/-   ##
==========================================
- Coverage   53.74%   53.66%   -0.09%
==========================================
  Files         109      109
  Lines       19194    19193       -1
==========================================
- Hits        10316    10300      -16
- Misses       7631     7640       +9
- Partials     1247     1253       +6
Continue to review full report at Codecov.
Tasks are split between task groups based on common specs. This allows nodes to only be ranked once per group, not once per task.

This logic doesn't work correctly because maps are marshalled in a random order. Currently, the same task can end up in multiple groups (say, if it's updated multiple times, and the marshalling ends up being different). To make sure we don't try to schedule the same task twice within the same batch, use a map for unassignedTasks instead of a linked list.

Note this doesn't fix the brokenness of task spec deduplication based on marshalling the protobuf. This is a fix for the symptom that can be backported, and I'm going to replace the marshalling stuff in a different PR.

Signed-off-by: Aaron Lehmann <[email protected]>
Force-pushed from 281c31e to 83d3652
I've changed this to make
LGTM
LGTM
cc @aluzzardi @dongluochen
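The remaining brokenness mentioned above, that spec deduplication keys on the marshalled protobuf, stems from the fact that Go randomizes map iteration order, so any key derived by walking a map-valued field directly can differ between marshallings of the same spec. A hedged sketch of the usual remedy (not the follow-up PR's actual code; `specKey` and the `labels` field are illustrative assumptions) is to impose an explicit order before hashing:

```go
package main

import (
	"crypto/sha256"
	"fmt"
	"sort"
)

// specKey derives a deterministic dedup key from a map-valued spec
// field by sorting the keys first. Hashing in raw map iteration order
// would produce different keys for identical specs, because Go map
// iteration order is deliberately randomized.
func specKey(labels map[string]string) string {
	keys := make([]string, 0, len(labels))
	for k := range labels {
		keys = append(keys, k)
	}
	sort.Strings(keys) // fixed order -> stable key across runs
	h := sha256.New()
	for _, k := range keys {
		fmt.Fprintf(h, "%s=%s;", k, labels[k])
	}
	return fmt.Sprintf("%x", h.Sum(nil))
}

func main() {
	labels := map[string]string{"env": "prod", "tier": "web", "zone": "a"}
	// The key is identical regardless of map iteration order.
	fmt.Println(specKey(labels) == specKey(labels))
}
```

Until dedup keys are computed this way (or marshalling is made canonical), the map-based unassignedTasks queue is what guards against the same task being scheduled twice.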