-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
1835: fix remaining issues in RawData::getModeledComm
#1842
Conversation
Pipelines resultsPR tests (gcc-6, ubuntu, mpich) Build for 05cc300
PR tests (gcc-5, ubuntu, mpich) Build for 05cc300
PR tests (clang-3.9, ubuntu, mpich) Build for 05cc300
PR tests (gcc-9, ubuntu, mpich, zoltan) Build for 05cc300
PR tests (gcc-7, ubuntu, mpich, trace runtime, LB) Build for 05cc300
PR tests (gcc-8, ubuntu, mpich, address sanitizer) Build for 05cc300
PR tests (gcc-10, ubuntu, openmpi, no LB) Build for 05cc300
PR tests (clang-5.0, ubuntu, mpich) Build for 05cc300
PR tests (clang-9, ubuntu, mpich) Build for 05cc300
PR tests (nvidia cuda 10.1, ubuntu, mpich) Build for 05cc300
PR tests (clang-13, alpine, mpich) Build for 05cc300
PR tests (nvidia cuda 11.0, ubuntu, mpich) Build for 05cc300
PR tests (clang-12, ubuntu, mpich) Build for 05cc300
PR tests (clang-11, ubuntu, mpich) Build for 05cc300
PR tests (clang-13, ubuntu, mpich) Build for 05cc300
PR tests (clang-14, ubuntu, mpich) Build for 05cc300
PR tests (intel icpx, ubuntu, mpich) Build for 05cc300
PR tests (gcc-11, ubuntu, mpich) Build for 05cc300
PR tests (gcc-12, ubuntu, mpich) Build for 05cc300
PR tests (intel icpc, ubuntu, mpich) Build for 05cc300
PR tests (clang-10, ubuntu, mpich) Build for 05cc300
|
Codecov Report
@@ Coverage Diff @@
## develop #1842 +/- ##
===========================================
+ Coverage 84.33% 84.39% +0.05%
===========================================
Files 758 760 +2
Lines 26715 26776 +61
===========================================
+ Hits 22531 22598 +67
+ Misses 4184 4178 -6
|
// observer pointer to the underlying comm data | ||
std::unordered_map<PhaseType, CommMapType> const* proc_comm_; | ||
|
||
TimeType per_msg_weight_ = 0.001; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Up for discussion what the default values should be.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@ppebay Does LBAF use a per-message weight of zero (meaning it only considers bytes, not the number of messages)?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Going with following defaults:
per_msg_weight_ = 0.0;
per_byte_weight_ = 1.0;
to make the behavior consistent with current LBAF implementation. This means TemperedWMin
's beta
parameter is acting as per byte coefficient.
|
||
namespace vt { namespace vrt { namespace collection { namespace balance { | ||
|
||
struct CommModel : public ComposedModel { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Naming: how should this be called?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Maybe something reflecting that the model is an affine combination of the number of messages and their size?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I went with WeightedMessages
.
9283f7b
to
b34f45e
Compare
b34f45e
to
25d4c28
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks pretty good. It could be made a little more rigorous by making the different subphases not all have the same load but I'm okay with it as is. I think there are some long lines that should be wrapped.
781c7dc
to
6d6be22
Compare
6d6be22
to
05cc300
Compare
Rebased on top of |
fixes #1835
fixes #1836
fixes #1837
Note: these were tiny issues (created per comment), so it's convenient to fix them with a single PR.
TODO:
test_model_comm_overhead.nompi.cc
)CommOverhead
? (no)