Skip to content
This repository has been archived by the owner on Aug 2, 2022. It is now read-only.

Add coordinating node logic #267

Closed

Conversation

kaituo
Copy link
Member

@kaituo kaituo commented Oct 15, 2020

Note: since there are a lot of dependencies, I only list the main class and test code to save reviewers' time. The build will fail due to missing dependencies. I will use that PR just for review. will not merge it. Will have a big one in the end and merge once after all review PRs get approved.

Issue #, if available:

Description of changes:

This PR adds coordinating node logic. To get feature data, the coordinating node aggregates log entries into multiple entity keys and their corresponding value vectors. Then, the coordinating node sends the key-value pair to the model hosting nodes using consistent hashing. Finally, it collects acknowledgements or exceptions and returns them to the job scheduler.

Testing done:

  1. Will add unit tests.
  2. Manual testing passes.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

This PR adds coordinating node logic. To get feature data, the coordinating node aggregates log entries into multiple entity keys and their corresponding value vectors.  Then, the coordinating node sends the key-value pair to the model hosting nodes using consistent hashing.  Finally, it collects acknowledgements or exceptions and returns them to the job scheduler.

Testing done:
1. Will add unit tests.
2. Manual testing passes.
@codecov
Copy link

codecov bot commented Oct 15, 2020

Codecov Report

Merging #267 into master will not change coverage.
The diff coverage is 80.95%.

Impacted file tree graph

@@            Coverage Diff            @@
##             master     #267   +/-   ##
=========================================
  Coverage     73.01%   73.01%           
  Complexity     1461     1461           
=========================================
  Files           164      164           
  Lines          6834     6834           
  Branches        527      527           
=========================================
  Hits           4990     4990           
  Misses         1594     1594           
  Partials        250      250           
Flag Coverage Δ Complexity Δ
#cli 79.27% <ø> (ø) 0.00 <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ Complexity Δ
...rch/ad/transport/AnomalyResultTransportAction.java 78.59% <80.95%> (ø) 59.00 <13.00> (ø)

});
}

listener.onResponse(new AnomalyResultResponse(0, 0, 0, new ArrayList<FeatureData>()));
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Always return empty anomaly result to job runner to persist ?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We need to return sth to job runner. Is it necessary to persist it? Currently I do as you told me to have a heartbeat.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Heartbeat can be persisted in anomaly detection state.
I'm ok with this currently if no impact to frontend.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I did this for you. If you think it is not necessary, I will remove it.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, you can remove it.

Copy link
Contributor

@ylwu-amzn ylwu-amzn left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thanks for the change!

kaituo added a commit that referenced this pull request Oct 16, 2020
* Add support filtering the data by one categorical variable

This PR is a conglomerate of the following PRs.

#247
#249
#250
#252
#253
#256
#257
#258
#259
#260
#261
#262
#263
#264
#265
#266
#267
#268
#269

This spreadsheet contains the mappings from files to PR number: https://quip-amazon.com/DiHkAmz9oSLu/HC-PR

Testing done:
1. Add unit tests except four classes (excluded in build.gradle). Will add them in the later PR.
2. Manual testing passes.
@kaituo kaituo closed this Oct 16, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants