Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MLeap for local model scoring #249

Merged
merged 6 commits into from
Mar 24, 2019
Merged

MLeap for local model scoring #249

merged 6 commits into from
Mar 24, 2019

Conversation

tovbinm
Copy link
Collaborator

@tovbinm tovbinm commented Mar 22, 2019

Related issues
Aardpfark is no longer supported.

Describe the proposed solution
Switching to MLeap runtime instead.

Describe alternatives you've considered
N/A

@tovbinm
Copy link
Collaborator Author

tovbinm commented Mar 22, 2019

@alexandrnikitin ^

@codecov
Copy link

codecov bot commented Mar 22, 2019

Codecov Report

Merging #249 into master will decrease coverage by 58.01%.
The diff coverage is 0%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #249       +/-   ##
==========================================
- Coverage   86.41%   28.4%   -58.02%     
==========================================
  Files         312     312               
  Lines       10187    8316     -1871     
  Branches      336     420       +84     
==========================================
- Hits         8803    2362     -6441     
- Misses       1384    5954     +4570
Impacted Files Coverage Δ
...com/salesforce/op/local/OpWorkflowModelLocal.scala 0% <0%> (-81.58%) ⬇️
...sforce/op/stages/base/binary/BinaryEstimator.scala 0% <0%> (-100%) ⬇️
...la/com/salesforce/op/aggregators/Geolocation.scala 0% <0%> (-100%) ⬇️
...ala/com/salesforce/op/testkit/InfiniteStream.scala 0% <0%> (-100%) ⬇️
...la/com/salesforce/op/utils/io/avro/AvroInOut.scala 0% <0%> (-100%) ⬇️
.../salesforce/op/aggregators/FeatureAggregator.scala 0% <0%> (-100%) ⬇️
...ce/op/stages/impl/feature/TextLenTransformer.scala 0% <0%> (-100%) ⬇️
...cala/com/salesforce/op/features/types/OPList.scala 0% <0%> (-100%) ⬇️
...n/scala/com/salesforce/op/readers/CSVReaders.scala 0% <0%> (-100%) ⬇️
...stages/base/sequence/BinarySequenceEstimator.scala 0% <0%> (-100%) ⬇️
... and 221 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 9826b38...a9193b2. Read the comment docs.

@codecov
Copy link

codecov bot commented Mar 22, 2019

Codecov Report

Merging #249 into master will increase coverage by 0.08%.
The diff coverage is 100%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #249      +/-   ##
==========================================
+ Coverage   86.41%   86.49%   +0.08%     
==========================================
  Files         312      312              
  Lines       10187    10184       -3     
  Branches      336      336              
==========================================
+ Hits         8803     8809       +6     
+ Misses       1384     1375       -9
Impacted Files Coverage Δ
...com/salesforce/op/local/OpWorkflowModelLocal.scala 100% <100%> (+18.42%) ⬆️
...es/src/main/scala/com/salesforce/op/OpParams.scala 89.79% <0%> (+4.08%) ⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 9826b38...488f5a8. Read the comment docs.

local/README.md Outdated
@@ -44,6 +39,6 @@ val scoreFn = new OpWorkflowRunnerLocal(workflow).score(opParams)
Below is an example of measured scoring performance on 6m records with 10 fields and 12 transformations applied.
Executed on MacBook Pro i7 3.5Ghz in a single thread.
```
Scored 6,000,000 records in 239s
Average time per record: 0.0399215ms
Scored 6,000,000 records in 710s
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

interesting that there is 3x throughput difference

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I had to warm up the JVM. Updated the numbers.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's slightly faster.

/**
* Internal PFA model representation
* Internal OP model representation
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

MLeap?

@tovbinm tovbinm merged commit a5df82e into master Mar 24, 2019
@tovbinm tovbinm deleted the mt/mleap branch March 24, 2019 04:09
@tovbinm tovbinm mentioned this pull request Apr 10, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants