-
Notifications
You must be signed in to change notification settings - Fork 225
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Return the OOF instances for classifier-based drift detectors (ClassifierDrift and SpotTheDiffDrift) #665
Conversation
Check out this pull request on See visual diffs & provide feedback on Jupyter Notebooks. Powered by ReviewNB |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM pending CI passing.
Codecov Report
Additional details and impacted files@@ Coverage Diff @@
## master #665 +/- ##
==========================================
+ Coverage 79.09% 79.12% +0.02%
==========================================
Files 126 126
Lines 8870 8895 +25
==========================================
+ Hits 7016 7038 +22
- Misses 1854 1857 +3
Flags with carried forward coverage won't be shown. Click here to find out more.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, would suggest updating the changelog as well to reduce burden come release time.
…fierDrift and SpotTheDiffDrift) (#665) * add out-of-fold instances to the return dict for classifier detectors * update docs * update score return type * fix typo and mypy error * extend to list inputs and update score return types * add Union import * fix type error * add changelog
When using the ClassifierDrift or SpotTheDiffDrift detectors, you can return the prediction probabilities on the out-of-fold instances. However, when you use
train_size
instead of cross-validation, you don't know which instances these probabilities correspond to, since you don't know which reference and test instances are used for training or prediction. This PR adds the associated reference and test instances to the detector's return dictionary.