-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Semi-automatic summary statistics and sample weighting #429
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…to distance function
Codecov Report
@@ Coverage Diff @@
## develop #429 +/- ##
============================================
- Coverage 87.34% 49.76% -37.58%
============================================
Files 103 107 +4
Lines 6098 6561 +463
============================================
- Hits 5326 3265 -2061
- Misses 772 3296 +2524
Continue to review full report at Codecov.
|
* Add distance weight plot * add colors * enable passing keys
… in adaptive distances (#467) * implement only accepted particles for scale calculation * add test * fix indent
Merged
EmadAlamoudi
pushed a commit
that referenced
this pull request
Jun 30, 2022
* init * limit look-ahead sample number in delayed mode * update releasenotes -> 0.10.15 * refactor: weight normalization applied to all particles; pass sample to distance function * fix typo in changelog * fix tmp changes * fixup * pycharm annoys me * fixit * fix population test * fix zero division error * fix docs * whatever * remove file * init * tmp * tmp * refactor adaptive distances: sumstat + vectorize * add working version of sumstat and predictor modules * whatever * add nbs * handle trivial statistics better * normalize info weighting correctly * refactor anew info weighting + normalization + gp and layer handles * fix flake8 * add lasso sumstat * set indices to keep correctly * add option to not normalize per parameter in info weight * cont * implement late model use * remove slad * tidy up * update nbs; fix various things * add predictor test * add model selection test * additional tests * update readme; add raise tests * add sumstat test * add test for dict2arr * test info weighting * test sample construction * test fit index construction * test inf norm; test scales errors * fixup * implement subsetting * fix imports * test augmentation * add missing base class dependency * move worker signup up * add logger * always normalize linreg inputs; postpone default fit indices * do not clear up redis server * fix typo * reset default scale function from rmsd to std for stability in most cases * cont * cont * update * add tests * whatever * Allow fitting at simulation-based events (#462) * Allow fitting at simulation-based events * update nb * cont * fix test * fix test * Add distance weight plot (#463) * fix wrong deviation threshold 0.5 -> 0.33 * Small fixes (#466) * Add distance weight plot * add colors * enable passing keys * integer coordinates * implement option to use only accepted particles for scale calculation in adaptive distances (#467) * implement only accepted particles for scale calculation * add test * fix indent * add max mlp method * log fitting time * add train-test-split model selection method * better info weight calculation * add pre_before_fit and from_events * fix * change default to weights * normalize in subsetter * add n_sample option to data plot * fix stuff * allow kwargs in distance weights plot * add pcmad convenience * apply la normalization to all particles * fix defaults * final edits Co-authored-by: Yannik Schälte <[email protected]> Co-authored-by: Yannik Schälte <[email protected]>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Breaking changes:
(Adaptive)PNormDistance
was altered substantially to allow cutom definition of update indices.Semi-automatic summary statistics:
sumstat
submodule for generic mappings (id, trafos), and especially aPredictorSumstat
summary statistic that can make use ofPredictor
objects.predictor
submodule with genericPredictor
class and concrete implementations including linear regression, Lasso, Gaussian Process, Neural Network.InfoWeightedPNormDistance
that allows using predictor models to weight data not only by scale, but also by information content.Changes to internal sample weighting:
Changes to internal object instruction from samples:
Visualization: