
[ML] Prefer smaller models with similar performance #1516

Merged 9 commits into elastic:master from valeriy42:nudge-model-size on Oct 13, 2020

Conversation

valeriy42 (Contributor) commented Sep 30, 2020

For regression and classification, during hyperparameter optimization we prefer smaller models if the loss functions are otherwise comparable.

To this end, we add `0.01 * (forest number of nodes) * E[GP] / (average forest number of nodes)` to the loss as an additional penalty.
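The penalty above can be sketched as follows. This is an illustrative Python sketch only (the actual implementation lives in ml-cpp and is written in C++); the function and parameter names are hypothetical, not the real identifiers.

```python
# Hypothetical sketch of the model-size penalty described in this PR.
# E[GP] is the expected value of the Gaussian process model of the loss
# at the candidate hyperparameters; scaling the penalty by it keeps the
# nudge proportional to the loss magnitude.

RELATIVE_SIZE_PENALTY = 0.01  # the 0.01 factor from the PR description

def penalized_loss(expected_gp: float,
                   forest_num_nodes: float,
                   mean_forest_num_nodes: float) -> float:
    """Return the loss plus a small penalty proportional to relative
    forest size, so that between candidates with otherwise comparable
    loss the smaller model is preferred."""
    size_penalty = (RELATIVE_SIZE_PENALTY
                    * forest_num_nodes / mean_forest_num_nodes
                    * expected_gp)
    return expected_gp + size_penalty
```

With two candidates at equal expected loss, the one whose forest has fewer nodes receives the smaller penalized loss and so wins the comparison.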

@valeriy42 valeriy42 changed the title [WIP][ML] Prefer smaller models with similar performance [ML] Prefer smaller models with similar performance Oct 13, 2020
@valeriy42 valeriy42 added v7.11.0 and removed WIP labels Oct 13, 2020
valeriy42 (Contributor, Author) commented:

@wwang500 just a heads-up: in some cases this may lead to a regression of the results by less than 1% (in terms of MSE, MSLE, or Huber loss), or by even less for classification.

tveasey (Contributor) left a comment:

LGTM

@valeriy42 valeriy42 merged commit bd14f42 into elastic:master Oct 13, 2020
@valeriy42 valeriy42 deleted the nudge-model-size branch October 13, 2020 14:16
valeriy42 added a commit to valeriy42/ml-cpp that referenced this pull request Oct 13, 2020
valeriy42 added a commit that referenced this pull request Oct 14, 2020

Backport of #1516.