REF/ENH: add parameter routing #67
Conversation
Codecov Report

```
@@            Coverage Diff             @@
##           master      #67      +/-   ##
==========================================
- Coverage   99.76%   99.56%   -0.20%
==========================================
  Files           3        3
  Lines         432      465      +33
==========================================
+ Hits          431      463      +32
- Misses          1        2       +1
```

Continue to review the full report at Codecov.
scikeras/wrappers.py
Outdated
```python
_fit_params = {
    # parameters destined for keras.Model.fit
    "callbacks",
    "batch_size",
    "epochs",
    "verbose",
    "validation_split",
    "shuffle",
    "class_weight",
    "sample_weight",
    "initial_epoch",
    "validation_steps",
    "validation_batch_size",
    "validation_freq",
}

_predict_params = {
    # parameters destined for keras.Model.predict
    "batch_size",
    "verbose",
    "callbacks",
    "steps",
}

_compile_params = {
    # parameters destined for keras.Model.compile
    "optimizer",
    "loss",
    "metrics",
    "loss_weights",
    "weighted_metrics",
    "run_eagerly",
}

_wrapper_params = {
    # parameters consumed by the wrappers themselves
    "warm_start",
    "random_state",
}
```
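As a hedged illustration (not necessarily how this PR wires things up), these destination sets could be used to split user-supplied keyword arguments, for example:

```python
# Illustrative only: split user kwargs by destination using the sets above.
user_kwargs = {"epochs": 10, "batch_size": 32, "random_state": 0}

fit_kwargs = {k: v for k, v in user_kwargs.items() if k in _fit_params}
wrapper_kwargs = {k: v for k, v in user_kwargs.items() if k in _wrapper_params}

assert fit_kwargs == {"epochs": 10, "batch_size": 32}
assert wrapper_kwargs == {"random_state": 0}
```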
There's some testing to make sure that these are a subset of `keras.Model`'s parameters. In the future, once we've figured out the usage of `class_weight` and others, we should add tests that also check that the default initializer for the wrappers accepts all of these (except for `sample_weight` and such).
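For illustration, here is a minimal sketch of the kind of subset test described above, assuming a recent TF 2.x; the test name and layout are assumptions, not the repository's actual test:

```python
import inspect

from tensorflow import keras


def test_fit_params_subset_of_keras_fit():
    # Every name routed to Model.fit should be a real keyword of Model.fit.
    # _fit_params as defined in the diff above.
    fit_sig = inspect.signature(keras.Model.fit)
    assert _fit_params <= set(fit_sig.parameters)
```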
I think this PR is ready for review now. I'll try to provide a review in the next couple of days. I would say this PR also closes #18 again.
Generally, this review looks good. I'll be glad to see this PR merged: I think it'll enable easier usage.

Here's a first pass at some comments. I am having a hard time reviewing this PR because of its length. Could you add some examples of the usage, either in the documentation or in a comment? For me, writing documentation typically leads to API improvements.

I have a couple of questions:

- What if I pass `optimizer__momentum=0.9` and `optimizer="sgd"`? Right now, it looks like I'll have to create the Keras optimizer myself and pass that to `model.compile` (see the sketch after this comment). If so, I think work for a future PR is creating the optimizer in `BaseWrapper` and passing that to the `optimizer` key.
- Is this PR backwards compatible for basic usage? It doesn't look like the tests changed much for the basic uses.
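For concreteness, a hedged sketch of the workaround the first question refers to: constructing the optimizer by hand inside the build function rather than routing `optimizer__*` parameters. The function and parameter names here are illustrative, not this PR's API:

```python
from tensorflow import keras


def build_fn(n_features=20, momentum=0.9):  # hypothetical build function
    model = keras.Sequential(
        [keras.layers.Dense(1, activation="sigmoid", input_shape=(n_features,))]
    )
    # The optimizer is created manually instead of being routed via optimizer__momentum.
    sgd = keras.optimizers.SGD(learning_rate=0.01, momentum=momentum)
    model.compile(optimizer=sgd, loss="binary_crossentropy")
    return model
```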
Thank you for the review!

It's not a bad idea to start working on the docs along with this. I'm just afraid that it will make a large PR even larger. Would examples from the tests help? I can cherry-pick some.

I agree, but I think that would come after #66.

Sort of. It breaks if your
@stsievert I addressed several of the comments, leaving two open so you have a chance to look at those again. Do you still want a couple of examples for this API, or do you want to write docs as part of this PR?
That'd be great. I think that'd allow me to more easily review this PR, especially if the examples are in

What would you think of using

That actually might be an indicator that Skorch has the same interface with their
Agreed, I think this should be one of the "requirements" for this project.
Sure, more than welcome! For this issue in particular, if we want to keep the original API, I would like to understand the implications of keeping only that API. I don't want two ways to do things unless there is a clear use case where one of them works and one of them doesn't. The only issue I can think of with keeping only the original API (not using
The workaround I have in mind involves setting a (hidden) parameter at initialization to keep track of the parameters that should be routed to the model-building function:

```python
class BaseWrapper:
    def __init__(self, model=None, ..., **keras_params):
        self.model = model
        ...
        vars(self).update(**keras_params)
        self._keras_params = set(keras_params)

    def _model_params(self):
        return {
            k[len("model__"):]: v
            for k, v in self.get_params().items()
            if k[:len("model__")] == "model__" or k in self._keras_params
        }
```
Why would this usage fail with the above implementation?

```python
def model_build_fn(hidden=10):
    ...
    return model


class CustomKerasRegressor(KerasRegressor):
    def __init__(self, new_param=1, **kwargs):
        self.new_param = new_param
        super().__init__(**kwargs)

    ...  # use new_param in fit/score/etc.
```

With this, I think both of these usages would work:

```python
est = CustomKerasRegressor(model=model_build_fn, hidden=20)
est.fit(X, y).score(X, y)  # say X, y defined somewhere

est2 = CustomKerasRegressor(model=model_build_fn, model__hidden=30)
est2.fit(X, y).score(X, y)
```
Funny enough, that's how the original implementation worked (sort of, it just stored

One minor improvement: if there are no kwargs, don't set the attribute, which allows this test to pass.

```python
class BaseWrapper:
    def __init__(self, model=None, ..., **kwargs):
        self.model = model
        ...
        vars(self).update(**kwargs)
        if kwargs:
            self._init_kwargs = set(kwargs)

    def _model_params(self):
        return {
            k[len("model__"):]: v
            for k, v in self.get_params().items()
            if k[:len("model__")] == "model__"
            or k in getattr(self, "_init_kwargs", set())
        }
```

I think with this + introspecting into the parameters of `model_build_fn` we should be good.
```python
params = {"model__foo": object()}
destination = "model"
pass_filter = set()
out = route_params(params, destination, pass_filter)
assert out["foo"] is params["model__foo"]
```
@stsievert added your test here. We should probably add some more checks here.
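For context, a minimal sketch of a routing helper consistent with this test; the semantics assumed here (strip the `destination__` prefix, pass through un-prefixed names listed in `pass_filter`) are an illustration, not necessarily SciKeras' actual implementation:

```python
from typing import Any, Dict, Iterable


def route_params(
    params: Dict[str, Any], destination: str, pass_filter: Iterable[str]
) -> Dict[str, Any]:
    """Select the parameters destined for `destination`."""
    prefix = destination + "__"
    routed = {}
    for key, value in params.items():
        if key.startswith(prefix):
            # "model__foo" routed to "model" becomes "foo".
            routed[key[len(prefix):]] = value
        elif key in pass_filter:
            # Un-prefixed names are passed through only if explicitly allowed.
            routed[key] = value
    return routed
```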
> I think with this

By "this" you mean "keeping backwards compatibility with the Keras API and allowing the prefix `model__`"?

> introspecting into the parameters of `model_build_fn` we should be good.

I think introspection should only be performed for three parameters: `meta`, `params` and `compile_kwargs`.

> Do you think it will be clear how `model_build_fn` can "request" arguments from the set `self._init_kwargs`?

I think only `meta`, `params` and `compile_kwargs` should be able to be "requested." I think the documentation surrounding that is very clear:

> Arguments to `model_build_fn` include any parameter with a `model__` prefix and parameters provided at initialization. In addition, if `model_build_fn` accepts keyword arguments for `meta`, `params` or `compile_kwargs`, the relevant dictionaries will be provided, as described below: [list].

That means I think this code should raise a `ValueError` because `model_build_fn` does not accept an argument `bar`:

```python
def build(foo=42):
    return _get_model(foo)


BaseWrapper(model=build, model__bar=42)  # fails; bar not a valid kwarg to `build`
BaseWrapper(model=build, bar=42)  # fails; bar not a valid kwarg to `build`
```
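To make the proposed check concrete, a hedged sketch of how such validation could be done with `inspect`; the helper name and error message are illustrative assumptions, not this PR's code:

```python
import inspect
from typing import Any, Callable, Dict


def check_build_fn_kwargs(build_fn: Callable, routed: Dict[str, Any]) -> None:
    """Raise ValueError if `routed` contains names `build_fn` cannot accept."""
    sig = inspect.signature(build_fn)
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in sig.parameters.values()):
        return  # a **kwargs parameter accepts anything
    unknown = set(routed) - set(sig.parameters)
    if unknown:
        raise ValueError(
            f"{sorted(unknown)} are not valid keyword arguments to {build_fn.__name__}"
        )


def build(foo=42):
    ...  # would build and return a compiled Keras model


check_build_fn_kwargs(build, {"foo": 42})  # OK
check_build_fn_kwargs(build, {"bar": 42})  # raises ValueError
```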
Yes.

Regarding the rest: I originally thought that the Keras API allowed this:

```python
from tensorflow.keras.wrappers.scikit_learn import KerasClassifier


def build_fn(param1=0):
    ...


clf = KerasClassifier(build_fn=build_fn, param1=1, param2=3)
clf.fit(...)
```

But upon testing, it actually does raise an error:

```python
from tensorflow.keras.wrappers.scikit_learn import KerasClassifier


def build_fn(param1=0):
    ...


clf = KerasClassifier(build_fn=build_fn, param1=1, param2=3)  # error
clf.fit(...)
```

Incidentally, it raises an error from

This confirms what I guess you were assuming or knew, which is that raising an error is compatible with the old API. That being the case, I am 100% +1 for doing the same. Sorry for the confusion otherwise.
Thank you for this last round of review @stsievert. I added the two pending comments/items to the OP to keep things a bit organized.

I'll need at least one more review; I'll try to provide one this weekend.

Great, TY for your help thus far.

@stsievert are you able to take another look at this? Thanks.
Thanks for the ping; this slipped off my radar.

I expected the `optimizer` key in `compile_kwargs` to be the rendered optimizer. I expected this test to pass:
```python
from typing import Any, Dict

from sklearn.datasets import make_classification
from tensorflow.keras.layers import Dense, Input
from tensorflow.keras.models import Model
import tensorflow.keras.optimizers as opt

from scikeras.wrappers import KerasClassifier


def get_model(num_hidden=10, meta=None, compile_kwargs=None):
    inp = Input(shape=(meta["n_features_in_"],))
    hidden = Dense(num_hidden, activation="relu")(inp)
    out = [Dense(1, activation="sigmoid")(hidden)]
    model = Model(inp, out)
    assert not isinstance(compile_kwargs["optimizer"], str)
    model.compile(**compile_kwargs)
    return model


if __name__ == "__main__":
    est = KerasClassifier(
        model=get_model,
        model__num_hidden=20,
        optimizer=opt.SGD,
        optimizer__learning_rate=0.15,
        optimizer__momentum=0.5,
        loss="binary_crossentropy",
    )
    X, y = make_classification()
    est.fit(X, y)
```
That will be the work of #66; too much for this PR, in my opinion. That said, I did edit the tests to at least use the
Closes #50, closes #49, closes #37

Todo:
- `keras_expected_n_ouputs_` (REF/ENH: add parameter routing #67 (comment))