[python-package] add more hints in sklearn.py #5460

jameslamb · 2022-09-01T05:08:28Z

Contributes to #3756.

Adds some more type hints to places in sklearn.py that are missing them.

Notes for Reviewers

I intentionally avoided hints on anything "array-like", since those will require some extra research and should be handled separately (#3756 (comment)).

python-package/lightgbm/sklearn.py

…to more-sklearn-hints

jmoralez · 2022-09-08T18:14:24Z

python-package/lightgbm/sklearn.py

-        eval_metric=None,
-        eval_at=(1, 2, 3, 4, 5),
+        eval_metric: Optional[_LGBM_ScikitEvalMetricType] = None,
+        eval_at: Iterable[int] = (1, 2, 3, 4, 5),


I think this should be Sequence[int], given that for example range(5) is an iterable but is not a valid type for this.

🤩 Excellent point. I was just following the docstring and didn't consider this. Thanks very much for noting it!!!

I tried the following from the root of the repo, just to see what would happen:

sample code (click me)

from pathlib import Path import numpy as np import lightgbm as lgb from sklearn.datasets import load_svmlight_file rank_example_dir = Path('examples/lambdarank') X_train, y_train = load_svmlight_file(str(rank_example_dir / 'rank.train')) X_test, y_test = load_svmlight_file(str(rank_example_dir / 'rank.test')) q_train = np.loadtxt(str(rank_example_dir / 'rank.train.query')) q_test = np.loadtxt(str(rank_example_dir / 'rank.test.query')) gbm = lgb.LGBMRanker(n_estimators=10) gbm.fit( X_train, y_train, group=q_train, eval_set=[(X_test, y_test)], eval_group=[q_test], eval_at=range(3), callbacks=[ lgb.early_stopping(10), lgb.reset_parameter(learning_rate=lambda x: max(0.01, 0.1 - 0.01 * x)) ] )

And you're right...passing a range for eval_at causes a failure when serializing the parameters to string to pass them through the C API functions.

File ".../site-packages/lightgbm/basic.py", line 326, in param_dict_to_str raise TypeError(f'Unknown type of parameter:{key}, got:{type(val).__name__}') TypeError: Unknown type of parameter:eval_at, got:range

Given that, I think the hint here should be even stricter than typing.Sequence. Since this keyword argument is passed directly through to params and there's no other code in LightGBM manipulating its value, I think it can only accept values that are valid for lightgbm.basic.param_dict_to_str().

For eval_at, I think that means only a list of ints or tuple of ints is valid. param_dict_to_str() supports list, tuple, and set, but set isn't appropriate for eval_at because sets aren't iterable (e.g. don't have any ordering).

LightGBM/python-package/lightgbm/basic.py

Line 323 in 3d4e08e

if isinstance(val, (list, tuple, set)) or is_numpy_1d_array(val):

I just pushed 81c234f which:

sets the hint for eval_at to Union[List[int], Tuple[int]]

replaces use of the word "iterable" in the relevant docstrings with "list or tuple of int"

@jmoralez I won't merge this until you have a chance to respond, since what I did here is slightly different than what you suggested.

StrikerRUS

LGTM, thanks!

python-package/lightgbm/dask.py

Co-authored-by: Nikita Titov <[email protected]>

jmoralez

Thanks!

github-actions · 2023-08-19T03:22:22Z

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

[python-package] add more hints in sklearn.py

a637354

jameslamb added the maintenance label Sep 1, 2022

jameslamb requested review from StrikerRUS, shiyu1994 and jmoralez as code owners September 1, 2022 05:08

jameslamb commented Sep 1, 2022

View reviewed changes

python-package/lightgbm/sklearn.py Outdated Show resolved Hide resolved

Update python-package/lightgbm/sklearn.py

79ccfe3

StrikerRUS reviewed Sep 4, 2022

View reviewed changes

python-package/lightgbm/sklearn.py Show resolved Hide resolved

jameslamb added 3 commits September 4, 2022 22:55

Merge branch 'master' into more-sklearn-hints

b7c8999

use _LGBM_ScikitEvalMetricType in dask.py

347264d

Merge branch 'more-sklearn-hints' of github.com:microsoft/LightGBM in…

d9b2ca6

…to more-sklearn-hints

jameslamb added the awaiting review label Sep 8, 2022

jmoralez reviewed Sep 8, 2022

View reviewed changes

jameslamb added 2 commits September 9, 2022 22:51

Merge branch 'master' into more-sklearn-hints

248c4d6

fix eval_at hint and docstring

81c234f

jameslamb requested review from StrikerRUS and jmoralez September 10, 2022 16:33

StrikerRUS approved these changes Sep 11, 2022

View reviewed changes

python-package/lightgbm/dask.py Outdated Show resolved Hide resolved

Update python-package/lightgbm/dask.py

eee7e75

Co-authored-by: Nikita Titov <[email protected]>

jameslamb removed the awaiting review label Sep 12, 2022

jmoralez approved these changes Sep 12, 2022

View reviewed changes

jameslamb merged commit c3cf335 into master Sep 12, 2022

jameslamb deleted the more-sklearn-hints branch September 12, 2022 15:13

jameslamb mentioned this pull request Oct 7, 2022

[DO NOT MERGE] Release v3.3.3 #5525

Closed

40 tasks

github-actions bot locked as resolved and limited conversation to collaborators Aug 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[python-package] add more hints in sklearn.py #5460

[python-package] add more hints in sklearn.py #5460

jameslamb commented Sep 1, 2022

jmoralez Sep 8, 2022

jameslamb Sep 10, 2022 •

edited

Loading

jameslamb Sep 12, 2022

StrikerRUS left a comment

jmoralez left a comment

github-actions bot commented Aug 19, 2023

[python-package] add more hints in sklearn.py #5460

[python-package] add more hints in sklearn.py #5460

Conversation

jameslamb commented Sep 1, 2022

Notes for Reviewers

jmoralez Sep 8, 2022

Choose a reason for hiding this comment

jameslamb Sep 10, 2022 • edited Loading

Choose a reason for hiding this comment

jameslamb Sep 12, 2022

Choose a reason for hiding this comment

StrikerRUS left a comment

Choose a reason for hiding this comment

jmoralez left a comment

Choose a reason for hiding this comment

github-actions bot commented Aug 19, 2023

jameslamb Sep 10, 2022 •

edited

Loading