Feature Request: customizable early_stopping_tolerance #2526

Closed
kryptonite0 opened this issue Oct 24, 2019 · 9 comments · Fixed by #4580

Comments

@kryptonite0

I have a situation where the default numerical tolerance (0.001) for early stopping is too large. My target has a gamma distribution, and the LGBMRegressor reaches convergence too early: the numerous low target values are well approximated by the model, but the few large values are still underestimated. When I deactivate early stopping, I can see the loss metric still improving at the 4th decimal digit or beyond, past the best iteration reached during early stopping. It would be great to be able to set the tolerance manually.
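
A minimal sketch of the kind of setup described (assuming the standard lightgbm Python API; the data, distribution parameters, and round counts are illustrative):

import lightgbm as lgb
import numpy as np
from sklearn.model_selection import train_test_split

# Illustrative data: a gamma-distributed target with many small values
# and a few large ones.
rng = np.random.default_rng(42)
X = rng.normal(size=(10_000, 10))
y = rng.gamma(shape=0.5, scale=2.0, size=10_000)
X_train, X_valid, y_train, y_valid = train_test_split(X, y, random_state=42)

model = lgb.LGBMRegressor(objective='gamma', n_estimators=1000)
model.fit(
    X_train, y_train,
    eval_set=[(X_valid, y_valid)],
    eval_metric='gamma',
    # the request: a way to configure the tolerance this callback applies
    callbacks=[lgb.early_stopping(stopping_rounds=50)],
)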

@StrikerRUS
Collaborator

@kryptonite0 Can you please provide a reproducible example, or at least training logs?

As far as I know, we do not have any default numerical tolerance. Refer to:

if best_score_list[i] is None or cmp_op[i](score, best_score[i]):

https://docs.python.org/3/library/operator.html#operator.lt
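
To illustrate the point (the scores below are illustrative): because cmp_op is a bare operator with no tolerance term, any improvement, no matter how small, registers as a new best iteration.

import operator

# For a metric where lower is better (e.g. binary_logloss),
# cmp_op is operator.lt; there is no tolerance term in the comparison.
cmp_op = operator.lt

best_score = 0.616403
score = 0.616402  # improves only in the 6th decimal place

print(cmp_op(score, best_score))  # True -> counted as a new best iteration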

Linking dmlc/xgboost#4982 here.

@StrikerRUS
Collaborator

Ping @kryptonite0

As you can see from examples, there is no "default numerical tolerance (0.001)":

gbm = lgb.train(params,
                lgb_train,
                num_boost_round=10,
                valid_sets=lgb_train,  # eval training data
                feature_name=feature_name,
                categorical_feature=[21])
print('Finished first 10 rounds...')

[1]	training's binary_logloss: 0.680894
[2]	training's binary_logloss: 0.672151
[3]	training's binary_logloss: 0.664753
[4]	training's binary_logloss: 0.656185
[5]	training's binary_logloss: 0.648174
[6]	training's binary_logloss: 0.641671
[7]	training's binary_logloss: 0.635597
[8]	training's binary_logloss: 0.628874
[9]	training's binary_logloss: 0.622432
[10]	training's binary_logloss: 0.616403
Finished first 10 rounds...

@StrikerRUS
Collaborator

At present we do not have any "default numerical tolerance", but having a customizable early stopping tolerance might be useful in some cases.

@StrikerRUS
Collaborator

Closed in favor of #2302; we decided to keep all feature requests in one place.

Contributions implementing this feature are welcome! Please re-open this issue (or post a comment if you are not the topic starter) if you are actively working on it.

@jmoralez
Collaborator

Hi. I'm working on this and will make a PR soon.

@jmoralez jmoralez reopened this Aug 27, 2021
@StrikerRUS
Collaborator

Corresponding XGBoost experience:
Issue: dmlc/xgboost#4982
PR: dmlc/xgboost#6942
Further improvement: dmlc/xgboost#7137

@jmoralez
Collaborator

Thanks for that. I believe my approach is the same as dmlc/xgboost#7137; basically, the change I made was replacing the operator.gt comparison in callback.py (the cmp_op shown above) with:

def _gt_threshold(curr_score, best_score, threshold):
    return curr_score > best_score + threshold

and the opposite for the minimize case (curr_score < best_score - threshold). I named the parameter early_stopping_threshold in train and cv, and still had to modify sklearn.py and the docs.
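
Putting that together, a self-contained sketch of the comparison pair (make_cmp_op and its wiring are illustrative, not the PR's exact code):

import operator
from functools import partial

def _gt_threshold(curr_score, best_score, threshold):
    # maximize case: only improvements larger than the threshold count
    return curr_score > best_score + threshold

def _lt_threshold(curr_score, best_score, threshold):
    # minimize case: the opposite comparison
    return curr_score < best_score - threshold

def make_cmp_op(higher_better, threshold=0.0):
    # hypothetical selection logic, mirroring how cmp_op is chosen in callback.py
    if threshold > 0:
        return partial(_gt_threshold if higher_better else _lt_threshold,
                       threshold=threshold)
    return operator.gt if higher_better else operator.lt

cmp_op = make_cmp_op(higher_better=False, threshold=1e-3)
print(cmp_op(0.6163, 0.6164))  # False: a 1e-4 improvement is below the threshold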

StrikerRUS added a commit that referenced this issue Nov 10, 2021
* initial changes

* initial version

* better handling of cases

* warn only with positive threshold

* remove early_stopping_threshold from high-level functions

* remove remaining early_stopping_threshold

* update test to use callback

* better handling of cases

* rename threshold to min_delta

enhance parameter description

update tests

* Apply suggestions from code review

Co-authored-by: Nikita Titov <[email protected]>

* reduce num_boost_round in tests

* Apply suggestions from code review

Co-authored-by: Nikita Titov <[email protected]>

* trigger ci

Co-authored-by: Nikita Titov <[email protected]>
@StrikerRUS
Collaborator

#4580 implemented this feature request for the Python package. Thank you very much, @jmoralez!
I think we should reuse this issue as a feature request for adding the same functionality to the R package and the core C++ code, so as not to split the discussion. Refer to #4580 (comment). For that reason, I'm not excluding this issue from #2302 for now.
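
For reference, the merged feature is exposed through the early_stopping callback's min_delta parameter (per the commit above that renamed threshold to min_delta). A minimal usage sketch, assuming a lightgbm version that includes #4580 and illustrative data:

import lightgbm as lgb
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(1_000, 5))
y = rng.gamma(shape=0.5, scale=2.0, size=1_000)

train_set = lgb.Dataset(X[:800], y[:800])
valid_set = lgb.Dataset(X[800:], y[800:], reference=train_set)

# min_delta: an iteration only counts as an improvement if the metric
# beats the current best by more than this margin.
booster = lgb.train(
    {'objective': 'gamma', 'metric': 'gamma', 'verbosity': -1},
    train_set,
    num_boost_round=1000,
    valid_sets=[valid_set],
    callbacks=[lgb.early_stopping(stopping_rounds=50, min_delta=1e-4)],
)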

@github-actions

This issue has been automatically locked because there has not been any recent activity since it was closed.
To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this one.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Aug 16, 2023