
[Bug] LightGBMError: bin size 257 cannot run on GPU #3339

Closed
Tracked by #5153
rohan-gt opened this issue Aug 28, 2020 · 14 comments · Fixed by #6019
@rohan-gt

rohan-gt commented Aug 28, 2020

I'm getting the following error while running the latest LightGBM GPU build with these params:

params = {
    'device_type': 'gpu',
    'gpu_device_id': 0,
    'gpu_platform_id': 0,
    'gpu_use_dp': 'false',
    'max_bin': 255
}

on Google Colab using this Kaggle dataset: https://www.kaggle.com/c/ieee-fraud-detection. I'm dropping all the categorical variables:

LightGBMError: bin size 257 cannot run on GPU

@hengzhe-zhang

I have the same problem. Is there any way to solve it?

@guolinke
Collaborator

Sorry for missing this issue.
max_bin actually cannot limit the number of bins for categorical features.
There are two workarounds:

  1. Use categorical encodings to convert categorical features into numerical ones.
  2. Split one categorical feature into multiple categorical features, making sure the number of categories in each split feature is smaller than 256.
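The two workarounds above can be sketched in pandas (the DataFrame and column names here are hypothetical, not from the reported dataset):

```python
import pandas as pd

# Hypothetical high-cardinality categorical column (600 distinct values,
# more than the 255 bins a GPU feature group can hold).
df = pd.DataFrame({"city": [f"city_{i}" for i in range(600)]})

# Workaround 1: encode the categorical feature as a single numeric column.
df["city_code"] = df["city"].astype("category").cat.codes

# Workaround 2: split one high-cardinality categorical feature into
# several categorical columns, each with fewer than 256 distinct values.
codes = df["city"].astype("category").cat.codes
df["city_low"] = (codes % 255).astype("category")    # at most 255 categories
df["city_high"] = (codes // 255).astype("category")  # remaining "pages"
```

The original code is recoverable as `city_high * 255 + city_low`, so the split loses no information, though the model now has to combine two splits to isolate one category.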

@pseudotensor

pseudotensor commented Mar 16, 2021

I hit this randomly, for no apparent reason, with categorical_features explicitly set to empty, so it has nothing to do with that. The test that hit this had passed 1000 times before.

File "/opt/h2oai/dai/cuda-10.0/lib/python3.6/site-packages/lightgbm_gpu/sklearn.py", line 794, in fit
    categorical_feature=categorical_feature, callbacks=callbacks, init_model=init_model)
  File "/opt/h2oai/dai/cuda-10.0/lib/python3.6/site-packages/lightgbm_gpu/sklearn.py", line 637, in fit
    callbacks=callbacks, init_model=init_model)
  File "/opt/h2oai/dai/cuda-10.0/lib/python3.6/site-packages/lightgbm_gpu/engine.py", line 230, in train
    booster = Booster(params=params, train_set=train_set)
  File "/opt/h2oai/dai/cuda-10.0/lib/python3.6/site-packages/lightgbm_gpu/basic.py", line 2104, in __init__
    ctypes.byref(self.handle)))
  File "/opt/h2oai/dai/cuda-10.0/lib/python3.6/site-packages/lightgbm_gpu/basic.py", line 52, in _safe_call
    raise LightGBMError(_LIB.LGBM_GetLastError().decode('utf-8'))
lightgbm.basic.LightGBMError: bin size 257 cannot run on GPU

The number of bins was 255 and no categorical features were explicitly chosen.

@George3d6

The same issue happened for me:

  File "/usr/local/lib/python3.6/dist-packages/lightgbm/engine.py", line 228, in train
    booster = Booster(params=params, train_set=train_set)
  File "/usr/local/lib/python3.6/dist-packages/lightgbm/basic.py", line 2237, in __init__
    ctypes.byref(self.handle)))
  File "/usr/local/lib/python3.6/dist-packages/lightgbm/basic.py", line 110, in _safe_call
    raise LightGBMError(_LIB.LGBM_GetLastError().decode('utf-8'))
lightgbm.basic.LightGBMError: bin size 257 cannot run on GPU

@George3d6

Some of the values are categorical in my case, but there are nowhere near 257 distinct ones; combined with @pseudotensor's comment, I assume this is something else.

@lewis-morris

I am also getting the same error.

lightgbm.basic.LightGBMError: bin size 257 cannot run on GPU

@MAxx8371

MAxx8371 commented Feb 14, 2022

What causes this error? Is it that the bin size of a categorical feature is bigger than max_bin, or that there is not enough memory? The model works on CPU. Thank you!

@jameslamb jameslamb mentioned this issue Apr 14, 2022
@jiluojiluo

lightgbm.basic.LightGBMError: bin size 407 cannot run on GPU

@jiluojiluo

lightgbm.basic.LightGBMError: bin size 407 cannot run on GPU

This is a bug in LightGBM running on GPU; on CPU it works fine. So LightGBM on GPU needs improvement.
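Until a fix lands, one pragmatic pattern for the "works on CPU, fails on GPU" situation described above is to retry on CPU when this specific error is raised. A minimal sketch (`train_fn` is a hypothetical placeholder standing in for a call such as `lightgbm.train`, not a LightGBM API):

```python
def train_with_cpu_fallback(train_fn, params):
    """Try GPU training first; retry on CPU if the GPU bin-size limit is hit."""
    try:
        return train_fn({**params, "device_type": "gpu"})
    except Exception as err:
        # Only fall back for the specific error reported in this thread.
        if "cannot run on GPU" not in str(err):
            raise
        # Same parameters, CPU device: this configuration is reported to work.
        return train_fn({**params, "device_type": "cpu"})
```

With LightGBM this could be used as `train_with_cpu_fallback(lambda p: lgb.train(p, dtrain), base_params)`, at the cost of repeating dataset construction on the fallback path.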

@ChiHangChen

Same error encountered, any update?

@aforadi

aforadi commented Aug 4, 2022

Same here.

lightgbm.basic.LightGBMError: bin size 670 cannot run on GPU

@CVPaul
Contributor

CVPaul commented Aug 4, 2023

It seems that I've identified the cause of the error: the way num_total_bin is calculated during Exclusive Feature Bundling

(bin_mappers[fidx]->GetDefaultBin() == 0 ? -1 : 0);

doesn't align completely with the way num_total_bin is calculated during the creation of a FeatureGroup:

if (bin_mappers_[i]->GetMostFreqBin() == 0) {

As a result, the max_bin_per_group limit (=256) is enforced during bundling, but not when creating the FeatureGroup. When I replaced GetDefaultBin() at dataset.cpp#L134 with GetMostFreqBin(), the issue was resolved. I tested with the case reported here: #4082

@XQ-UT

XQ-UT commented Nov 28, 2023

Same issue here. Can we prioritize the fix?

shiyu1994 added a commit that referenced this issue Feb 20, 2024
* solve 'bin size 257 cannot run on GPU #3339'

#3339 (comment)

* fix typo LeafIndex -> leaf_index

---------

Co-authored-by: shiyu1994 <[email protected]>
Co-authored-by: James Lamb <[email protected]>
@damvantai

damvantai commented May 8, 2024

I also encountered the same error, in fold 3 of 5, when my categorical feature had many NaN values. The feature contained a mix of np.nan and integer values, and I had used LabelEncoder to convert the categorical features, which pushed the bin count past max_bin.

But after I used
df_train[cat_cols] = df_train[cat_cols].astype(str)
df_train[cat_cols] = df_train[cat_cols].astype("category")
the training worked fine.

Alternatively, you can set
"min_data_in_bin": 256, or higher
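The cast-to-category workaround above can be reproduced on a small frame (the DataFrame and cat_cols here are hypothetical; the loop is equivalent to the two DataFrame-level casts shown in the comment):

```python
import numpy as np
import pandas as pd

# Hypothetical training frame with a NaN-heavy categorical column.
df_train = pd.DataFrame({"cat_a": ["x", np.nan, "y", np.nan, "x"],
                         "num_b": [1.0, 2.0, 3.0, 4.0, 5.0]})
cat_cols = ["cat_a"]

# Cast to string first (NaN becomes the literal string "nan"), then to
# the pandas "category" dtype before handing the frame to LightGBM.
for col in cat_cols:
    df_train[col] = df_train[col].astype(str).astype("category")
```

Note that after `astype(str)` the missing values become an ordinary "nan" category rather than true missing values, which is likely why it sidesteps the mixed-type binning problem described above.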
