
[gpu] Large dataset In LGBMRegressor Failed #4926

Closed
jiapengwen opened this issue Jan 5, 2022 · 4 comments · Fixed by #4928
@jiapengwen
Contributor

Description

Using the GPU version, training on a large dataset fails.

Reproducible example

import time

import numpy as np
import lightgbm as lgbm


if __name__ == '__main__':
    time1 = time.time()
    # X = np.random.random((4300000, 2200))  # works
    # y = np.random.random((4300000,))
    # X = np.random.random((7000000, 2200))  # works
    # y = np.random.random((7000000,))
    X = np.random.random((8000000, 2200))  # fails
    y = np.random.random((8000000,))
    time2 = time.time()
    print('construct data cost:', time2 - time1)
    print(X[2][2])
    print(y[2:10])

    time1 = time.time()
    model = lgbm.LGBMRegressor(
        device="gpu",
        n_estimators=1000,
        verbose=4,
        max_bin=16,
        tree_learner='serial',
        gpu_use_dp='false',
        n_jobs=1,
        max_depth=7,
        num_leaves=31,
        min_child_samples=17000,
    )
    model.fit(X, y, callbacks=[lgbm.log_evaluation()])
    time2 = time.time()
    print('gpu cost:', time2 - time1)


######################################
[LightGBM] [Warning] Accuracy may be bad since you didn't explicitly set num_leaves OR 2^max_depth > num_leaves. (num_leaves=31).
[LightGBM] [Info] This is the GPU trainer!!
[LightGBM] [Info] Total Bins 35200
[LightGBM] [Info] Number of data points in the train set: 8000000, number of used features: 2200
[LightGBM] [Info] Using GPU Device: Quadro RTX 6000, Vendor: NVIDIA Corporation
[LightGBM] [Info] Compiling OpenCL Kernel with 16 bins...
[LightGBM] [Info] GPU programs have been built
[LightGBM] [Info] Size of histogram bin entry: 8
terminate called after throwing an instance of 'boost::exception_detail::clone_impl<boost::exception_detail::error_info_injector<boost::compute::opencl_error> >'
  what():  Memory Object Allocation Failure

Environment info

LightGBM version or commit hash:

LightGBM version: 3.3.1

Command(s) you used to install LightGBM

pip3 install lightgbm --install=--gpu

Python version: 3.6.9

Additional Comments

My machine has 227 GB of memory.
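For rough context (my own back-of-envelope arithmetic, not taken from the thread): the raw float64 feature matrix in the failing case is about 131 GiB, which fits in 227 GB of host RAM, whereas the error above is raised from an OpenCL buffer allocation (`boost::compute::opencl_error`), i.e. on the GPU side. A minimal sketch of the size calculation:

```python
# Back-of-envelope memory estimate for the failing example above.
# Shapes are taken from the script: 8,000,000 rows x 2,200 float64 features.
rows, cols = 8_000_000, 2_200
bytes_per_float64 = 8

x_gib = rows * cols * bytes_per_float64 / 2**30  # feature matrix X
y_gib = rows * bytes_per_float64 / 2**30         # target vector y

print(f"X: {x_gib:.1f} GiB, y: {y_gib:.3f} GiB")
# X: 131.1 GiB, y: 0.060 GiB
```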

@jameslamb jameslamb changed the title Large dataset In LGBMRegressor Failed [gpu] Large dataset In LGBMRegressor Failed Jan 5, 2022
@jameslamb jameslamb added the bug label Jan 5, 2022
@jiapengwen
Contributor Author

jiapengwen commented Jan 6, 2022

I have fixed it and opened a PR; please check #4928.

@jiapengwen
Contributor Author

@jameslamb

@jameslamb
Collaborator

Closed by #4928.

@github-actions

This issue has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Aug 23, 2023