-
Notifications
You must be signed in to change notification settings - Fork 3.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CUDA] illegal memory access when using CUDA and large max_bin and large dataset #6512
Comments
我也有这个问题,我的训练数据大概有500m,环境配置按着官网的步骤来的。用gpu版本说我超出内存才换的cuda版,现在cuda版本报这个错,请问我该如何处理。 版本: 错误: [LightGBM] [Fatal] [CUDA] an illegal memory access was encountered /home/yuhr/桌面/LightGBM/src/io/cuda/cuda_tree.cpp 37 terminate called after throwing an instance of 'std::runtime_error' |
I'm running the examples |
Thanks for reporting this issue. With large max bin values, kernels constructing histogram using global memory may be used, which has not been tested heavily. I'm debugging this. |
Description
By using CUDA histogram of the master branch, the simple python code report memory error if it uses large max_bin size
Reproducible example
And it will report error:
Environment info
GPU: NVIDIA GeForce RTX 3060
Python: 3.12.4
LightGBM version or commit hash: master branch
LightGBM version or commit hash:
Command(s) you used to install LightGBM
Additional Comments
The text was updated successfully, but these errors were encountered: