[CUDA] Add quantile regression objective for new CUDA version #5605

shiyu1994 · 2022-11-25T16:06:03Z

Add quantile regression objective for new CUDA version.
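For readers unfamiliar with the objective, here is a minimal host-side sketch of the gradient and hessian of the quantile (pinball) loss. It is not the CUDA kernel added in this PR. The names `GradHess` and `QuantileGradients` are hypothetical, and using a constant hessian of 1 is an assumption that follows common gradient-boosting practice, since the true second derivative of the pinball loss is zero almost everywhere.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical container for per-sample gradients and hessians.
struct GradHess {
  std::vector<float> grad;
  std::vector<float> hess;
};

// Pinball-loss gradients for quantile level alpha in (0, 1).
// ASSUMPTION: hessian fixed to 1 (the true second derivative is zero).
GradHess QuantileGradients(const std::vector<float>& score,
                           const std::vector<float>& label,
                           float alpha) {
  GradHess out;
  out.grad.resize(score.size());
  out.hess.assign(score.size(), 1.0f);
  for (std::size_t i = 0; i < score.size(); ++i) {
    const float delta = score[i] - label[i];
    // Over-prediction is penalised with weight (1 - alpha),
    // under-prediction with weight alpha.
    out.grad[i] = (delta >= 0.0f) ? (1.0f - alpha) : -alpha;
  }
  return out;
}
```

With `alpha = 0.25f`, a prediction above its label gets gradient 0.75 and a prediction below its label gets gradient -0.25.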

shiyu1994 · 2023-03-16T03:38:46Z

It is weird that the following link errors arise with CUDA 10.0, even though this PR changes no code related to CUDAConstructHistogramSparseKernel or CUDAConstructHistogramDenseKernel:

nvlink error   : Entry function '_ZN8LightGBM33CUDAConstructHistogramDenseKernelIhfLm11120EEEvPKNS_20CUDALeafSplitsStructEPKfS5_PKT_PKjSA_PKii' uses too much shared data (0xcbfc bytes, 0xc000 max) (target: sm_60)
nvlink error   : Entry function '_ZN8LightGBM34CUDAConstructHistogramSparseKernelItmfLm11120EEEvPKNS_20CUDALeafSplitsStructEPKfS5_PKT_PKT0_SB_PKji' uses too much shared data (0xcbfc bytes, 0xc000 max) (target: sm_60)

Note that for CUDA 10.0, the master branch already uses a smaller shared memory size than the claimed maximum of 0xc000 bytes (which should allow DP_SHARED_HIST_SIZE = 6144) to avoid the error above.

LightGBM/include/LightGBM/cuda/cuda_row_data.hpp, lines 22 to 27 in 8811063:

    #if CUDART_VERSION == 10000
    #define DP_SHARED_HIST_SIZE (5560)
    #else
    #define DP_SHARED_HIST_SIZE (6144)
    #endif
    #define SP_SHARED_HIST_SIZE (DP_SHARED_HIST_SIZE * 2)

It is unclear to me why adding code in this PR requires reducing DP_SHARED_HIST_SIZE below 5560 for CUDA 10.0.

@guolinke Do you have any idea?
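The relation between DP_SHARED_HIST_SIZE and the 0xc000 limit can be checked with a little arithmetic. This is only a sketch: it assumes the shared histogram buffer holds SP_SHARED_HIST_SIZE entries of type float, which is what the `f` and `11120` template arguments in the mangled kernel names suggest. The helper name `SharedHistBytes` is hypothetical.

```cpp
#include <cstddef>

// Limit reported by nvlink for sm_60.
constexpr std::size_t kSharedLimitBytes = 0xc000;  // 49152 bytes

// Hypothetical helper: bytes taken by the shared histogram buffer.
// ASSUMPTION: SP_SHARED_HIST_SIZE = DP_SHARED_HIST_SIZE * 2 float entries.
constexpr std::size_t SharedHistBytes(std::size_t dp_shared_hist_size) {
  return dp_shared_hist_size * 2 * sizeof(float);
}

// With 4-byte floats, 6144 fills the 0xc000 budget exactly...
static_assert(SharedHistBytes(6144) == kSharedLimitBytes,
              "6144 should map to exactly 0xc000 bytes");
// ...while 5560 leaves 4672 bytes of headroom for other shared data.
static_assert(kSharedLimitBytes - SharedHistBytes(5560) == 4672,
              "5560 should leave 4672 bytes of headroom");
```

Under this assumption, the CUDA 10.0 value of 5560 only fails if something besides the histogram buffer consumes more than 4672 bytes of shared memory.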

guolinke · 2023-03-16T09:27:15Z

How many bytes were exceeded? It looks quite large. Are there any functions that possibly inherit or call other functions that use shared memory?

shiyu1994 · 2023-03-16T10:03:17Z

how many bytes were exceeded?

0xcbfc - 0xc000 = 0x0bfc = 3068 bytes

are there any functions that possibly inherit or call other functions with shared memory?

No. The code compiles with cuda 11.0 but not with cuda 10.0.
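The subtraction above can be double-checked at compile time. The sketch below also estimates how much shared data lies outside the histogram buffer. That second figure is an inference, not something stated in the error message: it assumes the buffer is 11120 float entries, as the template arguments in the mangled kernel names suggest. All constant names are hypothetical.

```cpp
#include <cstddef>

constexpr std::size_t kReportedBytes = 0xcbfc;  // usage reported by nvlink
constexpr std::size_t kLimitBytes = 0xc000;     // maximum reported by nvlink

// Bytes over the limit.
constexpr std::size_t kExcessBytes = kReportedBytes - kLimitBytes;
static_assert(kExcessBytes == 0x0bfc, "hex excess");
static_assert(kExcessBytes == 3068, "decimal excess");

// ASSUMPTION: the histogram buffer is 11120 float entries.
constexpr std::size_t kHistBytes = 11120 * sizeof(float);
// Whatever remains would be shared data from elsewhere in the kernel.
constexpr std::size_t kOtherBytes = kReportedBytes - kHistBytes;
```

With 4-byte floats, `kHistBytes` is 44480 and `kOtherBytes` is 7740. The thread does not identify where that extra shared data comes from.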

shiyu1994 · 2023-03-17T03:30:43Z

@guolinke Let me adjust the #define DP_SHARED_HIST_SIZE (5560) to a lower value so that the compilation can succeed. And let's drop the support for CUDA 10.0 in another PR.

shiyu1994 · 2023-03-17T04:39:39Z

It is unclear to me why adding code in this PR will have to make the DP_SHARED_HIST_SIZE = 5560 for cuda 10.0 smaller.

I just used binary search and found that the maximum allowed value is 5176.
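A binary search of this kind can be sketched as follows. In the real search the predicate was presumably "does the CUDA 10.0 build link with this value?". Here it is replaced by a hypothetical stand-in model, `FitsInSharedMemory`, which assumes float histogram entries and a fixed amount of other shared data inferred from the nvlink message. `LargestPassing` is likewise a hypothetical helper name.

```cpp
#include <cstddef>

constexpr std::size_t kLimitBytes = 0xc000;  // maximum reported by nvlink

// ASSUMPTION: shared data outside the histogram buffer, inferred as
// (reported usage) - (5560 * 2 float entries).
constexpr std::size_t kOtherSharedBytes = 0xcbfc - 5560 * 2 * sizeof(float);

// Stand-in for an actual compile-and-link attempt with CUDA 10.0.
constexpr bool FitsInSharedMemory(std::size_t dp_shared_hist_size) {
  return dp_shared_hist_size * 2 * sizeof(float) + kOtherSharedBytes <=
         kLimitBytes;
}

// Returns the largest value in [lo, hi) for which ok() holds.
// Requires ok(lo) == true, ok(hi) == false, and ok to be monotone.
template <typename Pred>
std::size_t LargestPassing(std::size_t lo, std::size_t hi, Pred ok) {
  while (hi - lo > 1) {
    const std::size_t mid = lo + (hi - lo) / 2;
    if (ok(mid)) {
      lo = mid;  // mid still fits, search upwards
    } else {
      hi = mid;  // mid is too large, search downwards
    }
  }
  return lo;
}
```

Under this model, `LargestPassing(0, 5560, FitsInSharedMemory)` evaluates to 5176, which agrees with the value found above.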

shiyu1994 · 2023-03-17T04:48:33Z

@guolinke This is ready for review.

github-actions · 2023-08-15T20:17:52Z

This pull request has been automatically locked since there has not been any recent activity since it was closed.
To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues
including a reference to this.

shiyu1994 added 30 commits August 16, 2022 03:42

add binary objective for cuda_exp

7eaf3d8

include <string> and <vector>

ec84dcc

exchange include ordering

a00f272

fix length of score to copy in evaluation

ce26a61

Merge branch 'master' into cuda/objective-binary

3429b68

fix EvalOneMetric

e91ec0b

fix cuda binary objective and prediction when boosting on gpu

a7641ca

Add white space

a6cc1f6

fix BoostFromScore for CUDABinaryLogloss

3b354ea

update log in test_register_logger

include <algorithm>

2ead53a

simplify shared memory buffer

a78b146

add (l2) regression objective for cuda_exp

6dc035a

Merge branch 'master' into cuda/objective-regression

b202478

fix lint errors

ac52e75

Merge branch 'cuda/objective-regression' of https://github.com/Microsoft/LightGBM into cuda/objective-regression

1708d42

add (l1) regression objective for cuda_exp

bd0010f

merge LightGBM/master

063d066

remove RenewTreeOutputCUDA from CUDARegressionL2loss

8a581ed

remove mutable and use CUDAVector

95898fd

remove white spaces

3a1af9c

remove TODO and document in (#5459)

1107e32

Merge remote-tracking branch 'origin/master' into cuda/objective-regression

0b5a63a

add huber regression for cuda_exp

67820d8

renew tree output on GPU

8e4fb0f

add test cases for regression objectives

remove useless changes

57f65a9

add white space

4bf0d15

Merge branch 'cuda/objective-regression' of https://github.com/Microsoft/LightGBM into cuda/objective-regression

2b7299a

change percentile in CUDARegressionL1loss::BoostFromScore to 0.5

713a391

add fair regression objective

502267a

remove useless changes in test_engine.py

2e93075

shiyu1994 requested a review from jameslamb as a code owner November 25, 2022 16:06

shiyu1994 added 4 commits December 7, 2022 16:29

merge master

2e986d5

remove white space

0b80042

merge master

3df130e

merge with origin/cuda/objective-regression-quantile

d0a7610

shiyu1994 changed the title from "[WIP] [CUDA] Add quantile regression objective for new CUDA version" to "[CUDA] Add quantile regression objective for new CUDA version" Dec 28, 2022

shiyu1994 added 8 commits December 28, 2022 06:26

resolve merge conflicts

c29e688

remove useless changes

73aaa79

remove useless changes

ab465ea

Merge remote-tracking branch 'origin/master' into cuda/objective-regression-quantile

8f489aa

enable cuda quantile regression objective

616b600

add a test case for quantile regression objective

c9443f7

remove useless changes

4ca25d0

remove useless changes

58bfd9b

shiyu1994 mentioned this pull request Mar 17, 2023

[RFC] Drop support for CUDA 10 #5789

Closed

reduce DP_SHARED_HIST_SIZE to 5176 for CUDA 10

e5c7af6

shiyu1994 added awaiting review and removed in progress labels Mar 17, 2023

guolinke approved these changes Mar 17, 2023

shiyu1994 merged commit ce0813e into master Mar 21, 2023

shiyu1994 deleted the cuda/objective-regression-quantile branch March 21, 2023 04:21

github-actions bot removed the awaiting review label Aug 15, 2023

github-actions bot locked as resolved and limited conversation to collaborators Aug 15, 2023
