feat: Change locking strategy of Booster, allow for share and unique locks #2760

JoanFM · 2020-02-14T08:50:56Z

Changes introduced
I propose to have a different locking strategy that would allow to do singlePredictions in a parallel way.
I propose having shared lock for const methods and unique locks for non const that are potentially writing values.
I added const to the methods that do Prediction since they should not alter the Booster and should be able to access the mutex in shared mode.

I hope this can help.

… to upper and lower bound, move implementation to gdbt.cpp

Co-Authored-By: Nikita Titov <[email protected]>

…ython and a basic test

…ax_min_leafs

…eads

…to max_min_leafs

…ck to test with hardcoded value

…to max_min_leafs

…ax_min_leafs

…into read_write_locks

…to read_write_locks

imatiach-msft · 2020-07-14T14:52:31Z

@JoanFM agreed, I think it's ok to not use git submodules for now, especially if they are not updating their code frequently. However, long term I always try to reduce code duplication/forking since it increases maintenance costs. However, it sounds like this is not something we want to do right now.

JoanFM · 2020-07-16T05:55:14Z

Hello @StrikerRUS @guolinke, after solving a couple of conflicts I have issues again with the r-package.

guolinke · 2020-07-16T07:26:58Z

ping @jameslamb for R's CI.

StrikerRUS · 2020-07-16T12:23:50Z

Actually, all tests are failing, not just R ones.

I believe the problem is in conflicts after #2992 was merged. For example, see compilation errors:

[ 96%] Building CXX object CMakeFiles/_lightgbm.dir/src/c_api.cpp.o
/home/travis/build/microsoft/LightGBM/src/c_api.cpp:1982:86: error: too many
      arguments to function call, expected 6, have 7
  ...get_row_fun, fastConfig->config, out_result, out_len);
                                                  ^~~~~~~
/home/travis/build/microsoft/LightGBM/src/c_api.cpp:381:3: note: 
      'PredictSingleRow' declared here
  void PredictSingleRow(int predict_type, int ncol,
  ^
/home/travis/build/microsoft/LightGBM/src/c_api.cpp:2116:53: error: too many
      arguments to function call, expected 6, have 7
                                        out_result, out_len);
                                                    ^~~~~~~
/home/travis/build/microsoft/LightGBM/src/c_api.cpp:381:3: note: 
      'PredictSingleRow' declared here
  void PredictSingleRow(int predict_type, int ncol,
  ^
2 errors generated.
CMakeFiles/_lightgbm.dir/build.make:439: recipe for target 'CMakeFiles/_lightgbm.dir/src/c_api.cpp.o' failed

…ead_write_locks

JoanFM · 2020-07-16T12:33:18Z

Actually, all tests are failing, not just R ones.

I believe the problem is in conflicts after #2992 was merged. For example, see compilation errors:

[ 96%] Building CXX object CMakeFiles/_lightgbm.dir/src/c_api.cpp.o
/home/travis/build/microsoft/LightGBM/src/c_api.cpp:1982:86: error: too many
      arguments to function call, expected 6, have 7
  ...get_row_fun, fastConfig->config, out_result, out_len);
                                                  ^~~~~~~
/home/travis/build/microsoft/LightGBM/src/c_api.cpp:381:3: note: 
      'PredictSingleRow' declared here
  void PredictSingleRow(int predict_type, int ncol,
  ^
/home/travis/build/microsoft/LightGBM/src/c_api.cpp:2116:53: error: too many
      arguments to function call, expected 6, have 7
                                        out_result, out_len);
                                                    ^~~~~~~
/home/travis/build/microsoft/LightGBM/src/c_api.cpp:381:3: note: 
      'PredictSingleRow' declared here
  void PredictSingleRow(int predict_type, int ncol,
  ^
2 errors generated.
CMakeFiles/_lightgbm.dir/build.make:439: recipe for target 'CMakeFiles/_lightgbm.dir/src/c_api.cpp.o' failed

Thanks @StrikerRUS ,

Sorry, my bad, I did the conflict resolution on the browser and did not double check. I solved the issues now, but now there are some unused parameters, but I guess removing them would break a lot of the interfaces.

JoanFM · 2020-07-17T10:51:25Z

Hi @StrikerRUS @guolinke @imatiach-msft, is there any issue blocking the merge of this PR? Thanks

guolinke · 2020-07-17T13:22:53Z

Thanks @JoanFM so much, looks good to me. ping @StrikerRUS for the final review.

StrikerRUS · 2020-07-17T21:43:14Z

Everything looks good to me either! Actually, I gave my approval a few days ago: #2760 (review). Sorry for the confusion!

I just wonder, should new functions in C API from the recently merged #2992 be enhanced with new locks?

JoanFM · 2020-07-18T05:01:51Z

Hey @StrikerRUS, I think he did not add any new method inside the Booster object, am I right? If it is so, that PR should directly benefit from the new locking strategy. Buy yes it is important to keep up with these locks when new methods arise, and if possible ensure const-correctness for new methods.

StrikerRUS · 2020-07-19T00:41:52Z

@JoanFM Yeah, seems you are right!

Thanks a lot for your contribution and a great patience!

Ten0 · 2023-08-17T10:16:13Z

For reference: yamc seems to be a pretty bad implementation with regards to the way it manages shared locks: it relies on an exclusive mutex on a state variable to implement the shared locking.
This seems to be far from being on par with regular rwlock implementations. (e.g. gcc's libstdc++17 using glibc using pthread...)

jameslamb · 2023-08-17T22:09:20Z

yamc seems to be a pretty bad implementation with regards to the way it manages shared locks

@Ten0 are you interested in submitting a pull request changing the behavior you're talking about? Or writing up in more detail a proposal for some change that someone else could implement?

We'd welcome your help in improving LightGBM if you see some opportunity for improvement.

Ten0 · 2023-08-17T22:47:58Z

Are you interested in submitting a pull request changing the behavior you're talking about?

Hello! Thank you for your interest!
Indeed I might be! I'm currently working towards a benchmark for #6024 (which should be feature-complete otherwise), and I'll make sure to specifically benchmark the impact of the sub-optimal locking. If it is large, I may work on this next. However if it turns out that running the decision tree is expensive enough that the sub-optimal locking becomes negligible and allowing parallelism from #6024 is enough, I probably won't prioritize it.

Ten0 · 2023-09-18T17:39:45Z

I've finished the benchmarks and as it turns out, running our decision tree is expensive enough that the sub-optimal locking becomes negligible and allowing parallelism from #6024 is totally enough for us ATM, so I won't dedicate time to this.

github-actions · 2023-12-20T00:17:08Z

This pull request has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.

JoanFM and others added 28 commits February 3, 2020 14:51

Add capability to get possible max and min values for a model

971720f

Change implementation to have return value in tree.cpp, change naming…

3c49f4c

… to upper and lower bound, move implementation to gdbt.cpp

Update include/LightGBM/c_api.h

48a9474

Co-Authored-By: Nikita Titov <[email protected]>

Change iteration to avoid potential overflow, add bindings to R and P…

0d516c1

…ython and a basic test

Merge branch 'master' of https://github.com/microsoft/LightGBM into m…

f14ca1f

…ax_min_leafs

Adjust test values

8c58493

Consider const correctness and multithreading protection

019bda8

Put everything possible as const

a794241

Include shared_mutex, for now as unique_lock

70ca8fc

Update test values

c43bf1c

Put everything possible as const

87ee70f

Include shared_mutex, for now as unique_lock

fd24844

Make PredictSingleRow const and share the lock with other reading thr…

65cde98

…eads

Update test values

cfa612b

Add test to check that model is exactly the same in all platforms

d3a54ae

Try to parse the model to get the expected values

8ae8f3e

Try to parse the model to get the expected values

3713151

Merge branch 'max_min_leafs' of https://github.com/JoanFM/LightGBM in…

1cd8a9d

…to max_min_leafs

Fix implementation, num_leaves can be lower than the leaf_value_ size

c7185a0

Do not check for num_leaves to be smaller than actual size and get ba…

fd0bdb8

…ck to test with hardcoded value

Merge branch 'max_min_leafs' of https://github.com/JoanFM/LightGBM in…

d6ca4ff

…to max_min_leafs

Merge branch 'master' of https://github.com/microsoft/LightGBM into m…

a753edf

…ax_min_leafs

Change test order

19234e0

Add gpu_use_dp option in test

00bdde1

Remove helper test method

82c677c

Merge branch 'read_write_locks' of https://github.com/JoanFM/LightGBM …

46092fd

…into read_write_locks

Merge branch 'max_min_leafs' of https://github.com/JoanFM/LightGBM in…

397667c

…to read_write_locks

Remove TODO

46c4658

JoanFM requested review from chivee and guolinke as code owners February 14, 2020 08:50

imatiach-msft approved these changes Jul 14, 2020

View reviewed changes

guolinke approved these changes Jul 16, 2020

View reviewed changes

Merge branch 'master' into read_write_locks

611e3ea

JoanFM force-pushed the read_write_locks branch from 0f3cb8b to 611e3ea Compare July 16, 2020 05:34

JoanFM closed this Jul 16, 2020

JoanFM reopened this Jul 16, 2020

JoanFM and others added 2 commits July 16, 2020 14:26

Merge branch 'master' of https://github.com/microsoft/LightGBM into r…

e20f230

…ead_write_locks

Fix problems coming from merge conflict resolution

61c59fe

StrikerRUS merged commit 1c35c3b into microsoft:master Jul 19, 2020

AlbertoEAF mentioned this pull request Jul 20, 2020

[bug] Introduced bug in *Fast() methods with the locking strategy PR #3241

Closed

StrikerRUS mentioned this pull request Oct 12, 2020

Add arm64 wheel to travis-ci #3421

Closed

StrikerRUS mentioned this pull request Jan 12, 2021

Issue with call to c_api Predict functions in multi threaded way from golang #3751

Closed

Ten0 mentioned this pull request Sep 18, 2023

Fix single row prediction performance in a multi-threaded environment #6024

Merged

stonebrakert6 mentioned this pull request Oct 13, 2023

Closed - Opened by Mistake #6141

Closed

github-actions bot locked as resolved and limited conversation to collaborators Dec 20, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Change locking strategy of Booster, allow for share and unique locks #2760

feat: Change locking strategy of Booster, allow for share and unique locks #2760

JoanFM commented Feb 14, 2020 •

edited

Loading

imatiach-msft commented Jul 14, 2020

JoanFM commented Jul 16, 2020

guolinke commented Jul 16, 2020

StrikerRUS commented Jul 16, 2020

JoanFM commented Jul 16, 2020

JoanFM commented Jul 17, 2020

guolinke commented Jul 17, 2020

StrikerRUS commented Jul 17, 2020 •

edited

Loading

JoanFM commented Jul 18, 2020

StrikerRUS commented Jul 19, 2020

Ten0 commented Aug 17, 2023

jameslamb commented Aug 17, 2023

Ten0 commented Aug 17, 2023

Ten0 commented Sep 18, 2023

github-actions bot commented Dec 20, 2023

feat: Change locking strategy of Booster, allow for share and unique locks #2760

feat: Change locking strategy of Booster, allow for share and unique locks #2760

Conversation

JoanFM commented Feb 14, 2020 • edited Loading

imatiach-msft commented Jul 14, 2020

JoanFM commented Jul 16, 2020

guolinke commented Jul 16, 2020

StrikerRUS commented Jul 16, 2020

JoanFM commented Jul 16, 2020

JoanFM commented Jul 17, 2020

guolinke commented Jul 17, 2020

StrikerRUS commented Jul 17, 2020 • edited Loading

JoanFM commented Jul 18, 2020

StrikerRUS commented Jul 19, 2020

Ten0 commented Aug 17, 2023

jameslamb commented Aug 17, 2023

Ten0 commented Aug 17, 2023

Ten0 commented Sep 18, 2023

github-actions bot commented Dec 20, 2023

JoanFM commented Feb 14, 2020 •

edited

Loading

StrikerRUS commented Jul 17, 2020 •

edited

Loading