LightGBM comes with the ability to use multiple machines for training. This can be done with the CLI, or with integrations like Spark, Kubeflow Fairing, and Dask (#3515).
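For context, a minimal sketch of what the Dask route mentioned above could look like, assuming a lightgbm build that ships the Dask estimators from #3515 and a reachable Dask cluster (the scheduler address and the synthetic data are placeholders):

```python
import dask.array as da
from dask.distributed import Client

import lightgbm as lgb

# Connect to an existing Dask cluster; the scheduler address is a placeholder.
client = Client("tcp://scheduler-address:8786")

# Synthetic data split into chunks that Dask can spread across workers,
# possibly on different machines.
X = da.random.random((100_000, 20), chunks=(10_000, 20))
y = da.random.random((100_000,), chunks=(10_000,))

# Each worker trains on its local partitions; the workers communicate
# with each other to produce a single boosted model.
model = lgb.DaskLGBMRegressor(n_estimators=50)
model.fit(X, y)

print(model.predict(X).compute()[:5])
```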
Today, the docs refer to training with multiple machines as "parallel learning".
I think that term is not precise enough and can lead to confusion.
LightGBM has at least two types of parallelism (a rough illustration of the difference follows the list):

1. within one process (shared memory), using multithreading with OpenMP
2. across multiple processes (possibly on multiple machines, and with distributed data), using either sockets or MPI
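To make the distinction concrete, here is a rough sketch of how the two cases show up in core LightGBM training parameters; the thread count, port, machine count, and machine-list file name below are placeholders, not recommendations:

```python
import numpy as np

import lightgbm as lgb

X = np.random.random((1_000, 10))
y = np.random.random(1_000)
dtrain = lgb.Dataset(X, label=y)

# 1. Shared-memory parallelism: one process, multiple OpenMP threads.
single_process_params = {
    "objective": "regression",
    "num_threads": 8,          # threads within this single process
    "tree_learner": "serial",  # no communication with other machines
}
lgb.train(single_process_params, dtrain, num_boost_round=10)

# 2. Multi-process / multi-machine parallelism: each machine runs this same
#    script on its own shard of the data, and the processes talk to each
#    other over sockets (or MPI, if LightGBM was built with MPI support).
distributed_params = {
    "objective": "regression",
    "tree_learner": "data",                # data-parallel distributed learning
    "num_machines": 2,                     # total number of participating machines
    "local_listen_port": 12400,            # port this machine listens on
    "machine_list_filename": "mlist.txt",  # placeholder file listing ip and port of every machine
}
# lgb.train(distributed_params, dtrain, num_boost_round=10)
# (commented out: this call blocks until all the other machines join)
```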
https://lightgbm.readthedocs.io/en/latest/Parallel-Learning-Guide.html#parallel-learning-guide only refers to the second case today.

I think we should rename this guide to "Distributed Learning" and use the word "distributed" everywhere in the documentation that talks about using multiple machines to accomplish model training.
Wanted to open this request for comment before I start making changes. What do you think?