
[WIP] Add all gather op and all reduce op #9713

Conversation

@chengduoZH (Contributor) commented Apr 7, 2018

This PR attempts to handle sparse gradient aggregation across multiple GPU cards.
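(The PR's diff is not shown here; below is a minimal single-process Python sketch, not the actual op implementation, of why sparse gradients call for an all-gather: each card holds gradient rows for a different set of row indices, so the per-card buffers cannot simply be summed elementwise. Every card instead gathers all (indices, rows) pairs and merges duplicate rows locally.)

```python
import numpy as np

def all_gather_sparse(per_card_grads):
    """per_card_grads: one (indices, rows) pair per card."""
    merged = {}
    for indices, rows in per_card_grads:       # "all-gather": collect every card's pairs
        for idx, row in zip(indices, rows):    # local merge of duplicate row indices
            merged[idx] = merged.get(idx, 0) + row
    out_idx = sorted(merged)
    return np.array(out_idx), np.stack([merged[i] for i in out_idx])

# Card 0 touched rows {0, 2}; card 1 touched rows {2, 5}.
card0 = (np.array([0, 2]), np.ones((2, 4)))
card1 = (np.array([2, 5]), np.full((2, 4), 2.0))
indices, rows = all_gather_sparse([card0, card1])
print(indices)    # [0 2 5]
print(rows[1])    # row 2 accumulated from both cards: [3. 3. 3. 3.]
```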

@chengduoZH force-pushed the feature/add_all_reduce_and_all_gather_op branch from 7f595aa to 22a064a on April 7, 2018 14:37
@chengduoZH force-pushed the feature/add_all_reduce_and_all_gather_op branch from 22a064a to 231a219 on April 7, 2018 15:26
@panyx0718 (Contributor) commented:

Briefly discussed. Customized gather and reduce ops might allow us to fully overlap computation with memcpy. They would also allow us to do customized sparse gradient aggregation. Nice!
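(To make the overlap claim concrete, a hypothetical sketch with plain Python threads standing in for CUDA streams; the parameter names are made up. Each gradient is handed to a copier as soon as backward produces it, so the transfer runs concurrently with the remaining gradient computation.)

```python
import queue
import threading
import time

grads = queue.Queue()

def copier():
    # Simulated copy engine: drains gradients concurrently with compute.
    while True:
        name = grads.get()
        if name is None:
            break
        time.sleep(0.01)                 # stand-in for a device memcpy
        print(f"copied {name}")

t = threading.Thread(target=copier)
t.start()
for name in ["fc2.w", "fc2.b", "fc1.w", "fc1.b"]:  # backward order
    time.sleep(0.01)                     # stand-in for computing this gradient
    grads.put(name)                      # hand off as soon as it is ready
grads.put(None)                          # sentinel: no more gradients
t.join()
```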

@reyoung (Collaborator) commented Apr 8, 2018

I do not believe this implementation is faster than NCCL, especially for dense parameters. Also, I think an AllReduce operator must contain GPU kernels; it cannot be implemented with memcpys alone.
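(A tiny sketch of the distinction being drawn here: all-gather only moves bytes, since the per-card buffers are concatenated unchanged, whereas all-reduce must combine buffers arithmetically, which on a GPU means launching a compute kernel rather than only issuing memcpys.)

```python
import numpy as np

bufs = [np.full(4, float(i)) for i in range(3)]  # one dense buffer per card

gathered = np.concatenate(bufs)   # all-gather: pure data movement
reduced = np.sum(bufs, axis=0)    # all-reduce: elementwise arithmetic required

print(gathered)   # [0. 0. 0. 0. 1. 1. 1. 1. 2. 2. 2. 2.]
print(reduced)    # [3. 3. 3. 3.]
```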

@chengduoZH force-pushed the feature/add_all_reduce_and_all_gather_op branch 2 times, most recently from f78783d to 5f337c9 on April 8, 2018 07:43
@chengduoZH force-pushed the feature/add_all_reduce_and_all_gather_op branch from 5f337c9 to 1e29c2e on April 8, 2018 09:40
@chengduoZH (Contributor, Author) commented:

This PR did not meet expectations, so it can be closed.

@chengduoZH closed this Apr 13, 2018