Investigate moving value/policy head to GPU #51
I'm looking into this. It's not as trivial as I had hoped.
This doesn't move the entire head (i.e. not the FC/innerproduct layers), just the final 1x1 convolution. It's useful because it saves a lot of data transfer (64-128 fold!) over the PCIe bus, even though the computational gain is small.
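A quick back-of-the-envelope sketch of where the 64-128 fold figure comes from: keeping the head's 1x1 convolution on the GPU means only the head's output planes cross PCIe, instead of the full residual-tower feature map. The channel counts below are illustrative assumptions (a 128-channel tower, a 1-plane head output), not values from the repo.

```python
# Illustrative PCIe-transfer arithmetic for moving the head's 1x1
# convolution onto the GPU. Channel counts are assumed, not from lczero.
BOARD_SQUARES = 8 * 8   # 8x8 chess board
FLOAT_BYTES = 4         # float32

def transfer_bytes(channels: int) -> int:
    """Bytes copied over PCIe per position for a given channel count."""
    return channels * BOARD_SQUARES * FLOAT_BYTES

tower_channels = 128    # residual-tower width (assumed)
head_channels = 1       # head output planes after the 1x1 conv (assumed)

saving = transfer_bytes(tower_channels) / transfer_bytes(head_channels)
print(saving)  # 128.0 -- the "128 fold" end of the range above
```

With a 2-plane head output the same arithmetic gives 64x, matching the low end of the quoted range.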
Edit: never mind, I found the issue. I do wonder, though: what is a good way to debug OpenCL code? I'm a CUDA guy.
#64 only moved the convolutions to GPU. Inner products are still on CPU and, according to the profiler, take more time than the convolutions do on GPU. I earlier made a GPU inner product for leelaz, but it was slower than doing it on CPU. It should be faster for lczero since the inner products are much bigger. I'll see if I can port it over.
Hey @glinscott, sorry it's not fully resolved yet. We should still move the inner products to the GPU, at least the 32x8x8 -> 1924 policy head and the 32x8x8 -> 128 layer from the value head.
There was a prototype here: https://github.com/ihavnoid/leela-zero/tree/dualhead_conv
It was not a win for Leela Zero, but with us increasing from 1/2 to 32 or 64 channels, it could make a big difference now.
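For reference, the two fully connected (inner product) layers in question amount to a pair of GEMVs over the flattened 32x8x8 head output. This is a minimal NumPy sketch using the sizes quoted above; the weight matrices are random placeholders, not real network weights.

```python
import numpy as np

# Sketch of the two inner-product layers discussed above as GEMVs.
# Sizes follow the comment (32x8x8 inputs -> 1924 policy outputs,
# 32x8x8 inputs -> 128 value-head outputs); weights are placeholders.
rng = np.random.default_rng(0)

flat = 32 * 8 * 8                       # flattened head-conv output = 2048
W_policy = rng.standard_normal((1924, flat), dtype=np.float32)
W_value = rng.standard_normal((128, flat), dtype=np.float32)
x = rng.standard_normal(flat, dtype=np.float32)

policy_logits = W_policy @ x            # 1924 x 2048 matrix-vector product
value_hidden = W_value @ x              # 128 x 2048 matrix-vector product

print(policy_logits.shape, value_hidden.shape)  # (1924,) (128,)
```

At these sizes each call moves a few megabytes of weights through a single GEMV, which is the regime where a GPU inner product can plausibly beat the CPU, unlike the much smaller Leela Zero heads.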