🚀 The feature, motivation and pitch
As the kernels currently appear to be limited to the FP32 data type, it would be immensely helpful if the implementations also supported mixed-precision computation (FP16 and BF16). This would open them up to a broader range of applications in NLP, not just graph neural nets.
How involved would enabling mixed-precision computation be? Any pointers for getting a PR started?
Alternatives
No response
Additional context
No response
Adding on to @sidnb13's comments here: it looks like segment_matmul just takes two Tensor arguments, which are plain torch.Tensors, and torch.Tensor does have native support for bfloat16 / torch.float16 / torch.half. The weird thing is that when you try to run segment_matmul on two tensors cast to bfloat16, you get this error:
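For reference, here is a minimal sketch of the kind of call being described, assuming the pyg_lib.ops.segment_matmul(inputs, ptr, other) signature; the exact error message will depend on the installed pyg-lib build:

```python
import torch
from pyg_lib.ops import segment_matmul

# Two segments of 5 and 3 rows; each segment is multiplied by its own weight matrix.
inputs = torch.randn(8, 16, dtype=torch.bfloat16)
ptr = torch.tensor([0, 5, 8])
other = torch.randn(2, 16, 32, dtype=torch.bfloat16)

# Works for float32 inputs; on builds where the kernel is only registered
# for float32, the bfloat16 call fails with a dtype/dispatch error.
out = segment_matmul(inputs, ptr, other)
print(out.shape, out.dtype)
```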
(segment|grouped)_matmul had an incomplete dispatch type set; I've fixed the CPU implementation in pyg-lib#272 (@puririshi98, could you please take a look at the CUDA implementation?). If you find any other custom operation lacking bf16 support, you can take a look at @yanbing-j's PRs, e.g. pytorch_scatter#316 and pytorch_scatter#375.
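To check which dtype/device combinations actually dispatch (e.g. to confirm the CPU fix and probe the CUDA path), a small smoke test along these lines can help; this is only a sketch, again assuming the segment_matmul(inputs, ptr, other) signature:

```python
import torch
from pyg_lib.ops import segment_matmul

devices = ["cpu"] + (["cuda"] if torch.cuda.is_available() else [])
dtypes = [torch.float32, torch.float16, torch.bfloat16]

for device in devices:
    for dtype in dtypes:
        inputs = torch.randn(8, 16, device=device, dtype=dtype)
        ptr = torch.tensor([0, 5, 8])  # segment boundaries (ptr device requirements may vary by build)
        other = torch.randn(2, 16, 32, device=device, dtype=dtype)
        try:
            segment_matmul(inputs, ptr, other)
            print(f"{device}/{dtype}: ok")
        except Exception as e:  # typically a missing-dispatch RuntimeError
            print(f"{device}/{dtype}: failed ({e})")
```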