-
Hello, I was wondering: is it possible to run flash attention (or better) on Turing GPUs such as the T4 with your framework? I would obviously much rather use TPUs, but Kaggle competitions only allow GPUs for submission, and we really need a large context. PyTorch currently doesn't support flash attention on these GPUs.
Answered by erfanzar · May 6, 2024
Replies: 2 comments
-
Hello, I'm working on that right now. Yes, I expect it will be possible soon.
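In the meantime, large contexts can often be handled on GPUs without flash attention kernels by chunking the queries so the full (seq, seq) score matrix is never materialized. The sketch below is purely illustrative, not the framework's actual implementation; `chunked_attention` and its parameters are hypothetical names, and it assumes a single head with the sequence length divisible by the chunk size.

```python
import jax
import jax.numpy as jnp


def chunked_attention(q, k, v, chunk_size=128):
    """Memory-efficient attention sketch: iterate over query chunks so
    only a (chunk_size, seq_len) score block is live at any time."""
    seq_len, dim = q.shape
    scale = dim ** -0.5

    def attend_chunk(q_chunk):
        # Scores for one chunk of query rows against all keys.
        scores = (q_chunk @ k.T) * scale
        weights = jax.nn.softmax(scores, axis=-1)
        return weights @ v

    # Split queries into chunks and map over them sequentially.
    q_chunks = q.reshape(-1, chunk_size, dim)
    out = jax.lax.map(attend_chunk, q_chunks)
    return out.reshape(seq_len, dim)


key = jax.random.PRNGKey(0)
kq, kk, kv = jax.random.split(key, 3)
q = jax.random.normal(kq, (512, 64))
k = jax.random.normal(kk, (512, 64))
v = jax.random.normal(kv, (512, 64))

out = chunked_attention(q, k, v)
```

This trades some speed for memory: peak activation memory scales with `chunk_size * seq_len` rather than `seq_len ** 2`, which is usually what matters on a 16 GB T4. True flash attention additionally fuses the softmax into the kernel, which this sketch does not do.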
Answer selected by defdet
-
Looking forward to it; that would help a lot!