I have an RX 6900 XT that runs Stable Diffusion inference in 33 s. My 16 GB RTX A4000 does the same in 6.7 s. Without support for, or emulation of, tensor cores, DirectML is not a serious alternative to either ROCm or CUDA. AMD inference times are roughly six times slower than the equivalent Nvidia card running CUDA. Even ROCm shows massive gains on Radeon cards that have no actual matrix cores.
Any chance the plugin gets real mixed precision support? What are your plans going forward with regard to performance?
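For reference, here is a minimal sketch of the two paths I'm comparing, assuming the Hugging Face diffusers library; the checkpoint name and prompt are just placeholders:

```python
import time
import torch
from diffusers import StableDiffusionPipeline

# fp16 on CUDA -- tensor cores engaged, the ~6.7 s path
pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# fp32 via torch-directml -- the ~33 s path, since DirectML currently
# runs the matmuls in full precision with no tensor-core equivalent:
# import torch_directml
# pipe = StableDiffusionPipeline.from_pretrained(
#     "runwayml/stable-diffusion-v1-5"
# ).to(torch_directml.device())

start = time.perf_counter()
image = pipe("a photo of an astronaut riding a horse").images[0]
print(f"inference took {time.perf_counter() - start:.1f} s")
```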
Thanks in advance for taking the time to address these concerns.
As @aliencaocao said, mixed precision is an area that we haven't been focusing on yet but it's on our radar. Is there a particular model that you're looking at?