richardson_lucy example crashes with large filter size #276

eddy16112 · 2022-04-13T23:38:17Z

The example is https://github.com/nv-legate/cunumeric/blob/branch-22.05/examples/richardson_lucy.py
When I run richardson_lucy.py -t -fx=512 -fy=256 -fz=151 -x=2048 -y=1024 -z=604, I get the following errors:

Internal cuFFT failure with error code 4 in file cunumeric/convolution/convolve.cu at line 1133
Internal cuFFT failure with error code 1 in file cunumeric/cudalibs.cu at line 88

Even if I set x=fx, y=fy, z=fz, it still crashes.
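
For reference, the numeric codes in the messages above can be mapped to cuFFT status names. The sketch below is a small lookup table, assuming the first ten cufftResult values as they appear in cufft.h; the helper name describe_cufft_error is hypothetical and not part of cunumeric or cuFFT.

```python
from enum import IntEnum


class CufftResult(IntEnum):
    """First ten cufftResult status codes (assumed to match cufft.h)."""
    CUFFT_SUCCESS = 0
    CUFFT_INVALID_PLAN = 1
    CUFFT_ALLOC_FAILED = 2
    CUFFT_INVALID_TYPE = 3
    CUFFT_INVALID_VALUE = 4
    CUFFT_INTERNAL_ERROR = 5
    CUFFT_EXEC_FAILED = 6
    CUFFT_SETUP_FAILED = 7
    CUFFT_INVALID_SIZE = 8
    CUFFT_UNALIGNED_DATA = 9


def describe_cufft_error(code: int) -> str:
    """Hypothetical helper: return the status name for a numeric cuFFT code."""
    try:
        return CufftResult(code).name
    except ValueError:
        # Codes outside this table are reported as unknown.
        return f"unknown cufftResult ({code})"


# The two codes reported in this issue.
for code in (4, 1):
    print(code, describe_cufft_error(code))
```

Under that assumption, code 4 reads as an invalid value passed to cuFFT and code 1 as an invalid plan handle.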


magnatelee · 2022-04-15T22:13:50Z

This should be fixed by #279. @eddy16112, can you pull my branch and check whether it fixes the issue?

magnatelee · 2022-04-15T22:19:07Z

Leaving a note for myself and others: the issue was caused by the convolve implementation calling cuFFT with pointers that are not 8B aligned. The alignment requirement is documented behavior: https://docs.nvidia.com/cuda/cufft/index.html#function-cufftexecr2c-cufftexecd2z Although this fix addresses the reported issue, I'll revert #204, which originally introduced the bug. While fixing it, I found that cuFFT callbacks make the convolution more than 3X slower for some unfortunate shapes, which is too high a cost to justify the memory saved on temporary allocations.
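
To illustrate the alignment problem described above, here is a minimal sketch in plain Python. It assumes single-precision data, where cufftComplex is two 32-bit floats (8 bytes), and shows how a pointer into the middle of a float32 buffer can fall on a 4-byte boundary. The helper names and the base address are hypothetical; this is not the cunumeric code.

```python
CUFFT_COMPLEX_BYTES = 8  # two 32-bit floats per cufftComplex
FLOAT32_BYTES = 4        # size of one real single-precision element


def is_aligned(address: int, alignment: int = CUFFT_COMPLEX_BYTES) -> bool:
    """Return True if the address is a multiple of the alignment."""
    return address % alignment == 0


def element_address(base: int, element_offset: int,
                    itemsize: int = FLOAT32_BYTES) -> int:
    """Address of the element that sits element_offset items past base."""
    return base + element_offset * itemsize


def next_aligned_offset(base: int, element_offset: int,
                        itemsize: int = FLOAT32_BYTES,
                        alignment: int = CUFFT_COMPLEX_BYTES) -> int:
    """Smallest element offset >= element_offset whose address is aligned.

    Raises ValueError if no offset within one alignment period qualifies,
    which happens when base itself is not a multiple of itemsize.
    """
    for candidate in range(element_offset, element_offset + alignment):
        if is_aligned(element_address(base, candidate, itemsize), alignment):
            return candidate
    raise ValueError("no aligned offset reachable from this base address")


# Hypothetical 8-byte-aligned base address of a float32 buffer.
base = 0x7F0000000000

# Even element offsets stay 8-byte aligned; odd ones land 4 bytes off.
for offset in (0, 1, 2, 3):
    addr = element_address(base, offset)
    print(offset, hex(addr), is_aligned(addr))
```

In this model, a sub-buffer that starts at an odd float32 index would have to be shifted or copied to an aligned location before being handed to a single-precision real-to-complex transform.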

magnatelee · 2022-04-18T06:30:36Z

@eddy16112 Please pull and try again.

eddy16112 · 2022-04-18T16:51:33Z

It works for me. Thank you!

magnatelee self-assigned this Apr 14, 2022

magnatelee mentioned this issue Apr 18, 2022

Revert "Port and refactor GH #140" #280

Merged

eddy16112 closed this as completed Apr 18, 2022
