cufft: use CUFFT_COMPATIBILITY_FFTW_PADDING instead of CUFFT_COMPATIBILITY_NATIVE #52

godsic · 2015-11-06T10:29:18Z

CUFFT_COMPATIBILITY_NATIVE mode is broken since cuda7.0+. Since CUFFT_COMPATIBILITY_NATIVE has been marked as DEPRECATED since cuda6.0, Nvidia developers are not willing to fix it. Therefore mumax3 should switch to the default CUFFT_COMPATIBILITY_FFTW_PADDING mode. This is rather big change.

Since Nvidia now suggests to use out-of-place r2c and c2r transforms, i.e. see page 20 of the "CUFFT LIBRARY USER'S GUIDE DU-06707-001_v7.5", then I would prefer to take this route. This will marginally increase mumax3 memory consumption, but is way less harmful, then say touching mighty demag kernel code.

@barnex any thoughts?

barnex · 2015-11-07T20:09:55Z

Yes, I agree. I don't want to touch the kernel layout at any cost.
I'll give it a shot this weekend.

godsic · 2015-11-07T21:00:25Z

If you are busy, I am happy to work on this next week.

On 7 November 2015 at 21:09, Arne Vansteenkiste [email protected]
wrote:

Yes, I agree. I don't want to touch the kernel layout at any cost.
I'll give it a shot this weekend.

—
Reply to this email directly or view it on GitHub
#52 (comment).

Mykola

barnex · 2015-11-07T21:02:16Z

Sounds good. Let me know if you need any help.
e0c3a28 re-introduces the convolution self-test (against brute-force , for a sparse random magnetization). It is enabled by the flag -paranoid, and is on in all tests. This should help in testing the fix.

godsic · 2015-11-07T21:29:43Z

Thanks, this should help indeed.

godsic · 2015-11-11T15:40:47Z

de985bf
7313693

barnex · 2015-11-12T20:12:12Z

Thanks @godsic! All tests pass on my 680.

godsic added bug enhancement question labels Nov 6, 2015

godsic self-assigned this Nov 6, 2015

barnex assigned barnex and godsic and unassigned godsic and barnex Nov 7, 2015

barnex removed the enhancement label Nov 7, 2015

godsic closed this as completed Nov 11, 2015

barnex mentioned this issue Dec 4, 2015

MFM imaging broken by cuFFT bug. #53

Closed

mkuron mentioned this issue Nov 14, 2016

cuFFT native memory layout removed in cuda 8.0 espressomd/espresso#918

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cufft: use CUFFT_COMPATIBILITY_FFTW_PADDING instead of CUFFT_COMPATIBILITY_NATIVE #52

cufft: use CUFFT_COMPATIBILITY_FFTW_PADDING instead of CUFFT_COMPATIBILITY_NATIVE #52

godsic commented Nov 6, 2015

barnex commented Nov 7, 2015

godsic commented Nov 7, 2015

barnex commented Nov 7, 2015

godsic commented Nov 7, 2015

godsic commented Nov 11, 2015

barnex commented Nov 12, 2015

cufft: use CUFFT_COMPATIBILITY_FFTW_PADDING instead of CUFFT_COMPATIBILITY_NATIVE #52

cufft: use CUFFT_COMPATIBILITY_FFTW_PADDING instead of CUFFT_COMPATIBILITY_NATIVE #52

Comments

godsic commented Nov 6, 2015

barnex commented Nov 7, 2015

godsic commented Nov 7, 2015

barnex commented Nov 7, 2015

godsic commented Nov 7, 2015

godsic commented Nov 11, 2015

barnex commented Nov 12, 2015