add type trait 'remove_restrict' #1474

psychocoderHPC · 2021-11-18T14:39:36Z

Provide a type trait to remove __restrict__ from a type.
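
For readers following the thread, here is a minimal sketch of what such a trait could look like. It is an illustration only, not the code merged in include/alpaka/core/RemoveRestrict.hpp: the `sketch` namespace is hypothetical, and the GCC/Clang spelling __restrict__ is assumed (MSVC spells the keyword __restrict).

```cpp
#include <type_traits>

namespace sketch // hypothetical namespace, not part of alpaka
{
    // Primary template: types without a restrict qualifier pass through unchanged.
    template<typename T>
    struct remove_restrict
    {
        using type = T;
    };

    // Partial specialization: strip the qualifier from a restrict-qualified pointer.
    // Assumes the GCC/Clang spelling of the keyword.
    template<typename T>
    struct remove_restrict<T* __restrict__>
    {
        using type = T*;
    };

    // Convenience alias in the style of the standard *_t helpers.
    template<typename T>
    using remove_restrict_t = typename remove_restrict<T>::type;
} // namespace sketch

// Compile-time checks: the pointer loses the qualifier, other types are untouched.
static_assert(std::is_same_v<sketch::remove_restrict_t<float* __restrict__>, float*>);
static_assert(std::is_same_v<sketch::remove_restrict_t<int>, int>);
```

The checks are static_asserts, so a successful compile is the whole test.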

include/alpaka/core/RemoveRestrict.hpp

j-stephan · 2021-11-18T15:03:42Z

Is removing __restrict__ what we want to achieve here? Removing it against the user's explicit wish feels wrong to me.

fwyzard · 2021-11-18T15:14:12Z

> Is removing __restrict__ what we want to achieve here? Removing it against the user's explicit wish feels wrong to me.

My impression is that for the CUDA and HIP backends, __restrict__ is not carried over across the call from host to device anyway. The pointers need to be declared as __restrict__ inside the kernel (on the device side).

For CPU backends the __restrict__ on the "host" side could still be meaningful (not sure about useful).
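
The pattern described in this comment can be sketched in plain host C++, so that it compiles without a CUDA or HIP toolchain: the qualifier is stripped from the argument types at the launch boundary and declared again on the kernel's own parameters. Every name below (launch_sketch, stored_args_t, ScaleKernel, run_scale) is hypothetical and is not alpaka's API; the GCC/Clang spelling __restrict__ is assumed.

```cpp
#include <tuple>
#include <type_traits>

namespace launch_sketch // hypothetical namespace, not part of alpaka
{
    // Stand-in for the trait discussed in this PR (GCC/Clang spelling assumed).
    template<typename T>
    struct remove_restrict
    {
        using type = T;
    };

    template<typename T>
    struct remove_restrict<T* __restrict__>
    {
        using type = T*;
    };

    template<typename T>
    using remove_restrict_t = typename remove_restrict<T>::type;

    // Argument types as they would be stored at the launch boundary:
    // decayed, and with any restrict qualifier removed.
    template<typename... TArgs>
    using stored_args_t = std::tuple<remove_restrict_t<std::decay_t<TArgs>>...>;

    // Hypothetical kernel functor: restrict is declared on the callee side,
    // where the compiler can actually use the no-aliasing promise.
    struct ScaleKernel
    {
        constexpr void operator()(float const* __restrict__ in, float* __restrict__ out, int n) const
        {
            for(int i = 0; i < n; ++i)
                out[i] = 2.0f * in[i];
        }
    };

    // Stores the arguments without the qualifier, then calls the kernel with them.
    constexpr float run_scale()
    {
        float in[3] = {1.0f, 2.0f, 3.0f};
        float out[3] = {0.0f, 0.0f, 0.0f};
        stored_args_t<float const* __restrict__, float* __restrict__, int> args{in, out, 3};
        ScaleKernel{}(std::get<0>(args), std::get<1>(args), std::get<2>(args));
        return out[2];
    }
} // namespace launch_sketch

// The stored types carry no restrict qualifier; the kernel still runs as expected.
static_assert(std::is_same_v<
              launch_sketch::stored_args_t<float* __restrict__, int const&>,
              std::tuple<float*, int>>);
static_assert(launch_sketch::run_scale() == 6.0f);
```

Whether the qualifier on the kernel's parameters changes the generated code is up to the compiler; the sketch only shows that removing it from the stored argument types does not prevent the kernel from declaring it again.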

j-stephan · 2021-11-18T18:09:02Z

Got it, thanks!

include/alpaka/kernel/TaskKernelGpuUniformCudaHipRt.hpp

include/alpaka/core/RemoveRestrict.hpp

psychocoderHPC · 2021-11-26T12:49:58Z

Windows CI error in all runs with debug enabled:

2021-11-26T11:32:07.5385822Z   D:\a\alpaka\alpaka\build\example\helloWorld>"C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.2\bin\nvcc.exe" -gencode=arch=compute_52,code=\"compute_52,compute_52\" -gencode=arch=compute_52,code=\"sm_52,compute_52\" --use-local-env -ccbin "C:\Program Files (x86)\Microsoft Visual Studio\2019\Enterprise\VC\Tools\MSVC\14.29.30133\bin\HostX64\x64" -x cu   -ID:\a\alpaka\alpaka\include -I"C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v11.2\include" -ID:\a\alpaka\alpaka\boost -I"C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.2\include"     --keep-dir x64\Debug  -maxrregcount=0  --machine 64 --compile -cudart static --extended-lambda --expt-relaxed-constexpr -lineinfo -Xcudafe=--display_error_number -Xcudafe=--diag_suppress=esa_on_defaulted_function_ignored -std=c++17 -Xcompiler="/EHsc -Zi -Ob0 /openmp" -g  -D_WINDOWS -DALPAKA_ACC_CPU_B_SEQ_T_SEQ_ENABLED -DALPAKA_ACC_CPU_B_SEQ_T_THREADS_ENABLED -DALPAKA_ACC_CPU_B_OMP2_T_SEQ_ENABLED -DALPAKA_ACC_CPU_B_SEQ_T_OMP2_ENABLED -DALPAKA_ACC_GPU_CUDA_ENABLED -DALPAKA_DEBUG=0 -DALPAKA_DEBUG_OFFLOAD_ASSUME_HOST -DALPAKA_OFFLOAD_MAX_BLOCK_SIZE=256 -DALPAKA_BLOCK_SHARED_DYN_MEMBER_ALLOC_KIB=47 -DALPAKA_CI -D"CMAKE_INTDIR=\"Debug\"" -D"CMAKE_INTDIR=\"Debug\"" -D_MBCS -Xcompiler "/EHsc /W1 /nologo /Od /FdhelloWorld.dir\Debug\vc142.pdb /FS /Zi /RTC1 /MDd /GR" -o helloWorld.dir\Debug\helloWorld.obj "D:\a\alpaka\alpaka\example\helloWorld\src\helloWorld.cpp" 
2021-11-26T11:32:13.7723136Z   helloWorld.cpp
2021-11-26T11:32:14.9954951Z C:\Users\runneradmin\AppData\Local\Temp\tmpxft_000007fc_00000000-7_helloWorld.cudafe1.stub.c(27): 
error C2912: explicit specialization 'void alpaka::uniform_cuda_hip::detail::
__wrapper__device_stub_uniformCudaHipKernel<alpaka::AccGpuCudaRt<std::integral_constant<unsigned __int64,3>,unsigned __int64>,std::integral_constant<unsigned __int64,3>,unsigned __int64,HelloWorldKernel>(const _ZN6alpaka3VecISt17integral_constantIyLy3EEyEE &,const HelloWorldKernel &)' 
is not a specialization of a function template 
[D:\a\alpaka\alpaka\build\example\helloWorld\helloWorld.vcxproj]
2021-11-26T11:32:15.0268017Z C:\Program Files (x86)\Microsoft Visual Studio\2019\Enterprise\MSBuild\Microsoft\VC\v160\BuildCustomizations\CUDA 11.2.targets(785,9): error MSB3721: The command ""C:/Program Files/NVIDIA GPU Computing Toolkit/CUDA/v11.2\bin\nvcc.exe"

psychocoderHPC · 2021-11-26T20:54:16Z

Switched to draft. I changed some lines to get the Windows CI to pass. On Monday I will run some more tests if all CI jobs have passed.

fix alpaka-group#1472 Provide a type trait to remove __restrict__ from a type.

psychocoderHPC · 2021-11-29T12:18:04Z

This PR is now passing the CI and can be reviewed.

psychocoderHPC added Type:Bug Backend:CUDA labels Nov 18, 2021

psychocoderHPC added this to the Version 0.9.0 (I/2022) milestone Nov 18, 2021

psychocoderHPC requested review from fwyzard and a team November 18, 2021 14:39

fwyzard mentioned this pull request Nov 18, 2021

Cannot compile with GCC 10 or later for the CUDA backend if the kernel functor is templated #1472

Closed

bernhardmgruber reviewed Nov 18, 2021

include/alpaka/core/RemoveRestrict.hpp (outdated)

fwyzard previously approved these changes Nov 18, 2021

fwyzard added a commit to cms-patatrack/alpaka that referenced this pull request Nov 19, 2021

Add type trait 'remove_restrict' (alpaka-group#1474)

f71bdba

fwyzard added a commit to cms-patatrack/alpaka that referenced this pull request Nov 19, 2021

Add type trait 'remove_restrict' (alpaka-group#1474)

6b4ca5f

fwyzard mentioned this pull request Nov 19, 2021

0.8.0-patatrack branch cms-patatrack/alpaka#1

Open

fwyzard added a commit to cms-patatrack/alpaka that referenced this pull request Nov 23, 2021

Add type trait remove_restrict (alpaka-group#1474)

adcd6de

psychocoderHPC dismissed fwyzard’s stale review via ab9c10f November 24, 2021 14:08

psychocoderHPC force-pushed the fix-cudaKernelCallWithRestrictedPointers branch from 80b1e13 to ab9c10f Compare November 24, 2021 14:08

bernhardmgruber previously approved these changes Nov 24, 2021

psychocoderHPC dismissed bernhardmgruber’s stale review via 418f09b November 25, 2021 14:22

psychocoderHPC force-pushed the fix-cudaKernelCallWithRestrictedPointers branch from ab9c10f to 418f09b Compare November 25, 2021 14:22

bernhardmgruber reviewed Nov 25, 2021

include/alpaka/kernel/TaskKernelGpuUniformCudaHipRt.hpp (outdated)

psychocoderHPC force-pushed the fix-cudaKernelCallWithRestrictedPointers branch 2 times, most recently from 32599d4 to 14f53e7 Compare November 26, 2021 07:55

bernhardmgruber reviewed Nov 26, 2021

include/alpaka/core/RemoveRestrict.hpp (outdated)

bernhardmgruber reviewed Nov 26, 2021

include/alpaka/core/RemoveRestrict.hpp (outdated)

psychocoderHPC force-pushed the fix-cudaKernelCallWithRestrictedPointers branch from 14f53e7 to 79a68ae Compare November 26, 2021 09:46

bernhardmgruber previously approved these changes Nov 26, 2021

psychocoderHPC dismissed bernhardmgruber’s stale review via ef8815a November 26, 2021 12:53

psychocoderHPC force-pushed the fix-cudaKernelCallWithRestrictedPointers branch from 79a68ae to ef8815a Compare November 26, 2021 12:53

psychocoderHPC force-pushed the fix-cudaKernelCallWithRestrictedPointers branch from ef8815a to 398d7a6 Compare November 26, 2021 15:44

psychocoderHPC marked this pull request as draft November 26, 2021 20:52

add type trait 'remove_restrict'

cffcc92

fix alpaka-group#1472 Provide a type trait to remove __restrict__ from a type.

psychocoderHPC force-pushed the fix-cudaKernelCallWithRestrictedPointers branch from 398d7a6 to cffcc92 Compare November 29, 2021 07:04

bernhardmgruber approved these changes Nov 29, 2021

psychocoderHPC marked this pull request as ready for review November 29, 2021 12:17

j-stephan approved these changes Nov 29, 2021

bernhardmgruber approved these changes Nov 29, 2021

j-stephan merged commit e1308c8 into alpaka-group:develop Nov 29, 2021

psychocoderHPC deleted the fix-cudaKernelCallWithRestrictedPointers branch November 30, 2021 09:13

fwyzard added a commit to cms-patatrack/alpaka that referenced this pull request Dec 5, 2021

Add type trait 'remove_restrict' (alpaka-group#1474)

811de29

fwyzard added a commit to cms-patatrack/alpaka that referenced this pull request Dec 21, 2021

Add type trait remove_restrict (alpaka-group#1474)

a3e3a00

fwyzard added a commit to cms-patatrack/alpaka that referenced this pull request Dec 21, 2021

Add type trait 'remove_restrict' (alpaka-group#1474)

49f1d9e
