Fix CUDA gradient script #806
Conversation
It would also be safer to add checks for CUDA errors when calling CUDA functions, but the final comparison of the GPU and CPU results will hint at a need for debugging, so I believe there's no need to "clutter" the code further.
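For illustration, the kind of error checking discussed here is often done with a small wrapper macro. This is a hypothetical sketch, not part of the PR; the macro name and error handling strategy are assumptions:

```
// Hypothetical CUDA error-check macro; not part of this PR, shown only to
// illustrate the kind of checks discussed above.
#include <cuda_runtime.h>
#include <cstdio>
#include <cstdlib>

#define CUDA_CHECK(call)                                          \
  do {                                                            \
    cudaError_t err = (call);                                     \
    if (err != cudaSuccess) {                                     \
      fprintf(stderr, "CUDA error '%s' at %s:%d\n",               \
              cudaGetErrorString(err), __FILE__, __LINE__);       \
      exit(EXIT_FAILURE);                                         \
    }                                                             \
  } while (0)

// Usage: CUDA_CHECK(cudaMalloc(&d_x, N * sizeof(double)));
```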
@@ -15,6 +15,7 @@
// XFAIL: clang-15
Can we understand why it fails for clang-15?
My first guess would be that the local CUDA version where this was tested and failed was not compatible with clang-15. I will have a look.
Unfortunately, I haven't found a CUDA version old enough (prior to 11.5) and compatible with my system (Ubuntu 22.04) to test this with clang-15, but upon examining another issue I came across a similar failure of clang-15 and clang-14 (didn't test with clang-16). Relevant comment here.
This looks good to me. We probably should open a new issue and a separate PR to continue the XFAIL: clang-15 discussion there.
I can open one if you'd like.
I was also thinking that if the CI were expanded to include a CUDA setup, we could catch the build errors directly in the PR (harder for development, but it would be beneficial for developers on the latest OS versions, where older versions of clang are hard to test locally). This could be a separate issue.
Sounds good.
test/CUDA/GradientCuda.cu
@@ -109,14 +110,24 @@ int main(void) {
   cudaMemcpy(d_x, x, N * sizeof(double), cudaMemcpyHostToDevice);
   cudaMalloc(&d_p, N * sizeof(double));
   cudaMemcpy(d_p, p, N * sizeof(double), cudaMemcpyHostToDevice);
-  double *result, *d_result;
+  std::vector<double> result(N, 0);
We probably do not need a std::vector here. Can we use a fixed size array or an std::array?
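The fixed-size alternative suggested here could look like the sketch below. The size constant and helper name are assumptions for illustration; in GradientCuda.cu, N would be the test's own constant:

```cpp
#include <array>
#include <cstddef>

// Assumed compile-time size; the actual test defines its own N.
constexpr std::size_t N = 10;

// Zero-initialized fixed-size host buffer, replacing
// std::vector<double> result(N, 0). std::array avoids a heap allocation.
std::array<double, N> make_result_buffer() {
  std::array<double, N> result{};  // value-initialized: all elements are 0.0
  return result;
}

// The raw pointer for the device-to-host copy would come from result.data():
// cudaMemcpy(result.data(), d_result, N * sizeof(double), cudaMemcpyDeviceToHost);
```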
LGTM. See my comment.
Fixes #744.
As described here, the gradient computation of the function of interest was moved inside the kernel. The results returned from the GPU were compared with the ones derived from the CPU execution to establish the correctness of the implementation.
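The GPU-vs-CPU comparison described above amounts to an element-wise tolerance check. A minimal sketch follows; the function name and tolerance are assumptions, not the test's actual code:

```cpp
#include <cmath>
#include <cstddef>

// Hypothetical helper illustrating the GPU-vs-CPU correctness check; the
// PR's test may use different names and a different tolerance.
bool results_match(const double* gpu, const double* cpu, std::size_t n,
                   double tol = 1e-9) {
  for (std::size_t i = 0; i < n; ++i)
    if (std::fabs(gpu[i] - cpu[i]) > tol)
      return false;  // a mismatch hints that the kernel needs debugging
  return true;
}
```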