Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Nested 'Parallel Reduction' case #90

Open
josemonsalve2 opened this issue Feb 12, 2020 · 7 comments
Open

Nested 'Parallel Reduction' case #90

josemonsalve2 opened this issue Feb 12, 2020 · 7 comments
Labels
new test PR that adds new tests

Comments

@josemonsalve2
Copy link
Contributor

Directive and clause it applies to
Parallel within parallel and reduction

Specification details
From Mailing List:
Is it a known problem?

int main()
{
  int t=-1;

#pragma omp target teams map(t)
{
  #pragma omp parallel reduction(+: t)
  {
    #pragma omp parallel reduction(+: t)
    {
      t = 1;
    }
  }
}
}
$ clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda tmp.c
ptxas /tmp/thread_limit-984c26.s, line 1792; error   : Instruction 'vote' without '.sync' is not supported on .target sm_70 and higher from PTX ISA version 6.4
ptxas fatal   : Ptx assembly aborted due to errors
clang-11: error: ptxas command failed with exit code 255 (use -v to see invocation)
$ bin/clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda tmp.c -Xopenmp-target -march=sm_60
ptxas /tmp/thread_limit-5cc9b6.s, line 1792; warning : Instruction 'vote' without '.sync' is deprecated since PTX ISA version 6.0 and will be discontinued in a future PTX ISA version

Kelvin
Any other comments
Reported by Kelvin Li

@josemonsalve2 josemonsalve2 added the new test PR that adds new tests label Feb 12, 2020
@josemonsalve2 josemonsalve2 self-assigned this Feb 12, 2020
@spophale
Copy link
Collaborator

It compiles and runs fine on Summit with xlc, if that helps. Value of t is inconsistent across runs.

@Rombur
Copy link

Rombur commented Feb 20, 2020

For info, I have seen this error several times with different codes when using clang-9 but it works with clang-10. I can compile and run the test on Volta.

@josemonsalve2
Copy link
Contributor Author

Good to know @Rombur, I have yet to install LLVM10 in our testing systems, my understanding is that they are still in RC2? Anyway, we will make sure we have the test for any future regression testing.

@Rombur
Copy link

Rombur commented Feb 20, 2020

Yes, LLVM 10 is still in RC but LLVM 9 was not working for me so I had to do an early upgrade.

@tmh97
Copy link
Contributor

tmh97 commented Mar 3, 2022

@spophale Compiles but provides inconsistent results for gcc and clang, fails at runtime with nvidia

@spophale
Copy link
Collaborator

spophale commented Mar 3, 2022

What is the RT error msg ?

@tmh97
Copy link
Contributor

tmh97 commented Mar 3, 2022

@spophale the compile time error is: warning: variable "t" was set but never used
the runtime error is timeout: the monitored command dumped core

@nolanbaker31 nolanbaker31 changed the title Code from LLVM OpenMP Mailing list Nested 'Parallel Reduction' case Sep 20, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
new test PR that adds new tests
Projects
None yet
5 participants