CUDA atomic_fetch_sub for doubles is hitting CAS instead of intrinsic #1624
Labels
Bug
Broken / incorrect code; it could be Kokkos' responsibility, or others’ (e.g., Trilinos)
Milestone
This manifested in LAMMPS and ExaMiniMD where atomic_fetch_sub was exercised via the -= operator of an atomic view.
The text was updated successfully, but these errors were encountered: