This repository has been archived by the owner on Mar 21, 2024. It is now read-only.
libcu++ 1.7.0 (CUDA Toolkit 11.6)
libcu++ 1.7.0 (CUDA Toolkit 11.6)
libcu++ 1.7.0 is a major release. It adds cuda::std::atomic_ref
for integral
types. cuda::std::atomic_ref
may potentially replace uses of CUDA specific
atomicOperator(_Scope)
calls and provides a singular API for host and device code.
Supported ABI Versions: 4 (default), 3, and 2.
New Features
- #203 Implements
cuda::std::atomic_ref
for integral types.
Issues Fixed
- #204: Fallback macro backend in
<nv/target>
when C or pre-C++11 dialects are used. - #206: Fix compilation with ASAN enabled.
- Thanks Janusz Lisiecki for this contribution.
- #207: Fix compilation of
<cuda/std/atomic>
for GCC/Clang. - #208: Flip an internal directory symlink, fixes packaging issues for internal tools.
- #212: Fix
<nv/target>
on MSVC, fallback macros would always choose pre-C++11 backend. - #216: Annotated Pointer documentation.
- Thanks Gonzalo Brito for this contribution.
- #215: Add SM87 awareness to
<nv/target>
. - #217: Fix how CUDACC version is calculated for
__int128
support. - #228: Fix LLVM lit pattern matching in test score calculation.
- #227: Silence 4296 for type_traits.
- #225: Fix calculation of
_LIBCUDACXX_CUDACC_VER
broken from #217.- Thanks Robert Maynard for this contribution.
- #220:
memcpy_async
should cache only in L2 when possible. - #219: Change
atomic/atomic_ref
ctors to prevent copy construction.