Skip to content

Commit

Permalink
Kokkos: disable failing CUDA+DEBUG test
Browse files Browse the repository at this point in the history
This test requests a hardcoded number of
32 CUDA threads per warp, but with debugging
enabled the CUDA kernel uses too many registers
and can only run on 16 threads per warp max.
[kokkos/kokkos#1514, kokkos/kokkos#1513, #2471]
  • Loading branch information
ibaned committed Apr 2, 2018
1 parent d191960 commit ca2465c
Showing 1 changed file with 9 additions and 0 deletions.
9 changes: 9 additions & 0 deletions packages/kokkos/core/unit_test/TestTeamVector.hpp
Original file line number Diff line number Diff line change
Expand Up @@ -900,8 +900,17 @@ TEST_F( TEST_CATEGORY, team_vector )
#ifndef SKIP_TEST
TEST_F( TEST_CATEGORY, triple_nested_parallelism )
{
// With KOKKOS_DEBUG enabled, the functor uses too many registers to run
// with a team size of 32 on GPUs, 16 is the max possible (at least on a K80 GPU)
// See https://github.com/kokkos/kokkos/issues/1513
#if defined(KOKKOS_DEBUG)
if (!std::is_same<TEST_EXECSPACE, Kokkos::Cuda>::value) {
#endif
TestTripleNestedReduce< double, TEST_EXECSPACE >( 8192, 2048, 32, 32 );
TestTripleNestedReduce< double, TEST_EXECSPACE >( 8192, 2048, 32, 16 );
#if defined(KOKKOS_DEBUG)
}
#endif
TestTripleNestedReduce< double, TEST_EXECSPACE >( 8192, 2048, 16, 16 );
}
#endif
Expand Down

0 comments on commit ca2465c

Please sign in to comment.