Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deprecate and replace thrust::cuda_cub iterators #3422

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Jan 16, 2025

  • No SASS changes for thrust.test.count on SM86 except kernel symbol names
  • No SASS changes for thrust.test.min_element on SM86 except kernel symbol names
  • No SASS changes for thrust.test.find on SM86 except kernel symbol names
  • No SASS changes for thrust.test.inner_product on SM86 except kernel symbol names
  • SASS changes for thrust.test.mismatch on SM86: kernel symbol names and addresses into constant memory, but same instructions
  • No SASS changes for thrust.test.transform_scan on SM86 except kernel symbol names

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner January 16, 2025 15:01
@bernhardmgruber bernhardmgruber added thrust For all items related to Thrust. backport branch/2.8.x labels Jan 16, 2025
Copy link
Contributor

🟨 CI finished in 2h 01m: Pass: 97%/78 | Total: 2d 08h | Avg: 43m 27s | Max: 1h 15m | Hits: 230%/12720
  • 🟨 cub: Pass: 97%/38 | Total: 1d 08h | Avg: 51m 03s | Max: 1h 13m | Hits: 338%/3540

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/36  | Total:  1d 06h | Avg: 50m 47s | Max:  1h 13m | Hits: 338%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  1h 51m | Avg: 55m 48s | Max: 57m 03s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  4h 44m | Avg: 56m 56s | Max:  1h 02m | Hits: 338%/885   
      🟩 12.5               Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🔍 12.6               Pass:  96%/31  | Total:  1d 01h | Avg: 49m 19s | Max:  1h 13m | Hits: 338%/2655  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 03m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 44m | Avg: 56m 56s | Max:  1h 02m | Hits: 338%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🔍 nvcc12.6           Pass:  96%/29  | Total: 23h 29m | Avg: 48m 35s | Max:  1h 13m | Hits: 338%/2655  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 03m
      🔍 nvcc               Pass:  97%/36  | Total:  1d 06h | Avg: 50m 33s | Max:  1h 13m | Hits: 338%/3540  
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  3h 42m | Avg: 55m 31s | Max: 57m 55s
      🟩 Clang15            Pass: 100%/1   | Total: 56m 16s | Avg: 56m 16s | Max: 56m 16s
      🟩 Clang16            Pass: 100%/1   | Total: 57m 54s | Avg: 57m 54s | Max: 57m 54s
      🟩 Clang17            Pass: 100%/1   | Total: 55m 06s | Avg: 55m 06s | Max: 55m 06s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 58m | Avg: 51m 11s | Max:  1h 03m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 09s | Max: 57m 43s
      🟩 GCC8               Pass: 100%/1   | Total: 59m 02s | Avg: 59m 02s | Max: 59m 02s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max:  1h 00m
      🟩 GCC10              Pass: 100%/1   | Total: 58m 19s | Avg: 58m 19s | Max: 58m 19s
      🟩 GCC11              Pass: 100%/1   | Total: 52m 33s | Avg: 52m 33s | Max: 52m 33s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 47m | Avg: 35m 41s | Max:  1h 00m
      🔍 GCC13              Pass:  87%/8   | Total:  4h 39m | Avg: 34m 56s | Max: 57m 03s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 10m | Hits: 338%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 13m | Hits: 338%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/14  | Total: 12h 29m | Avg: 53m 33s | Max:  1h 03m
      🔍 GCC                Pass:  94%/18  | Total: 13h 06m | Avg: 43m 41s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 37m | Avg:  1h 09m | Max:  1h 13m | Hits: 338%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 46m 16s | Avg: 23m 08s | Max: 26m 30s
      🔍 v100               Pass:  97%/36  | Total:  1d 07h | Avg: 52m 36s | Max:  1h 13m | Hits: 338%/3540  
    🔍 jobs: TestGPU 🔍
      🟩 Build              Pass: 100%/31  | Total:  1d 05h | Avg: 56m 36s | Max:  1h 13m | Hits: 338%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 28m 08s | Avg: 28m 08s | Max: 28m 08s
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 30s | Avg: 24m 30s | Max: 24m 30s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 30m | Avg: 30m 19s | Max: 39m 30s
      🔍 TestGPU            Pass:  50%/2   | Total: 41m 46s | Avg: 20m 53s | Max: 37m 08s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/14  | Total: 13h 52m | Avg: 59m 27s | Max:  1h 11m | Hits: 338%/2655  
      🔍 20                 Pass:  95%/24  | Total: 18h 27m | Avg: 46m 09s | Max:  1h 13m | Hits: 337%/885   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 46m 16s | Avg: 23m 08s | Max: 26m 30s
      🟩 90a                Pass: 100%/1   | Total: 24m 11s | Avg: 24m 11s | Max: 24m 11s
    
  • 🟨 thrust: Pass: 97%/37 | Total: 23h 10m | Avg: 37m 35s | Max: 1h 15m | Hits: 188%/9180

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/35  | Total: 22h 03m | Avg: 37m 49s | Max:  1h 15m | Hits: 188%/9180  
      🟩 arm64              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 31s | Max: 34m 22s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  3h 23m | Avg: 40m 45s | Max:  1h 06m | Hits: 140%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
      🔍 12.6               Pass:  96%/30  | Total: 17h 15m | Avg: 34m 31s | Max:  1h 15m | Hits: 200%/7344  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 59m 01s | Avg: 29m 30s | Max: 30m 24s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 23m | Avg: 40m 45s | Max:  1h 06m | Hits: 140%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
      🔍 nvcc12.6           Pass:  96%/28  | Total: 16h 16m | Avg: 34m 53s | Max:  1h 15m | Hits: 200%/7344  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 59m 01s | Avg: 29m 30s | Max: 30m 24s
      🔍 nvcc               Pass:  97%/35  | Total: 22h 11m | Avg: 38m 02s | Max:  1h 15m | Hits: 188%/9180  
    🔍 cxx: GCC13 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 18s | Max: 35m 47s
      🟩 Clang15            Pass: 100%/1   | Total: 33m 31s | Avg: 33m 31s | Max: 33m 31s
      🟩 Clang16            Pass: 100%/1   | Total: 39m 39s | Avg: 39m 39s | Max: 39m 39s
      🟩 Clang17            Pass: 100%/1   | Total: 37m 58s | Avg: 37m 58s | Max: 37m 58s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 03m | Avg: 26m 15s | Max: 36m 32s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 07m | Avg: 33m 35s | Max: 34m 03s
      🟩 GCC8               Pass: 100%/1   | Total: 38m 25s | Avg: 38m 25s | Max: 38m 25s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 28s | Max: 35m 46s
      🟩 GCC10              Pass: 100%/1   | Total: 36m 21s | Avg: 36m 21s | Max: 36m 21s
      🟩 GCC11              Pass: 100%/1   | Total: 37m 14s | Avg: 37m 14s | Max: 37m 14s
      🟩 GCC12              Pass: 100%/1   | Total: 41m 06s | Avg: 41m 06s | Max: 41m 06s
      🔍 GCC13              Pass:  87%/8   | Total:  3h 21m | Avg: 25m 09s | Max: 35m 35s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 06m | Hits: 145%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 01m | Avg:  1h 00m | Max:  1h 15m | Hits: 216%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/14  | Total:  7h 12m | Avg: 30m 52s | Max: 39m 39s
      🔍 GCC                Pass:  93%/16  | Total:  8h 12m | Avg: 30m 46s | Max: 41m 06s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 14m | Avg:  1h 02m | Max:  1h 15m | Hits: 188%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
    🔍 jobs: TestGPU 🔍
      🟩 Build              Pass: 100%/31  | Total: 21h 33m | Avg: 41m 44s | Max:  1h 15m | Hits: 143%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 30s | Avg: 16m 30s | Max: 34m 18s | Hits: 365%/1836  
      🔍 TestGPU            Pass:  66%/3   | Total: 47m 17s | Avg: 15m 45s | Max: 17m 15s
    🟨 cmake_options
      🟨 -DTHRUST_DISPATCH_TYPE=Force32bit Pass:  50%/2   | Total: 49m 14s | Avg: 24m 37s | Max: 31m 59s
    🟨 gpu
      🟨 v100               Pass:  97%/37  | Total: 23h 10m | Avg: 37m 35s | Max:  1h 15m | Hits: 188%/9180  
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 22m 01s | Avg: 22m 01s | Max: 22m 01s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 10h 28m | Avg: 44m 52s | Max:  1h 15m | Hits: 145%/5508  
      🟩 20                 Pass: 100%/21  | Total: 11h 53m | Avg: 33m 57s | Max:  1h 15m | Hits: 252%/3672  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 57s | Avg: 5m 58s | Max: 9m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
      🟩 Test               Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s
    
  • 🟩 python: Pass: 100%/1 | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 78)

# Runner
53 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Copy link
Contributor

🟩 CI finished in 3h 24m: Pass: 100%/78 | Total: 2d 08h | Avg: 43m 38s | Max: 1h 15m | Hits: 230%/12720
  • 🟩 cub: Pass: 100%/38 | Total: 1d 08h | Avg: 51m 34s | Max: 1h 13m | Hits: 338%/3540

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total:  1d 06h | Avg: 51m 19s | Max:  1h 13m | Hits: 338%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  1h 51m | Avg: 55m 48s | Max: 57m 03s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 44m | Avg: 56m 56s | Max:  1h 02m | Hits: 338%/885   
      🟩 12.5               Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🟩 12.6               Pass: 100%/31  | Total:  1d 01h | Avg: 49m 56s | Max:  1h 13m | Hits: 338%/2655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 03m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 44m | Avg: 56m 56s | Max:  1h 02m | Hits: 338%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
      🟩 nvcc12.6           Pass: 100%/29  | Total: 23h 48m | Avg: 49m 15s | Max:  1h 13m | Hits: 338%/2655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 03m
      🟩 nvcc               Pass: 100%/36  | Total:  1d 06h | Avg: 51m 06s | Max:  1h 13m | Hits: 338%/3540  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 42m | Avg: 55m 31s | Max: 57m 55s
      🟩 Clang15            Pass: 100%/1   | Total: 56m 16s | Avg: 56m 16s | Max: 56m 16s
      🟩 Clang16            Pass: 100%/1   | Total: 57m 54s | Avg: 57m 54s | Max: 57m 54s
      🟩 Clang17            Pass: 100%/1   | Total: 55m 06s | Avg: 55m 06s | Max: 55m 06s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 58m | Avg: 51m 11s | Max:  1h 03m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 09s | Max: 57m 43s
      🟩 GCC8               Pass: 100%/1   | Total: 59m 02s | Avg: 59m 02s | Max: 59m 02s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max:  1h 00m
      🟩 GCC10              Pass: 100%/1   | Total: 58m 19s | Avg: 58m 19s | Max: 58m 19s
      🟩 GCC11              Pass: 100%/1   | Total: 52m 33s | Avg: 52m 33s | Max: 52m 33s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 47m | Avg: 35m 41s | Max:  1h 00m
      🟩 GCC13              Pass: 100%/8   | Total:  4h 58m | Avg: 37m 21s | Max: 57m 03s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 10m | Hits: 338%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 13m | Hits: 338%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total: 12h 29m | Avg: 53m 33s | Max:  1h 03m
      🟩 GCC                Pass: 100%/18  | Total: 13h 25m | Avg: 44m 45s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 37m | Avg:  1h 09m | Max:  1h 13m | Hits: 338%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 46m 16s | Avg: 23m 08s | Max: 26m 30s
      🟩 v100               Pass: 100%/36  | Total:  1d 07h | Avg: 53m 08s | Max:  1h 13m | Hits: 338%/3540  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  1d 05h | Avg: 56m 36s | Max:  1h 13m | Hits: 338%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 28m 08s | Avg: 28m 08s | Max: 28m 08s
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 30s | Avg: 24m 30s | Max: 24m 30s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 30m | Avg: 30m 19s | Max: 39m 30s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 01m | Avg: 30m 36s | Max: 37m 08s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 46m 16s | Avg: 23m 08s | Max: 26m 30s
      🟩 90a                Pass: 100%/1   | Total: 24m 11s | Avg: 24m 11s | Max: 24m 11s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 13h 52m | Avg: 59m 27s | Max:  1h 11m | Hits: 338%/2655  
      🟩 20                 Pass: 100%/24  | Total: 18h 47m | Avg: 46m 58s | Max:  1h 13m | Hits: 337%/885   
    
  • 🟩 thrust: Pass: 100%/37 | Total: 23h 06m | Avg: 37m 28s | Max: 1h 15m | Hits: 188%/9180

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 45m 01s | Avg: 22m 30s | Max: 31m 59s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total: 21h 59m | Avg: 37m 41s | Max:  1h 15m | Hits: 188%/9180  
      🟩 arm64              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 31s | Max: 34m 22s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 23m | Avg: 40m 45s | Max:  1h 06m | Hits: 140%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
      🟩 12.6               Pass: 100%/30  | Total: 17h 11m | Avg: 34m 23s | Max:  1h 15m | Hits: 200%/7344  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 59m 01s | Avg: 29m 30s | Max: 30m 24s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 23m | Avg: 40m 45s | Max:  1h 06m | Hits: 140%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
      🟩 nvcc12.6           Pass: 100%/28  | Total: 16h 12m | Avg: 34m 44s | Max:  1h 15m | Hits: 200%/7344  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 59m 01s | Avg: 29m 30s | Max: 30m 24s
      🟩 nvcc               Pass: 100%/35  | Total: 22h 07m | Avg: 37m 55s | Max:  1h 15m | Hits: 188%/9180  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 17m | Avg: 34m 18s | Max: 35m 47s
      🟩 Clang15            Pass: 100%/1   | Total: 33m 31s | Avg: 33m 31s | Max: 33m 31s
      🟩 Clang16            Pass: 100%/1   | Total: 39m 39s | Avg: 39m 39s | Max: 39m 39s
      🟩 Clang17            Pass: 100%/1   | Total: 37m 58s | Avg: 37m 58s | Max: 37m 58s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 03m | Avg: 26m 15s | Max: 36m 32s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 07m | Avg: 33m 35s | Max: 34m 03s
      🟩 GCC8               Pass: 100%/1   | Total: 38m 25s | Avg: 38m 25s | Max: 38m 25s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 28s | Max: 35m 46s
      🟩 GCC10              Pass: 100%/1   | Total: 36m 21s | Avg: 36m 21s | Max: 36m 21s
      🟩 GCC11              Pass: 100%/1   | Total: 37m 14s | Avg: 37m 14s | Max: 37m 14s
      🟩 GCC12              Pass: 100%/1   | Total: 41m 06s | Avg: 41m 06s | Max: 41m 06s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 17m | Avg: 24m 37s | Max: 35m 35s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 06m | Hits: 145%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 01m | Avg:  1h 00m | Max:  1h 15m | Hits: 216%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  7h 12m | Avg: 30m 52s | Max: 39m 39s
      🟩 GCC                Pass: 100%/16  | Total:  8h 08m | Avg: 30m 31s | Max: 41m 06s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 14m | Avg:  1h 02m | Max:  1h 15m | Hits: 188%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 15m
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total: 23h 06m | Avg: 37m 28s | Max:  1h 15m | Hits: 188%/9180  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total: 21h 33m | Avg: 41m 44s | Max:  1h 15m | Hits: 143%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 49m 30s | Avg: 16m 30s | Max: 34m 18s | Hits: 365%/1836  
      🟩 TestGPU            Pass: 100%/3   | Total: 43m 04s | Avg: 14m 21s | Max: 16m 59s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 22m 01s | Avg: 22m 01s | Max: 22m 01s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 10h 28m | Avg: 44m 52s | Max:  1h 15m | Hits: 145%/5508  
      🟩 20                 Pass: 100%/21  | Total: 11h 53m | Avg: 33m 57s | Max:  1h 15m | Hits: 252%/3672  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 57s | Avg: 5m 58s | Max: 9m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 57s | Avg:  5m 58s | Max:  9m 55s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 02s | Avg:  2m 02s | Max:  2m 02s
      🟩 Test               Pass: 100%/1   | Total:  9m 55s | Avg:  9m 55s | Max:  9m 55s
    
  • 🟩 python: Pass: 100%/1 | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 46m 26s | Avg: 46m 26s | Max: 46m 26s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 78)

# Runner
53 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

thrust/thrust/system/cuda/detail/find.h Outdated Show resolved Hide resolved
Copy link
Contributor

🟩 CI finished in 1h 59m: Pass: 100%/78 | Total: 2d 06h | Avg: 41m 55s | Max: 1h 12m | Hits: 247%/12720
  • 🟩 cub: Pass: 100%/38 | Total: 1d 09h | Avg: 52m 10s | Max: 1h 12m | Hits: 339%/3540

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total:  1d 07h | Avg: 51m 57s | Max:  1h 12m | Hits: 339%/3540  
      🟩 arm64              Pass: 100%/2   | Total:  1h 52m | Avg: 56m 03s | Max: 56m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 01m | Avg:  1h 00m | Max:  1h 11m | Hits: 339%/885   
      🟩 12.5               Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 10m
      🟩 12.6               Pass: 100%/31  | Total:  1d 01h | Avg: 49m 47s | Max:  1h 12m | Hits: 339%/2655  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 01m | Avg:  1h 00m | Max:  1h 11m | Hits: 339%/885   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 10m
      🟩 nvcc12.6           Pass: 100%/29  | Total: 23h 41m | Avg: 49m 01s | Max:  1h 12m | Hits: 339%/2655  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 03m
      🟩 nvcc               Pass: 100%/36  | Total:  1d 07h | Avg: 51m 41s | Max:  1h 12m | Hits: 339%/3540  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 52m | Avg: 58m 04s | Max: 59m 55s
      🟩 Clang15            Pass: 100%/1   | Total: 58m 51s | Avg: 58m 51s | Max: 58m 51s
      🟩 Clang16            Pass: 100%/1   | Total: 53m 52s | Avg: 53m 52s | Max: 53m 52s
      🟩 Clang17            Pass: 100%/1   | Total: 54m 24s | Avg: 54m 24s | Max: 54m 24s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 31m | Avg: 47m 18s | Max:  1h 03m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 50m | Avg: 55m 08s | Max: 55m 42s
      🟩 GCC8               Pass: 100%/1   | Total: 59m 15s | Avg: 59m 15s | Max: 59m 15s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 50m | Avg: 55m 09s | Max: 55m 45s
      🟩 GCC10              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC11              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 GCC12              Pass: 100%/3   | Total:  1h 44m | Avg: 34m 54s | Max: 59m 08s
      🟩 GCC13              Pass: 100%/8   | Total:  5h 26m | Avg: 40m 50s | Max: 59m 30s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits: 339%/1770  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 12m | Hits: 339%/1770  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 10m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total: 12h 10m | Avg: 52m 10s | Max:  1h 03m
      🟩 GCC                Pass: 100%/18  | Total: 13h 51m | Avg: 46m 11s | Max:  1h 00m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 43m | Avg:  1h 10m | Max:  1h 12m | Hits: 339%/3540  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 10m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 45m 36s | Avg: 22m 48s | Max: 26m 12s
      🟩 v100               Pass: 100%/36  | Total:  1d 08h | Avg: 53m 48s | Max:  1h 12m | Hits: 339%/3540  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  1d 05h | Avg: 57m 38s | Max:  1h 12m | Hits: 339%/3540  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 37m 16s | Avg: 37m 16s | Max: 37m 16s
      🟩 GraphCapture       Pass: 100%/1   | Total: 28m 29s | Avg: 28m 29s | Max: 28m 29s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 17m | Avg: 25m 51s | Max: 34m 55s
      🟩 TestGPU            Pass: 100%/2   | Total: 52m 43s | Avg: 26m 21s | Max: 27m 37s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 45m 36s | Avg: 22m 48s | Max: 26m 12s
      🟩 90a                Pass: 100%/1   | Total: 24m 35s | Avg: 24m 35s | Max: 24m 35s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 14h 03m | Avg:  1h 00m | Max:  1h 12m | Hits: 339%/2655  
      🟩 20                 Pass: 100%/24  | Total: 18h 58m | Avg: 47m 27s | Max:  1h 12m | Hits: 338%/885   
    
  • 🟩 thrust: Pass: 100%/37 | Total: 20h 34m | Avg: 33m 21s | Max: 1h 04m | Hits: 211%/9180

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 40m 51s | Avg: 20m 25s | Max: 28m 59s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total: 19h 32m | Avg: 33m 29s | Max:  1h 04m | Hits: 211%/9180  
      🟩 arm64              Pass: 100%/2   | Total:  1h 02m | Avg: 31m 06s | Max: 33m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 10m | Avg: 38m 10s | Max:  1h 00m | Hits: 173%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max: 59m 04s
      🟩 12.6               Pass: 100%/30  | Total: 15h 28m | Avg: 30m 56s | Max:  1h 04m | Hits: 221%/7344  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 56m 48s | Avg: 28m 24s | Max: 28m 53s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 10m | Avg: 38m 10s | Max:  1h 00m | Hits: 173%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max: 59m 04s
      🟩 nvcc12.6           Pass: 100%/28  | Total: 14h 31m | Avg: 31m 06s | Max:  1h 04m | Hits: 221%/7344  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 56m 48s | Avg: 28m 24s | Max: 28m 53s
      🟩 nvcc               Pass: 100%/35  | Total: 19h 37m | Avg: 33m 38s | Max:  1h 04m | Hits: 211%/9180  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 14s | Max: 33m 17s
      🟩 Clang15            Pass: 100%/1   | Total: 34m 36s | Avg: 34m 36s | Max: 34m 36s
      🟩 Clang16            Pass: 100%/1   | Total: 31m 54s | Avg: 31m 54s | Max: 31m 54s
      🟩 Clang17            Pass: 100%/1   | Total: 32m 09s | Avg: 32m 09s | Max: 32m 09s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 48m | Avg: 24m 07s | Max: 32m 12s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 04m | Avg: 32m 05s | Max: 33m 02s
      🟩 GCC8               Pass: 100%/1   | Total: 30m 12s | Avg: 30m 12s | Max: 30m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 10m | Avg: 35m 08s | Max: 35m 23s
      🟩 GCC10              Pass: 100%/1   | Total: 34m 51s | Avg: 34m 51s | Max: 34m 51s
      🟩 GCC11              Pass: 100%/1   | Total: 31m 54s | Avg: 31m 54s | Max: 31m 54s
      🟩 GCC12              Pass: 100%/1   | Total: 35m 53s | Avg: 35m 53s | Max: 35m 53s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 57m | Avg: 22m 10s | Max: 33m 44s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 04m | Hits: 173%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 37m | Avg: 52m 21s | Max:  1h 03m | Hits: 237%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max: 59m 04s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  6h 32m | Avg: 28m 02s | Max: 34m 36s
      🟩 GCC                Pass: 100%/16  | Total:  7h 24m | Avg: 27m 47s | Max: 35m 53s
      🟩 MSVC               Pass: 100%/5   | Total:  4h 41m | Avg: 56m 21s | Max:  1h 04m | Hits: 211%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 55m | Avg: 57m 47s | Max: 59m 04s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total: 20h 34m | Avg: 33m 21s | Max:  1h 04m | Hits: 211%/9180  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total: 19h 06m | Avg: 36m 59s | Max:  1h 04m | Hits: 173%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 51m 59s | Avg: 17m 19s | Max: 36m 29s | Hits: 365%/1836  
      🟩 TestGPU            Pass: 100%/3   | Total: 35m 42s | Avg: 11m 54s | Max: 12m 51s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 19m 12s | Avg: 19m 12s | Max: 19m 12s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  9h 19m | Avg: 39m 55s | Max:  1h 04m | Hits: 173%/5508  
      🟩 20                 Pass: 100%/21  | Total: 10h 34m | Avg: 30m 13s | Max:  1h 03m | Hits: 269%/3672  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 53s | Avg: 4m 56s | Max: 8m 03s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 53s | Avg:  4m 56s | Max:  8m 03s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 50s | Avg:  1m 50s | Max:  1m 50s
      🟩 Test               Pass: 100%/1   | Total:  8m 03s | Avg:  8m 03s | Max:  8m 03s
    
  • 🟩 python: Pass: 100%/1 | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 43m 12s | Avg: 43m 12s | Max: 43m 12s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 78)

# Runner
53 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
9 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber
Copy link
Contributor Author

Do we need to add a benchmark for thrust.test.mismatch? Some of the addresses into constant memory changed, but the instructions are identical.

@bernhardmgruber
Copy link
Contributor Author

Do we need to add a benchmark for thrust.test.mismatch? Some of the addresses into constant memory changed, but the instructions are identical.

@gevtushenko says it's fine, we can proceed without benchmark.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
backport branch/2.8.x thrust For all items related to Thrust.
Projects
Status: In Progress
Development

Successfully merging this pull request may close these issues.

2 participants