Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add support for NVFP8/6/4 in <cuda/std/cmath> pt. 1 #3843

Open
wants to merge 15 commits into
base: main
Choose a base branch
from

Conversation

davebayer
Copy link
Contributor

This PR implements several functions from <cuda/std/cmath> for NVFP8/6/4 types.

@davebayer davebayer requested a review from a team as a code owner February 18, 2025 15:53
@davebayer davebayer requested a review from wmaxey February 18, 2025 15:53
Copy link

copy-pr-bot bot commented Feb 18, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@miscco
Copy link
Collaborator

miscco commented Feb 18, 2025

/ok to test

Copy link
Collaborator

@miscco miscco left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some nits, although I am unsure about the __nv prefix

#endif // _LIBCUDACXX_HAS_NVFP16

#if defined(_LIBCUDACXX_HAS_NVBF16)
_CCCL_NODISCARD _LIBCUDACXX_HIDE_FROM_ABI constexpr bool isinf(__nv_bfloat16 __x) noexcept
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The extended floating point types are not literal types so we cannot mark these functions as constexpr

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what prevents __nv_bfloat16 to be used here? __nv_bfloat16 can be constructed in a constexpr function by using __nv_bfloat16_raw. Both default and copy ctors are constexpr if __CPP_VERSION_AT_LEAST_11_BF16 is defined

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The issue is that we should test with the conversion and arithmetic operations disabled because our code must work without them

Copy link
Contributor

🟨 CI finished in 1h 28m: Pass: 30%/158 | Total: 2d 16h | Avg: 24m 28s | Max: 1h 19m | Hits: 36%/60534
  • 🟨 libcudacxx: Pass: 2%/43 | Total: 14h 16m | Avg: 19m 55s | Max: 36m 53s

    🟨 jobs
      🟥 Build              Pass:   0%/37  | Total: 13h 43m | Avg: 22m 14s | Max: 36m 53s
      🟥 NVRTC              Pass:   0%/2   | Total: 31m 26s | Avg: 15m 43s | Max: 16m 35s
      🟥 Test               Pass:   0%/3  
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 15s | Avg:  2m 15s | Max:  2m 15s
    🟨 cpu
      🟨 amd64              Pass:   2%/41  | Total: 13h 31m | Avg: 19m 47s | Max: 36m 53s
      🟥 arm64              Pass:   0%/2   | Total: 45m 20s | Avg: 22m 40s | Max: 22m 53s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  1h 33m | Avg: 18m 45s | Max: 21m 36s
      🟥 12.5               Pass:   0%/2   | Total:  1h 12m | Avg: 36m 06s | Max: 36m 53s
      🟨 12.8               Pass:   2%/36  | Total: 11h 30m | Avg: 19m 11s | Max: 31m 49s
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total: 45m 24s | Avg: 22m 42s | Max: 23m 15s
      🟥 nvcc12.0           Pass:   0%/5   | Total:  1h 33m | Avg: 18m 45s | Max: 21m 36s
      🟥 nvcc12.5           Pass:   0%/2   | Total:  1h 12m | Avg: 36m 06s | Max: 36m 53s
      🟨 nvcc12.8           Pass:   2%/34  | Total: 10h 45m | Avg: 18m 59s | Max: 31m 49s
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total: 45m 24s | Avg: 22m 42s | Max: 23m 15s
      🟨 nvcc               Pass:   2%/41  | Total: 13h 31m | Avg: 19m 47s | Max: 36m 53s
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total:  1h 33m | Avg: 23m 18s | Max: 26m 41s
      🟥 Clang15            Pass:   0%/2   | Total: 49m 30s | Avg: 24m 45s | Max: 25m 51s
      🟥 Clang16            Pass:   0%/2   | Total: 53m 00s | Avg: 26m 30s | Max: 28m 01s
      🟥 Clang17            Pass:   0%/2   | Total: 47m 32s | Avg: 23m 46s | Max: 24m 19s
      🟥 Clang18            Pass:   0%/6   | Total:  2h 00m | Avg: 20m 06s | Max: 27m 52s
      🟥 GCC7               Pass:   0%/2   | Total: 22m 50s | Avg: 11m 25s | Max: 20m 04s
      🟥 GCC8               Pass:   0%/1   | Total:  2m 46s | Avg:  2m 46s | Max:  2m 46s
      🟥 GCC9               Pass:   0%/2   | Total: 44m 03s | Avg: 22m 01s | Max: 24m 01s
      🟥 GCC10              Pass:   0%/2   | Total: 48m 35s | Avg: 24m 17s | Max: 26m 21s
      🟥 GCC11              Pass:   0%/2   | Total: 46m 23s | Avg: 23m 11s | Max: 24m 05s
      🟥 GCC12              Pass:   0%/2   | Total: 47m 57s | Avg: 23m 58s | Max: 24m 30s
      🟨 GCC13              Pass:  10%/10  | Total:  2h 37m | Avg: 15m 43s | Max: 31m 49s
      🟥 MSVC14.29          Pass:   0%/2   | Total: 25m 17s | Avg: 12m 38s | Max: 12m 48s
      🟥 MSVC14.42          Pass:   0%/2   | Total: 25m 40s | Avg: 12m 50s | Max: 12m 53s
      🟥 NVHPC24.7          Pass:   0%/2   | Total:  1h 12m | Avg: 36m 06s | Max: 36m 53s
    🟨 cxx_family
      🟥 Clang              Pass:   0%/16  | Total:  6h 03m | Avg: 22m 44s | Max: 28m 01s
      🟨 GCC                Pass:   4%/21  | Total:  6h 09m | Avg: 17m 36s | Max: 31m 49s
      🟥 MSVC               Pass:   0%/4   | Total: 50m 57s | Avg: 12m 44s | Max: 12m 53s
      🟥 NVHPC              Pass:   0%/2   | Total:  1h 12m | Avg: 36m 06s | Max: 36m 53s
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total: 21m 13s | Avg: 10m 36s | Max: 21m 13s
      🟨 rtx2080            Pass:   2%/41  | Total: 13h 55m | Avg: 20m 22s | Max: 36m 53s
    🟥 sm
      🟥 75                 Pass:   0%/2   | Total: 31m 26s | Avg: 15m 43s | Max: 16m 35s
      🟥 90                 Pass:   0%/2   | Total: 21m 13s | Avg: 10m 36s | Max: 21m 13s
      🟥 90;90a;100         Pass:   0%/1   | Total: 31m 49s | Avg: 31m 49s | Max: 31m 49s
    🟥 std
      🟥 17                 Pass:   0%/21  | Total:  7h 01m | Avg: 20m 04s | Max: 35m 20s
      🟥 20                 Pass:   0%/21  | Total:  7h 13m | Avg: 20m 37s | Max: 36m 53s
    
  • 🟨 cub: Pass: 37%/45 | Total: 1d 07h | Avg: 41m 35s | Max: 1h 19m | Hits: 25%/20047

    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 05m | Hits:  14%/2090  
      🔍 nvcc               Pass:  34%/43  | Total:  1d 05h | Avg: 40m 30s | Max:  1h 19m | Hits:  27%/17957 
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  3h 24m | Avg: 40m 48s | Max:  1h 08m
      🟩 12.5               Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 19m | Hits:  10%/2236  
      🟨 12.8               Pass:  39%/38  | Total:  1d 01h | Avg: 39m 57s | Max:  1h 16m | Hits:  27%/17811 
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 05m | Hits:  14%/2090  
      🟥 nvcc12.0           Pass:   0%/5   | Total:  3h 24m | Avg: 40m 48s | Max:  1h 08m
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 19m | Hits:  10%/2236  
      🟨 nvcc12.8           Pass:  36%/36  | Total: 23h 07m | Avg: 38m 33s | Max:  1h 16m | Hits:  29%/15721 
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total:  3h 16m | Avg: 49m 00s | Max:  1h 05m | Hits:  16%/2422  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m | Hits:  16%/2418  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 58m | Avg: 59m 01s | Max: 59m 03s | Hits:  16%/2418  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  16%/2418  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 59m | Avg: 51m 24s | Max:  1h 05m | Hits:  40%/8135  
      🟥 GCC7               Pass:   0%/2   | Total:  1h 01m | Avg: 30m 53s | Max: 32m 08s
      🟥 GCC8               Pass:   0%/1   | Total: 33m 50s | Avg: 33m 50s | Max: 33m 50s
      🟥 GCC9               Pass:   0%/2   | Total:  1h 07m | Avg: 33m 56s | Max: 34m 02s
      🟥 GCC10              Pass:   0%/2   | Total:  1h 02m | Avg: 31m 03s | Max: 31m 52s
      🟥 GCC11              Pass:   0%/2   | Total:  1h 04m | Avg: 32m 25s | Max: 33m 49s
      🟥 GCC12              Pass:   0%/2   | Total:  1h 00m | Avg: 30m 14s | Max: 30m 49s
      🟥 GCC13              Pass:   0%/11  | Total:  2h 42m | Avg: 14m 47s | Max: 44m 16s
      🟥 MSVC14.29          Pass:   0%/2   | Total:  2h 17m | Avg:  1h 08m | Max:  1h 08m
      🟥 MSVC14.42          Pass:   0%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 16m
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 19m | Hits:  10%/2236  
    🟨 cxx_family
      🟨 Clang              Pass:  88%/17  | Total: 15h 28m | Avg: 54m 37s | Max:  1h 05m | Hits:  27%/17811 
      🟥 GCC                Pass:   0%/22  | Total:  8h 33m | Avg: 23m 20s | Max: 44m 16s
      🟥 MSVC               Pass:   0%/4   | Total:  4h 39m | Avg:  1h 09m | Max:  1h 16m
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 19m | Hits:  10%/2236  
    🟨 cpu
      🟨 amd64              Pass:  37%/43  | Total:  1d 05h | Avg: 41m 07s | Max:  1h 19m | Hits:  26%/18838 
      🟨 arm64              Pass:  50%/2   | Total:  1h 43m | Avg: 51m 36s | Max:  1h 01m | Hits:  16%/1209  
    🟨 gpu
      🟥 h100               Pass:   0%/3   | Total: 12m 07s | Avg:  4m 02s | Max: 12m 07s
      🟨 rtx2080            Pass:  41%/34  | Total:  1d 04h | Avg: 50m 37s | Max:  1h 19m | Hits:  15%/16420 
      🟨 rtxa6000           Pass:  37%/8   | Total:  2h 18m | Avg: 17m 17s | Max:  1h 01m | Hits:  72%/3627  
    🟨 jobs
      🟨 Build              Pass:  40%/37  | Total:  1d 06h | Avg: 49m 23s | Max:  1h 19m | Hits:  15%/17629 
      🟥 DeviceLaunch       Pass:   0%/1  
      🟥 GraphCapture       Pass:   0%/1  
      🟨 HostLaunch         Pass:  33%/3   | Total: 23m 13s | Avg:  7m 44s | Max: 23m 13s | Hits: 100%/1209  
      🟨 TestGPU            Pass:  33%/3   | Total: 20m 54s | Avg:  6m 58s | Max: 20m 54s | Hits: 100%/1209  
    🟥 sm
      🟥 90                 Pass:   0%/3   | Total: 12m 07s | Avg:  4m 02s | Max: 12m 07s
      🟥 90;90a;100         Pass:   0%/1   | Total: 44m 16s | Avg: 44m 16s | Max: 44m 16s
    🟨 std
      🟨 17                 Pass:  35%/20  | Total: 16h 16m | Avg: 48m 48s | Max:  1h 10m | Hits:  15%/8210  
      🟨 20                 Pass:  40%/25  | Total: 14h 55m | Avg: 35m 49s | Max:  1h 19m | Hits:  32%/11837 
    
  • 🟨 thrust: Pass: 44%/45 | Total: 15h 01m | Avg: 20m 02s | Max: 1h 07m | Hits: 44%/35614

    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total: 58m 27s | Avg: 29m 13s | Max: 30m 54s | Hits:  38%/3562  
      🔍 nvcc               Pass:  41%/43  | Total: 14h 03m | Avg: 19m 36s | Max:  1h 07m | Hits:  44%/32052 
    🟨 ctk
      🟨 12.0               Pass:  60%/5   | Total:  2h 03m | Avg: 24m 41s | Max: 48m 57s | Hits:  38%/5337  
      🟩 12.5               Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 07m | Hits:   0%/3562  
      🟨 12.8               Pass:  39%/38  | Total: 10h 47m | Avg: 17m 02s | Max: 53m 23s | Hits:  50%/26715 
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 58m 27s | Avg: 29m 13s | Max: 30m 54s | Hits:  38%/3562  
      🟨 nvcc12.0           Pass:  60%/5   | Total:  2h 03m | Avg: 24m 41s | Max: 48m 57s | Hits:  38%/5337  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 07m | Hits:   0%/3562  
      🟨 nvcc12.8           Pass:  36%/36  | Total:  9h 48m | Avg: 16m 21s | Max: 53m 23s | Hits:  52%/23153 
    🟨 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 08m | Avg: 32m 13s | Max: 32m 49s | Hits:  53%/7124  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 07m | Avg: 33m 47s | Max: 34m 27s | Hits:  38%/3562  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 17s | Max: 33m 39s | Hits:  38%/3562  
      🟩 Clang17            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 04s | Max: 33m 49s | Hits:  38%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 45m | Avg: 23m 38s | Max: 31m 39s | Hits:  59%/12467 
      🟥 GCC7               Pass:   0%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  5m 07s
      🟥 GCC8               Pass:   0%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
      🟥 GCC9               Pass:   0%/2   | Total:  9m 25s | Avg:  4m 42s | Max:  4m 44s
      🟥 GCC10              Pass:   0%/2   | Total:  8m 52s | Avg:  4m 26s | Max:  4m 33s
      🟥 GCC11              Pass:   0%/2   | Total:  9m 29s | Avg:  4m 44s | Max:  4m 50s
      🟥 GCC12              Pass:   0%/2   | Total:  8m 46s | Avg:  4m 23s | Max:  4m 31s
      🟥 GCC13              Pass:   0%/10  | Total: 25m 35s | Avg:  2m 33s | Max:  4m 53s
      🟨 MSVC14.29          Pass:  50%/2   | Total:  1h 37m | Avg: 48m 44s | Max: 48m 57s | Hits:  20%/1775  
      🟥 MSVC14.42          Pass:   0%/3   | Total:  1h 42m | Avg: 34m 11s | Max: 53m 23s
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 07m | Hits:   0%/3562  
    🟨 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 14m | Avg: 29m 06s | Max: 34m 27s | Hits:  50%/30277 
      🟥 GCC                Pass:   0%/21  | Total:  1h 16m | Avg:  3m 37s | Max:  5m 07s
      🟨 MSVC               Pass:  20%/5   | Total:  3h 20m | Avg: 40m 00s | Max: 53m 23s | Hits:  20%/1775  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 07m | Hits:   0%/3562  
    🟥 cmake_options
      🟥 -DTHRUST_DISPATCH_TYPE=Force32bit Pass:   0%/2   | Total:  3m 43s | Avg:  1m 51s | Max:  3m 43s
    🟨 cpu
      🟨 amd64              Pass:  44%/43  | Total: 14h 28m | Avg: 20m 11s | Max:  1h 07m | Hits:  44%/33833 
      🟨 arm64              Pass:  50%/2   | Total: 33m 23s | Avg: 16m 41s | Max: 28m 30s | Hits:  38%/1781  
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  3m 07s | Avg:  1m 33s | Max:  3m 07s
      🟨 rtx2080            Pass:  51%/33  | Total: 13h 07m | Avg: 23m 52s | Max:  1h 07m | Hits:  37%/30271 
      🟨 rtx4090            Pass:  30%/10  | Total:  1h 50m | Avg: 11m 04s | Max: 53m 23s | Hits:  83%/5343  
    🟨 jobs
      🟨 Build              Pass:  47%/38  | Total: 14h 44m | Avg: 23m 16s | Max:  1h 07m | Hits:  37%/32052 
      🟨 TestCPU            Pass:  33%/3   | Total:  7m 15s | Avg:  2m 25s | Max:  7m 15s | Hits: 100%/1781  
      🟨 TestGPU            Pass:  25%/4   | Total: 10m 02s | Avg:  2m 30s | Max: 10m 02s | Hits: 100%/1781  
    🟥 sm
      🟥 90                 Pass:   0%/2   | Total:  3m 07s | Avg:  1m 33s | Max:  3m 07s
      🟥 90;90a;100         Pass:   0%/1   | Total:  4m 34s | Avg:  4m 34s | Max:  4m 34s
    🟨 std
      🟨 17                 Pass:  45%/20  | Total:  7h 53m | Avg: 23m 41s | Max:  1h 02m | Hits:  37%/16023 
      🟨 20                 Pass:  47%/23  | Total:  7h 04m | Avg: 18m 26s | Max:  1h 07m | Hits:  49%/19591 
    
  • 🟨 cudax: Pass: 45%/22 | Total: 3h 51m | Avg: 10m 31s | Max: 18m 28s | Hits: 30%/4873

    🔍 ctk: 12.8 🔍
      🟩 12.0               Pass: 100%/1   | Total: 11m 05s | Avg: 11m 05s | Max: 11m 05s | Hits:  45%/262   
      🟩 12.5               Pass: 100%/2   | Total: 18m 50s | Avg:  9m 25s | Max:  9m 52s | Hits:  31%/710   
      🔍 12.8               Pass:  36%/19  | Total:  3h 21m | Avg: 10m 37s | Max: 18m 28s | Hits:  29%/3901  
    🔍 cudacxx: nvcc12.8 🔍
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 05s | Avg: 11m 05s | Max: 11m 05s | Hits:  45%/262   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 50s | Avg:  9m 25s | Max:  9m 52s | Hits:  31%/710   
      🔍 nvcc12.8           Pass:  36%/19  | Total:  3h 21m | Avg: 10m 37s | Max: 18m 28s | Hits:  29%/3901  
    🟨 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 15m 24s | Avg: 15m 24s | Max: 15m 24s | Hits:  29%/559   
      🟩 Clang15            Pass: 100%/1   | Total: 18m 28s | Avg: 18m 28s | Max: 18m 28s | Hits:  29%/557   
      🟩 Clang16            Pass: 100%/1   | Total: 16m 16s | Avg: 16m 16s | Max: 16m 16s | Hits:  29%/557   
      🟩 Clang17            Pass: 100%/1   | Total: 17m 40s | Avg: 17m 40s | Max: 17m 40s | Hits:  29%/557   
      🟨 Clang18            Pass:  75%/4   | Total: 57m 17s | Avg: 14m 19s | Max: 16m 47s | Hits:  29%/1671  
      🟥 GCC10              Pass:   0%/1   | Total:  9m 52s | Avg:  9m 52s | Max:  9m 52s
      🟥 GCC11              Pass:   0%/1   | Total:  8m 47s | Avg:  8m 47s | Max:  8m 47s
      🟥 GCC12              Pass:   0%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  9m 44s
      🟥 GCC13              Pass:   0%/6   | Total: 37m 40s | Avg:  6m 16s | Max:  8m 26s
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 05s | Avg: 11m 05s | Max: 11m 05s | Hits:  45%/262   
      🟥 MSVC14.42          Pass:   0%/1   | Total: 10m 37s | Avg: 10m 37s | Max: 10m 37s
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 50s | Avg:  9m 25s | Max:  9m 52s | Hits:  31%/710   
    🟨 cxx_family
      🟨 Clang              Pass:  87%/8   | Total:  2h 05m | Avg: 15m 38s | Max: 18m 28s | Hits:  29%/3901  
      🟥 GCC                Pass:   0%/10  | Total:  1h 06m | Avg:  6m 36s | Max:  9m 52s
      🟨 MSVC               Pass:  50%/2   | Total: 21m 42s | Avg: 10m 51s | Max: 11m 05s | Hits:  45%/262   
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 50s | Avg:  9m 25s | Max:  9m 52s | Hits:  31%/710   
    🟨 cudacxx_family
      🟨 nvcc               Pass:  45%/22  | Total:  3h 51m | Avg: 10m 31s | Max: 18m 28s | Hits:  30%/4873  
    🟨 cpu
      🟨 amd64              Pass:  44%/18  | Total:  3h 07m | Avg: 10m 24s | Max: 18m 28s | Hits:  31%/3759  
      🟨 arm64              Pass:  50%/4   | Total: 44m 22s | Avg: 11m 05s | Max: 14m 57s | Hits:  29%/1114  
    🟨 gpu
      🟥 h100               Pass:   0%/2   | Total:  7m 07s | Avg:  3m 33s | Max:  7m 07s
      🟨 rtx2080            Pass:  50%/20  | Total:  3h 44m | Avg: 11m 13s | Max: 18m 28s | Hits:  30%/4873  
    🟨 jobs
      🟨 Build              Pass:  52%/19  | Total:  3h 39m | Avg: 11m 32s | Max: 18m 28s | Hits:  30%/4873  
      🟥 Test               Pass:   0%/3   | Total: 12m 24s | Avg:  4m 08s | Max: 12m 24s
    🟥 sm
      🟥 90                 Pass:   0%/3   | Total: 14m 10s | Avg:  4m 43s | Max:  7m 07s
      🟥 90a                Pass:   0%/1   | Total:  7m 14s | Avg:  7m 14s | Max:  7m 14s
    🟨 std
      🟨 17                 Pass:  50%/4   | Total: 37m 00s | Avg:  9m 15s | Max: 13m 09s | Hits:  30%/912   
      🟨 20                 Pass:  44%/18  | Total:  3h 14m | Avg: 10m 48s | Max: 18m 28s | Hits:  30%/3961  
    
  • 🟥 cccl_c_parallel: Pass: 0%/2 | Total: 2m 23s | Avg: 1m 11s | Max: 2m 23s

    🟥 cpu
      🟥 amd64              Pass:   0%/2   | Total:  2m 23s | Avg:  1m 11s | Max:  2m 23s
    🟥 ctk
      🟥 12.8               Pass:   0%/2   | Total:  2m 23s | Avg:  1m 11s | Max:  2m 23s
    🟥 cudacxx
      🟥 nvcc12.8           Pass:   0%/2   | Total:  2m 23s | Avg:  1m 11s | Max:  2m 23s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/2   | Total:  2m 23s | Avg:  1m 11s | Max:  2m 23s
    🟥 cxx
      🟥 GCC13              Pass:   0%/2   | Total:  2m 23s | Avg:  1m 11s | Max:  2m 23s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/2   | Total:  2m 23s | Avg:  1m 11s | Max:  2m 23s
    🟥 gpu
      🟥 rtx2080            Pass:   0%/2   | Total:  2m 23s | Avg:  1m 11s | Max:  2m 23s
    🟥 jobs
      🟥 Build              Pass:   0%/1   | Total:  2m 23s | Avg:  2m 23s | Max:  2m 23s
      🟥 Test               Pass:   0%/1  
    
  • 🟥 python: Pass: 0%/1 | Total: 3m 04s | Avg: 3m 04s | Max: 3m 04s

    🟥 cpu
      🟥 amd64              Pass:   0%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
    🟥 ctk
      🟥 12.8               Pass:   0%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
    🟥 cudacxx
      🟥 nvcc12.8           Pass:   0%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
    🟥 cxx
      🟥 GCC13              Pass:   0%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
    🟥 gpu
      🟥 rtx2080            Pass:   0%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
    🟥 jobs
      🟥 Test               Pass:   0%/1   | Total:  3m 04s | Avg:  3m 04s | Max:  3m 04s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 158)

# Runner
111 linux-amd64-cpu16
15 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Comment on lines 1 to 9
// -*- C++ -*-
//===----------------------------------------------------------------------===//
//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
// SPDX-FileCopyrightText: Copyright (c) 2025 NVIDIA CORPORATION & AFFILIATES.
//
//===----------------------------------------------------------------------===//
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Technically none of those are part of libc++ so we should probalby use the "Part of libu++" license?

@miscco
Copy link
Collaborator

miscco commented Feb 19, 2025

/ok to test

#include "test_macros.h"

template <class T>
__host__ __device__ void test_fpclassify(T val, int expected)
Copy link
Collaborator

@miscco miscco Feb 19, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One more thing, can you please check whether we can run some of those tests with the disabled conversions / constructors

We might need to add a header with generator functions but it would be awesome if we could get those tests to work without assuming functionality that might be user disabled

@miscco
Copy link
Collaborator

miscco commented Feb 19, 2025

/ok to test

@miscco
Copy link
Collaborator

miscco commented Feb 19, 2025

/ok to test

Copy link
Contributor

🟨 CI finished in 1h 55m: Pass: 74%/158 | Total: 3d 01h | Avg: 27m 58s | Max: 1h 19m | Hits: 46%/168911
  • 🟨 thrust: Pass: 55%/45 | Total: 16h 29m | Avg: 21m 59s | Max: 1h 10m | Hits: 53%/44520

    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  1h 07m | Avg: 13m 35s | Max: 49m 09s
      🟩 12.5               Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 10m | Hits:   4%/3562  
      🟨 12.8               Pass:  60%/38  | Total: 13h 02m | Avg: 20m 35s | Max:  1h 04m | Hits:  57%/40958 
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total:  4m 51s | Avg:  2m 25s | Max:  2m 29s
      🟥 nvcc12.0           Pass:   0%/5   | Total:  1h 07m | Avg: 13m 35s | Max: 49m 09s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 10m | Hits:   4%/3562  
      🟨 nvcc12.8           Pass:  63%/36  | Total: 12h 57m | Avg: 21m 35s | Max:  1h 04m | Hits:  57%/40958 
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total: 18m 13s | Avg:  4m 33s | Max:  4m 35s
      🟥 Clang15            Pass:   0%/2   | Total:  9m 18s | Avg:  4m 39s | Max:  4m 50s
      🟥 Clang16            Pass:   0%/2   | Total:  9m 04s | Avg:  4m 32s | Max:  4m 40s
      🟥 Clang17            Pass:   0%/2   | Total:  9m 15s | Avg:  4m 37s | Max:  4m 48s
      🟥 Clang18            Pass:   0%/7   | Total: 19m 39s | Avg:  2m 48s | Max:  5m 06s
      🟨 GCC7               Pass:  50%/2   | Total: 37m 07s | Avg: 18m 33s | Max: 32m 22s | Hits:  38%/1782  
      🟩 GCC8               Pass: 100%/1   | Total: 30m 44s | Avg: 30m 44s | Max: 30m 44s | Hits:  38%/1782  
      🟨 GCC9               Pass:  50%/2   | Total: 35m 53s | Avg: 17m 56s | Max: 30m 56s | Hits:  52%/1782  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 33s | Max: 32m 48s | Hits:  52%/3564  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 44s | Max: 33m 52s | Hits:  49%/3564  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 55s | Max: 33m 56s | Hits:  52%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 42m | Avg: 22m 16s | Max: 33m 54s | Hits:  76%/17820 
      🟨 MSVC14.29          Pass:  50%/2   | Total:  1h 39m | Avg: 49m 51s | Max: 50m 33s | Hits:  52%/1775  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 37m | Avg: 52m 39s | Max:  1h 04m | Hits:  25%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 10m | Hits:   4%/3562  
    🟨 cxx_family
      🟥 Clang              Pass:   0%/17  | Total:  1h 05m | Avg:  3m 51s | Max:  5m 06s
      🟨 GCC                Pass:  90%/21  | Total:  8h 47m | Avg: 25m 05s | Max: 33m 56s | Hits:  63%/33858 
      🟨 MSVC               Pass:  80%/5   | Total:  4h 17m | Avg: 51m 32s | Max:  1h 04m | Hits:  32%/7100  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 10m | Hits:   4%/3562  
    🟨 gpu
      🟩 h100               Pass: 100%/2   | Total: 34m 03s | Avg: 17m 01s | Max: 22m 56s | Hits:  76%/3564  
      🟨 rtx2080            Pass:  48%/33  | Total: 12h 45m | Avg: 23m 12s | Max:  1h 10m | Hits:  42%/28496 
      🟨 rtx4090            Pass:  70%/10  | Total:  3h 09m | Avg: 18m 56s | Max:  1h 04m | Hits:  71%/12460 
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 39m 16s | Avg: 19m 38s | Max: 28m 08s | Hits:  75%/3564  
    🟨 cpu
      🟨 amd64              Pass:  55%/43  | Total: 15h 54m | Avg: 22m 11s | Max:  1h 10m | Hits:  53%/42738 
      🟨 arm64              Pass:  50%/2   | Total: 35m 27s | Avg: 17m 43s | Max: 30m 21s | Hits:  53%/1782  
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  4m 51s | Avg:  2m 25s | Max:  2m 29s
      🟨 nvcc               Pass:  58%/43  | Total: 16h 24m | Avg: 22m 53s | Max:  1h 10m | Hits:  53%/44520 
    🟨 jobs
      🟨 Build              Pass:  52%/38  | Total: 15h 17m | Avg: 24m 09s | Max:  1h 10m | Hits:  43%/35617 
      🟨 TestCPU            Pass:  66%/3   | Total: 38m 14s | Avg: 12m 44s | Max: 29m 48s | Hits:  85%/3557  
      🟨 TestGPU            Pass:  75%/4   | Total: 33m 32s | Avg:  8m 23s | Max: 11m 17s | Hits:  99%/5346  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 34m 03s | Avg: 17m 01s | Max: 22m 56s | Hits:  76%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 33m 54s | Avg: 33m 54s | Max: 33m 54s | Hits:  75%/1782  
    🟨 std
      🟨 17                 Pass:  50%/20  | Total:  8h 19m | Avg: 24m 59s | Max:  1h 08m | Hits:  39%/17805 
      🟨 20                 Pass:  56%/23  | Total:  7h 30m | Avg: 19m 35s | Max:  1h 10m | Hits:  61%/23151 
    
  • 🟨 libcudacxx: Pass: 69%/43 | Total: 15h 55m | Avg: 22m 12s | Max: 50m 04s | Hits: 37%/67841

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  68%/41  | Total: 15h 21m | Avg: 22m 28s | Max: 50m 04s | Hits:  36%/62170 
      🟩 arm64              Pass: 100%/2   | Total: 33m 49s | Avg: 16m 54s | Max: 21m 31s | Hits:  51%/5671  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 21m 11s | Avg: 10m 35s | Max: 13m 27s | Hits:  81%/2925  
      🔍 rtx2080            Pass:  68%/41  | Total: 15h 33m | Avg: 22m 46s | Max: 50m 04s | Hits:  36%/64916 
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  64%/37  | Total: 13h 29m | Avg: 21m 52s | Max: 38m 51s | Hits:  37%/67801 
      🟩 NVRTC              Pass: 100%/2   | Total: 30m 22s | Avg: 15m 11s | Max: 15m 32s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total:  1h 53m | Avg: 37m 40s | Max: 50m 04s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 08s | Avg:  2m 08s | Max:  2m 08s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 54m 38s | Avg: 10m 55s | Max: 20m 11s
      🟩 12.5               Pass: 100%/2   | Total:  1h 08m | Avg: 34m 11s | Max: 37m 07s | Hits:  28%/5616  
      🟨 12.8               Pass:  77%/36  | Total: 13h 52m | Avg: 23m 06s | Max: 50m 04s | Hits:  38%/62225 
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total: 41m 43s | Avg: 20m 51s | Max: 21m 34s
      🟥 nvcc12.0           Pass:   0%/5   | Total: 54m 38s | Avg: 10m 55s | Max: 20m 11s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 08m | Avg: 34m 11s | Max: 37m 07s | Hits:  28%/5616  
      🟨 nvcc12.8           Pass:  82%/34  | Total: 13h 10m | Avg: 23m 14s | Max: 50m 04s | Hits:  38%/62225 
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total:  1h 25m | Avg: 21m 20s | Max: 23m 10s | Hits:  32%/5628  
      🟩 Clang15            Pass: 100%/2   | Total: 44m 01s | Avg: 22m 00s | Max: 22m 51s | Hits:  32%/5628  
      🟩 Clang16            Pass: 100%/2   | Total: 46m 51s | Avg: 23m 25s | Max: 25m 37s | Hits:  32%/5628  
      🟩 Clang17            Pass: 100%/2   | Total: 38m 26s | Avg: 19m 13s | Max: 23m 11s | Hits:  46%/5628  
      🟨 Clang18            Pass:  66%/6   | Total:  2h 38m | Avg: 26m 27s | Max: 50m 04s | Hits:  32%/8463  
      🟥 GCC7               Pass:   0%/2   | Total: 23m 44s | Avg: 11m 52s | Max: 21m 18s
      🟥 GCC8               Pass:   0%/1   | Total: 20m 46s | Avg: 20m 46s | Max: 20m 46s
      🟥 GCC9               Pass:   0%/2   | Total: 24m 32s | Avg: 12m 16s | Max: 22m 15s
      🟩 GCC10              Pass: 100%/2   | Total: 46m 32s | Avg: 23m 16s | Max: 23m 27s | Hits:  32%/5634  
      🟩 GCC11              Pass: 100%/2   | Total: 34m 08s | Avg: 17m 04s | Max: 20m 16s | Hits:  45%/5630  
      🟩 GCC12              Pass: 100%/2   | Total: 47m 58s | Avg: 23m 59s | Max: 26m 09s | Hits:  32%/5630  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 12m | Avg: 19m 14s | Max: 49m 30s | Hits:  49%/14356 
      🟥 MSVC14.29          Pass:   0%/2   | Total: 49m 06s | Avg: 24m 33s | Max: 38m 25s
      🟥 MSVC14.42          Pass:   0%/2   | Total:  1h 14m | Avg: 37m 00s | Max: 38m 51s
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 08m | Avg: 34m 11s | Max: 37m 07s | Hits:  28%/5616  
    🟨 cxx_family
      🟨 Clang              Pass:  75%/16  | Total:  6h 13m | Avg: 23m 20s | Max: 50m 04s | Hits:  34%/30975 
      🟨 GCC                Pass:  76%/21  | Total:  6h 30m | Avg: 18m 34s | Max: 49m 30s | Hits:  42%/31250 
      🟥 MSVC               Pass:   0%/4   | Total:  2h 03m | Avg: 30m 46s | Max: 38m 51s
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 11s | Max: 37m 07s | Hits:  28%/5616  
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total: 41m 43s | Avg: 20m 51s | Max: 21m 34s
      🟨 nvcc               Pass:  73%/41  | Total: 15h 13m | Avg: 22m 16s | Max: 50m 04s | Hits:  37%/67841 
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 30m 22s | Avg: 15m 11s | Max: 15m 32s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 21m 11s | Avg: 10m 35s | Max: 13m 27s | Hits:  81%/2925  
      🟩 90;90a;100         Pass: 100%/1   | Total: 30m 43s | Avg: 30m 43s | Max: 30m 43s | Hits:  31%/2925  
    🟨 std
      🟨 17                 Pass:  52%/21  | Total:  7h 18m | Avg: 20m 53s | Max: 38m 25s | Hits:  32%/27950 
      🟨 20                 Pass:  85%/21  | Total:  8h 34m | Avg: 24m 29s | Max: 50m 04s | Hits:  42%/39891 
    
  • 🟨 cub: Pass: 84%/45 | Total: 1d 14h | Avg: 50m 55s | Max: 1h 19m | Hits: 39%/45252

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  83%/43  | Total:  1d 12h | Avg: 50m 25s | Max:  1h 19m | Hits:  39%/42834 
      🟩 arm64              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m | Hits:  30%/2418  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/3   | Total:  1h 13m | Avg: 24m 21s | Max: 25m 46s | Hits:  76%/3627  
      🔍 rtx2080            Pass:  79%/34  | Total:  1d 08h | Avg: 57m 50s | Max:  1h 19m | Hits:  22%/31953 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 11m | Avg: 31m 26s | Max:  1h 04m | Hits:  79%/9672  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  81%/37  | Total:  1d 11h | Avg: 57m 10s | Max:  1h 19m | Hits:  22%/35580 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 40s | Avg: 21m 40s | Max: 21m 40s | Hits:  99%/1209  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 08s | Avg: 17m 08s | Max: 17m 08s | Hits:  99%/1209  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 14m | Avg: 24m 48s | Max: 25m 04s | Hits:  99%/3627  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 02m | Avg: 20m 52s | Max: 22m 43s | Hits:  99%/3627  
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  3h 09m | Avg: 37m 51s | Max: 57m 49s
      🟩 12.5               Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 17m | Hits:  10%/2236  
      🟨 12.8               Pass:  94%/38  | Total:  1d 08h | Avg: 51m 20s | Max:  1h 19m | Hits:  40%/43016 
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total:  6m 21s | Avg:  3m 10s | Max:  3m 13s
      🟥 nvcc12.0           Pass:   0%/5   | Total:  3h 09m | Avg: 37m 51s | Max: 57m 49s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 17m | Hits:  10%/2236  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 08h | Avg: 54m 00s | Max:  1h 19m | Hits:  40%/43016 
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total:  3h 09m | Avg: 47m 27s | Max:  1h 06m | Hits:  30%/2422  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 04m | Hits:  25%/2418  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:  25%/2418  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 03m | Hits:  25%/2418  
      🟨 Clang18            Pass:  71%/7   | Total:  3h 54m | Avg: 33m 26s | Max:  1h 03m | Hits:  55%/6045  
      🟨 GCC7               Pass:  50%/2   | Total:  1h 34m | Avg: 47m 18s | Max:  1h 00m | Hits:  16%/1211  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 02m | Avg:  1h 02m | Max:  1h 02m | Hits:  16%/1211  
      🟨 GCC9               Pass:  50%/2   | Total:  1h 38m | Avg: 49m 23s | Max:  1h 06m | Hits:  28%/1211  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 04m | Hits:  21%/2422  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 05m | Hits:  25%/2418  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 04m | Hits:  22%/2418  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 07m | Avg: 38m 52s | Max:  1h 19m | Hits:  65%/13299 
      🟨 MSVC14.29          Pass:  50%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 12m | Hits:  12%/1035  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 18m | Hits:  12%/2070  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 17m | Hits:  10%/2236  
    🟨 cxx_family
      🟨 Clang              Pass:  76%/17  | Total: 13h 17m | Avg: 46m 55s | Max:  1h 06m | Hits:  37%/15721 
      🟨 GCC                Pass:  90%/22  | Total: 17h 40m | Avg: 48m 13s | Max:  1h 19m | Hits:  45%/24190 
      🟨 MSVC               Pass:  75%/4   | Total:  4h 41m | Avg:  1h 10m | Max:  1h 18m | Hits:  12%/3105  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 17m | Hits:  10%/2236  
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  6m 21s | Avg:  3m 10s | Max:  3m 13s
      🟨 nvcc               Pass:  88%/43  | Total:  1d 14h | Avg: 53m 08s | Max:  1h 19m | Hits:  39%/45252 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 13m | Avg: 24m 21s | Max: 25m 46s | Hits:  76%/3627  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 19m | Avg:  1h 19m | Max:  1h 19m | Hits:  18%/1209  
    🟨 std
      🟨 17                 Pass:  75%/20  | Total: 18h 50m | Avg: 56m 31s | Max:  1h 13m | Hits:  23%/17706 
      🟨 20                 Pass:  92%/25  | Total: 19h 20m | Avg: 46m 26s | Max:  1h 19m | Hits:  48%/27546 
    
  • 🟨 cudax: Pass: 95%/22 | Total: 2h 19m | Avg: 6m 21s | Max: 17m 19s | Hits: 93%/11002

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  94%/18  | Total:  2h 06m | Avg:  7m 03s | Max: 17m 19s | Hits:  92%/8774  
      🟩 arm64              Pass: 100%/4   | Total: 12m 49s | Avg:  3m 12s | Max:  3m 42s | Hits:  99%/2228  
    🚨 ctk: 12.0 🚨
      🔥 12.0               Pass:   0%/1   | Total:  9m 06s | Avg:  9m 06s | Max:  9m 06s
      🟩 12.5               Pass: 100%/2   | Total: 13m 43s | Avg:  6m 51s | Max:  6m 53s | Hits:  95%/710   
      🟩 12.8               Pass: 100%/19  | Total:  1h 56m | Avg:  6m 09s | Max: 17m 19s | Hits:  93%/10292 
    🚨 cudacxx: nvcc12.0 🚨
      🔥 nvcc12.0           Pass:   0%/1   | Total:  9m 06s | Avg:  9m 06s | Max:  9m 06s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 13m 43s | Avg:  6m 51s | Max:  6m 53s | Hits:  95%/710   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  1h 56m | Avg:  6m 09s | Max: 17m 19s | Hits:  93%/10292 
    🚨 cxx: MSVC14.39 🚨
      🟩 Clang14            Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s | Hits:  97%/559   
      🟩 Clang15            Pass: 100%/1   | Total:  4m 46s | Avg:  4m 46s | Max:  4m 46s | Hits:  94%/557   
      🟩 Clang16            Pass: 100%/1   | Total:  5m 48s | Avg:  5m 48s | Max:  5m 48s | Hits:  84%/557   
      🟩 Clang17            Pass: 100%/1   | Total:  6m 00s | Avg:  6m 00s | Max:  6m 00s | Hits:  82%/557   
      🟩 Clang18            Pass: 100%/4   | Total: 21m 59s | Avg:  5m 29s | Max: 11m 42s | Hits:  97%/2228  
      🟩 GCC10              Pass: 100%/1   | Total:  4m 19s | Avg:  4m 19s | Max:  4m 19s | Hits:  96%/559   
      🟩 GCC11              Pass: 100%/1   | Total:  4m 19s | Avg:  4m 19s | Max:  4m 19s | Hits:  97%/557   
      🟩 GCC12              Pass: 100%/2   | Total: 24m 06s | Avg: 12m 03s | Max: 17m 19s | Hits:  83%/1114  
      🟩 GCC13              Pass: 100%/6   | Total: 31m 50s | Avg:  5m 18s | Max: 13m 52s | Hits:  98%/3342  
      🔥 MSVC14.39          Pass:   0%/1   | Total:  9m 06s | Avg:  9m 06s | Max:  9m 06s
      🟩 MSVC14.42          Pass: 100%/1   | Total:  9m 26s | Avg:  9m 26s | Max:  9m 26s | Hits:  60%/262   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 13m 43s | Avg:  6m 51s | Max:  6m 53s | Hits:  95%/710   
    🔍 cxx_family: MSVC 🔍
      🟩 Clang              Pass: 100%/8   | Total: 42m 56s | Avg:  5m 22s | Max: 11m 42s | Hits:  93%/4458  
      🟩 GCC                Pass: 100%/10  | Total:  1h 04m | Avg:  6m 27s | Max: 17m 19s | Hits:  95%/5572  
      🔍 MSVC               Pass:  50%/2   | Total: 18m 32s | Avg:  9m 16s | Max:  9m 26s | Hits:  60%/262   
      🟩 NVHPC              Pass: 100%/2   | Total: 13m 43s | Avg:  6m 51s | Max:  6m 53s | Hits:  95%/710   
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 17m 20s | Avg:  8m 40s | Max: 13m 52s | Hits:  98%/1114  
      🔍 rtx2080            Pass:  95%/20  | Total:  2h 02m | Avg:  6m 07s | Max: 17m 19s | Hits:  93%/9888  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  94%/19  | Total:  1h 36m | Avg:  5m 05s | Max:  9m 26s | Hits:  92%/9331  
      🟩 Test               Pass: 100%/3   | Total: 42m 53s | Avg: 14m 17s | Max: 17m 19s | Hits:  99%/1671  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/4   | Total: 16m 59s | Avg:  4m 14s | Max:  6m 53s | Hits:  98%/2026  
      🔍 20                 Pass:  94%/18  | Total:  2h 02m | Avg:  6m 49s | Max: 17m 19s | Hits:  92%/8976  
    🟨 cudacxx_family
      🟨 nvcc               Pass:  95%/22  | Total:  2h 19m | Avg:  6m 21s | Max: 17m 19s | Hits:  93%/11002 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 21m 02s | Avg:  7m 00s | Max: 13m 52s | Hits:  98%/1671  
      🟩 90a                Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s | Hits:  98%/557   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 12m 54s | Avg: 6m 27s | Max: 10m 45s | Hits: 98%/296

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 54s | Avg:  6m 27s | Max: 10m 45s | Hits:  98%/296   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 12m 54s | Avg:  6m 27s | Max: 10m 45s | Hits:  98%/296   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 12m 54s | Avg:  6m 27s | Max: 10m 45s | Hits:  98%/296   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 12m 54s | Avg:  6m 27s | Max: 10m 45s | Hits:  98%/296   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 12m 54s | Avg:  6m 27s | Max: 10m 45s | Hits:  98%/296   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 12m 54s | Avg:  6m 27s | Max: 10m 45s | Hits:  98%/296   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 12m 54s | Avg:  6m 27s | Max: 10m 45s | Hits:  98%/296   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 09s | Avg:  2m 09s | Max:  2m 09s | Hits:  98%/148   
      🟩 Test               Pass: 100%/1   | Total: 10m 45s | Avg: 10m 45s | Max: 10m 45s | Hits:  98%/148   
    
  • 🟩 python: Pass: 100%/1 | Total: 30m 22s | Avg: 30m 22s | Max: 30m 22s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 30m 22s | Avg: 30m 22s | Max: 30m 22s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 30m 22s | Avg: 30m 22s | Max: 30m 22s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 30m 22s | Avg: 30m 22s | Max: 30m 22s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 30m 22s | Avg: 30m 22s | Max: 30m 22s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 30m 22s | Avg: 30m 22s | Max: 30m 22s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 30m 22s | Avg: 30m 22s | Max: 30m 22s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 30m 22s | Avg: 30m 22s | Max: 30m 22s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 30m 22s | Avg: 30m 22s | Max: 30m 22s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 158)

# Runner
111 linux-amd64-cpu16
15 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco
Copy link
Collaborator

miscco commented Feb 20, 2025

/ok to test

@miscco
Copy link
Collaborator

miscco commented Feb 20, 2025

/ok to test

Copy link
Contributor

🟨 CI finished in 1h 29m: Pass: 62%/158 | Total: 2d 13h | Avg: 23m 25s | Max: 1h 16m | Hits: 38%/146302
  • 🟨 thrust: Pass: 42%/45 | Total: 10h 46m | Avg: 14m 21s | Max: 36m 46s | Hits: 56%/33858

    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total: 18m 14s | Avg:  4m 33s | Max:  4m 45s
      🟥 Clang15            Pass:   0%/2   | Total:  9m 04s | Avg:  4m 32s | Max:  4m 36s
      🟥 Clang16            Pass:   0%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 31s
      🟥 Clang17            Pass:   0%/2   | Total:  8m 57s | Avg:  4m 28s | Max:  4m 33s
      🟥 Clang18            Pass:   0%/7   | Total: 19m 02s | Avg:  2m 43s | Max:  4m 59s
      🟨 GCC7               Pass:  50%/2   | Total: 36m 10s | Avg: 18m 05s | Max: 31m 22s | Hits:  39%/1782  
      🟩 GCC8               Pass: 100%/1   | Total: 32m 37s | Avg: 32m 37s | Max: 32m 37s | Hits:  39%/1782  
      🟨 GCC9               Pass:  50%/2   | Total: 35m 45s | Avg: 17m 52s | Max: 31m 08s | Hits:  39%/1782  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 48s | Max: 32m 15s | Hits:  39%/3564  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 40s | Max: 33m 48s | Hits:  39%/3564  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 10m | Avg: 35m 07s | Max: 36m 46s | Hits:  39%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 44m | Avg: 22m 27s | Max: 34m 14s | Hits:  70%/17820 
      🟥 MSVC14.29          Pass:   0%/2   | Total: 18m 58s | Avg:  9m 29s | Max:  9m 34s
      🟥 MSVC14.42          Pass:   0%/3   | Total: 21m 27s | Avg:  7m 09s | Max: 10m 47s
      🟥 NVHPC24.7          Pass:   0%/2   | Total: 13m 06s | Avg:  6m 33s | Max:  6m 36s
    🟨 gpu
      🟩 h100               Pass: 100%/2   | Total: 35m 05s | Avg: 17m 32s | Max: 23m 31s | Hits:  69%/3564  
      🟨 rtx2080            Pass:  36%/33  | Total:  8h 20m | Avg: 15m 10s | Max: 36m 46s | Hits:  42%/21384 
      🟨 rtx4090            Pass:  50%/10  | Total:  1h 50m | Avg: 11m 00s | Max: 34m 14s | Hits:  82%/8910  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 40m 56s | Avg: 20m 28s | Max: 29m 47s | Hits:  69%/3564  
    🟨 cpu
      🟨 amd64              Pass:  41%/43  | Total: 10h 10m | Avg: 14m 11s | Max: 36m 46s | Hits:  56%/32076 
      🟨 arm64              Pass:  50%/2   | Total: 35m 49s | Avg: 17m 54s | Max: 30m 50s | Hits:  39%/1782  
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 28m 21s | Avg:  5m 40s | Max:  9m 34s
      🟥 12.5               Pass:   0%/2   | Total: 13m 06s | Avg:  6m 33s | Max:  6m 36s
      🟨 12.8               Pass:  50%/38  | Total: 10h 04m | Avg: 15m 54s | Max: 36m 46s | Hits:  56%/33858 
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total:  4m 47s | Avg:  2m 23s | Max:  2m 32s
      🟥 nvcc12.0           Pass:   0%/5   | Total: 28m 21s | Avg:  5m 40s | Max:  9m 34s
      🟥 nvcc12.5           Pass:   0%/2   | Total: 13m 06s | Avg:  6m 33s | Max:  6m 36s
      🟨 nvcc12.8           Pass:  52%/36  | Total:  9h 59m | Avg: 16m 39s | Max: 36m 46s | Hits:  56%/33858 
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  4m 47s | Avg:  2m 23s | Max:  2m 32s
      🟨 nvcc               Pass:  44%/43  | Total: 10h 41m | Avg: 14m 54s | Max: 36m 46s | Hits:  56%/33858 
    🟨 cxx_family
      🟥 Clang              Pass:   0%/17  | Total:  1h 04m | Avg:  3m 47s | Max:  4m 59s
      🟨 GCC                Pass:  90%/21  | Total:  8h 48m | Avg: 25m 09s | Max: 36m 46s | Hits:  56%/33858 
      🟥 MSVC               Pass:   0%/5   | Total: 40m 25s | Avg:  8m 05s | Max: 10m 47s
      🟥 NVHPC              Pass:   0%/2   | Total: 13m 06s | Avg:  6m 33s | Max:  6m 36s
    🟨 jobs
      🟨 Build              Pass:  39%/38  | Total: 10h 03m | Avg: 15m 53s | Max: 36m 46s | Hits:  44%/26730 
      🟨 TestCPU            Pass:  33%/3   | Total:  8m 09s | Avg:  2m 43s | Max:  8m 09s | Hits:  99%/1782  
      🟨 TestGPU            Pass:  75%/4   | Total: 34m 04s | Avg:  8m 31s | Max: 11m 34s | Hits:  99%/5346  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 35m 05s | Avg: 17m 32s | Max: 23m 31s | Hits:  69%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 29m 46s | Avg: 29m 46s | Max: 29m 46s | Hits:  74%/1782  
    🟨 std
      🟨 17                 Pass:  35%/20  | Total:  5h 04m | Avg: 15m 12s | Max: 36m 46s | Hits:  39%/12474 
      🟨 20                 Pass:  43%/23  | Total:  5h 01m | Avg: 13m 05s | Max: 34m 14s | Hits:  64%/17820 
    
  • 🟨 libcudacxx: Pass: 60%/43 | Total: 14h 00m | Avg: 19m 32s | Max: 47m 39s | Hits: 25%/62207

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  58%/41  | Total: 13h 15m | Avg: 19m 24s | Max: 47m 39s | Hits:  25%/56534 
      🟩 arm64              Pass: 100%/2   | Total: 44m 20s | Avg: 22m 10s | Max: 22m 19s | Hits:  25%/5673  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 31m 44s | Avg: 15m 52s | Max: 18m 09s | Hits:  25%/2926  
      🔍 rtx2080            Pass:  58%/41  | Total: 13h 28m | Avg: 19m 43s | Max: 47m 39s | Hits:  25%/59281 
    🚨 sm: 75 🚨
      🔥 75                 Pass:   0%/2   | Total: 30m 24s | Avg: 15m 12s | Max: 15m 39s
      🟩 90                 Pass: 100%/2   | Total: 31m 44s | Avg: 15m 52s | Max: 18m 09s | Hits:  25%/2926  
      🟩 90;90a;100         Pass: 100%/1   | Total: 32m 04s | Avg: 32m 04s | Max: 32m 04s | Hits:  25%/2926  
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total:  1h 28m | Avg: 22m 08s | Max: 25m 23s | Hits:  26%/5630  
      🟩 Clang15            Pass: 100%/2   | Total: 48m 02s | Avg: 24m 01s | Max: 25m 38s | Hits:  25%/5630  
      🟩 Clang16            Pass: 100%/2   | Total: 45m 49s | Avg: 22m 54s | Max: 23m 06s | Hits:  26%/5630  
      🟩 Clang17            Pass: 100%/2   | Total: 51m 28s | Avg: 25m 44s | Max: 26m 32s | Hits:  25%/5630  
      🟨 Clang18            Pass:  66%/6   | Total:  2h 37m | Avg: 26m 18s | Max: 47m 39s | Hits:  26%/8466  
      🟥 GCC7               Pass:   0%/2   | Total: 23m 05s | Avg: 11m 32s | Max: 21m 07s
      🟥 GCC8               Pass:   0%/1   | Total: 21m 57s | Avg: 21m 57s | Max: 21m 57s
      🟥 GCC9               Pass:   0%/2   | Total: 23m 47s | Avg: 11m 53s | Max: 21m 42s
      🟩 GCC10              Pass: 100%/2   | Total: 45m 37s | Avg: 22m 48s | Max: 24m 37s | Hits:  26%/5636  
      🟩 GCC11              Pass: 100%/2   | Total: 46m 40s | Avg: 23m 20s | Max: 25m 19s | Hits:  25%/5632  
      🟩 GCC12              Pass: 100%/2   | Total: 50m 22s | Avg: 25m 11s | Max: 25m 11s | Hits:  25%/5632  
      🟨 GCC13              Pass:  80%/10  | Total:  2h 56m | Avg: 17m 36s | Max: 32m 04s | Hits:  25%/14321 
      🟥 MSVC14.29          Pass:   0%/2   | Total: 23m 39s | Avg: 11m 49s | Max: 12m 08s
      🟥 MSVC14.42          Pass:   0%/2   | Total: 25m 35s | Avg: 12m 47s | Max: 13m 47s
      🟥 NVHPC24.7          Pass:   0%/2   | Total: 11m 42s | Avg:  5m 51s | Max:  6m 03s
    🟨 jobs
      🟨 Build              Pass:  59%/37  | Total: 12h 17m | Avg: 19m 55s | Max: 32m 04s | Hits:  25%/62207 
      🟥 NVRTC              Pass:   0%/2   | Total: 30m 24s | Avg: 15m 12s | Max: 15m 39s
      🟩 Test               Pass: 100%/3   | Total:  1h 10m | Avg: 23m 29s | Max: 47m 39s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 11s | Avg:  2m 11s | Max:  2m 11s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 56m 16s | Avg: 11m 15s | Max: 20m 16s
      🟥 12.5               Pass:   0%/2   | Total: 11m 42s | Avg:  5m 51s | Max:  6m 03s
      🟨 12.8               Pass:  72%/36  | Total: 12h 52m | Avg: 21m 27s | Max: 47m 39s | Hits:  25%/62207 
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total: 41m 22s | Avg: 20m 41s | Max: 22m 43s
      🟥 nvcc12.0           Pass:   0%/5   | Total: 56m 16s | Avg: 11m 15s | Max: 20m 16s
      🟥 nvcc12.5           Pass:   0%/2   | Total: 11m 42s | Avg:  5m 51s | Max:  6m 03s
      🟨 nvcc12.8           Pass:  76%/34  | Total: 12h 10m | Avg: 21m 29s | Max: 47m 39s | Hits:  25%/62207 
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total: 41m 22s | Avg: 20m 41s | Max: 22m 43s
      🟨 nvcc               Pass:  63%/41  | Total: 13h 18m | Avg: 19m 29s | Max: 47m 39s | Hits:  25%/62207 
    🟨 cxx_family
      🟨 Clang              Pass:  75%/16  | Total:  6h 31m | Avg: 24m 29s | Max: 47m 39s | Hits:  26%/30986 
      🟨 GCC                Pass:  66%/21  | Total:  6h 27m | Avg: 18m 27s | Max: 32m 04s | Hits:  25%/31221 
      🟥 MSVC               Pass:   0%/4   | Total: 49m 14s | Avg: 12m 18s | Max: 13m 47s
      🟥 NVHPC              Pass:   0%/2   | Total: 11m 42s | Avg:  5m 51s | Max:  6m 03s
    🟨 std
      🟨 17                 Pass:  42%/21  | Total:  6h 11m | Avg: 17m 41s | Max: 25m 23s | Hits:  26%/25152 
      🟨 20                 Pass:  76%/21  | Total:  7h 46m | Avg: 22m 12s | Max: 47m 39s | Hits:  25%/37055 
    
  • 🟨 cub: Pass: 73%/45 | Total: 1d 08h | Avg: 43m 15s | Max: 1h 16m | Hits: 36%/39911

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  72%/43  | Total:  1d 06h | Avg: 42m 13s | Max:  1h 16m | Hits:  37%/37493 
      🟩 arm64              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 07m | Hits:  16%/2418  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/3   | Total:  1h 13m | Avg: 24m 23s | Max: 27m 00s | Hits:  71%/3627  
      🔍 rtx2080            Pass:  64%/34  | Total:  1d 03h | Avg: 47m 41s | Max:  1h 16m | Hits:  16%/26612 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 11m | Avg: 31m 29s | Max:  1h 03m | Hits:  78%/9672  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  67%/37  | Total:  1d 05h | Avg: 47m 55s | Max:  1h 16m | Hits:  16%/30239 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 39s | Avg: 21m 39s | Max: 21m 39s | Hits:  99%/1209  
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 08s | Avg: 16m 08s | Max: 16m 08s | Hits:  99%/1209  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 12m | Avg: 24m 11s | Max: 24m 39s | Hits:  99%/3627  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 03m | Avg: 21m 04s | Max: 22m 14s | Hits:  99%/3627  
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total:  3h 08m | Avg: 47m 01s | Max:  1h 05m | Hits:  16%/2422  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 07m | Hits:  16%/2418  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 59m | Avg: 59m 39s | Max:  1h 01m | Hits:  16%/2418  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m | Hits:  16%/2418  
      🟨 Clang18            Pass:  71%/7   | Total:  3h 53m | Avg: 33m 23s | Max:  1h 03m | Hits:  49%/6045  
      🟨 GCC7               Pass:  50%/2   | Total:  1h 38m | Avg: 49m 08s | Max:  1h 06m | Hits:  16%/1211  
      🟩 GCC8               Pass: 100%/1   | Total: 58m 32s | Avg: 58m 32s | Max: 58m 32s | Hits:  16%/1211  
      🟨 GCC9               Pass:  50%/2   | Total:  1h 36m | Avg: 48m 08s | Max:  1h 04m | Hits:  16%/1211  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m | Hits:  16%/2422  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 05m | Hits:  15%/2418  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m | Hits:  15%/2418  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 07m | Avg: 38m 50s | Max:  1h 16m | Hits:  61%/13299 
      🟥 MSVC14.29          Pass:   0%/2   | Total: 17m 07s | Avg:  8m 33s | Max:  8m 36s
      🟥 MSVC14.42          Pass:   0%/2   | Total: 19m 49s | Avg:  9m 54s | Max:  9m 56s
      🟥 NVHPC24.7          Pass:   0%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 52s
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  2h 16m | Avg: 27m 18s | Max: 32m 20s
      🟥 12.5               Pass:   0%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 52s
      🟨 12.8               Pass:  86%/38  | Total:  1d 05h | Avg: 45m 53s | Max:  1h 16m | Hits:  36%/39911 
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total:  6m 04s | Avg:  3m 02s | Max:  3m 03s
      🟥 nvcc12.0           Pass:   0%/5   | Total:  2h 16m | Avg: 27m 18s | Max: 32m 20s
      🟥 nvcc12.5           Pass:   0%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 52s
      🟨 nvcc12.8           Pass:  91%/36  | Total:  1d 04h | Avg: 48m 16s | Max:  1h 16m | Hits:  36%/39911 
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  6m 04s | Avg:  3m 02s | Max:  3m 03s
      🟨 nvcc               Pass:  76%/43  | Total:  1d 08h | Avg: 45m 07s | Max:  1h 16m | Hits:  36%/39911 
    🟨 cxx_family
      🟨 Clang              Pass:  76%/17  | Total: 13h 12m | Avg: 46m 37s | Max:  1h 07m | Hits:  29%/15721 
      🟨 GCC                Pass:  90%/22  | Total: 17h 31m | Avg: 47m 46s | Max:  1h 16m | Hits:  41%/24190 
      🟥 MSVC               Pass:   0%/4   | Total: 36m 56s | Avg:  9m 14s | Max:  9m 56s
      🟥 NVHPC              Pass:   0%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 52s
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 13m | Avg: 24m 23s | Max: 27m 00s | Hits:  71%/3627  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 16m | Avg:  1h 16m | Max:  1h 16m | Hits:  15%/1209  
    🟨 std
      🟨 17                 Pass:  60%/20  | Total: 15h 04m | Avg: 45m 13s | Max:  1h 06m | Hits:  16%/14518 
      🟨 20                 Pass:  84%/25  | Total: 17h 22m | Avg: 41m 41s | Max:  1h 16m | Hits:  48%/25393 
    
  • 🟨 cudax: Pass: 81%/22 | Total: 3h 44m | Avg: 10m 11s | Max: 14m 21s | Hits: 57%/10030

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  77%/18  | Total:  3h 07m | Avg: 10m 26s | Max: 14m 21s | Hits:  60%/7802  
      🟩 arm64              Pass: 100%/4   | Total: 36m 09s | Avg:  9m 02s | Max:  9m 47s | Hits:  49%/2228  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 22m 42s | Avg: 11m 21s | Max: 14m 14s | Hits:  74%/1114  
      🔍 rtx2080            Pass:  80%/20  | Total:  3h 21m | Avg: 10m 04s | Max: 14m 21s | Hits:  55%/8916  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  78%/19  | Total:  3h 03m | Avg:  9m 40s | Max: 11m 33s | Hits:  49%/8359  
      🟩 Test               Pass: 100%/3   | Total: 40m 25s | Avg: 13m 28s | Max: 14m 21s | Hits:  99%/1671  
    🟨 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 10m 39s | Avg: 10m 39s | Max: 10m 39s | Hits:  49%/559   
      🟩 Clang15            Pass: 100%/1   | Total: 11m 26s | Avg: 11m 26s | Max: 11m 26s | Hits:  49%/557   
      🟩 Clang16            Pass: 100%/1   | Total: 11m 33s | Avg: 11m 33s | Max: 11m 33s | Hits:  49%/557   
      🟩 Clang17            Pass: 100%/1   | Total:  9m 51s | Avg:  9m 51s | Max:  9m 51s | Hits:  49%/557   
      🟩 Clang18            Pass: 100%/4   | Total: 39m 35s | Avg:  9m 53s | Max: 11m 50s | Hits:  62%/2228  
      🟩 GCC10              Pass: 100%/1   | Total:  9m 57s | Avg:  9m 57s | Max:  9m 57s | Hits:  49%/559   
      🟩 GCC11              Pass: 100%/1   | Total: 10m 32s | Avg: 10m 32s | Max: 10m 32s | Hits:  49%/557   
      🟩 GCC12              Pass: 100%/2   | Total: 25m 31s | Avg: 12m 45s | Max: 14m 21s | Hits:  74%/1114  
      🟩 GCC13              Pass: 100%/6   | Total: 55m 56s | Avg:  9m 19s | Max: 14m 14s | Hits:  57%/3342  
      🟥 MSVC14.39          Pass:   0%/1   | Total: 10m 49s | Avg: 10m 49s | Max: 10m 49s
      🟥 MSVC14.42          Pass:   0%/1   | Total: 10m 36s | Avg: 10m 36s | Max: 10m 36s
      🟥 NVHPC24.7          Pass:   0%/2   | Total: 17m 41s | Avg:  8m 50s | Max:  9m 07s
    🟨 cxx_family
      🟩 Clang              Pass: 100%/8   | Total:  1h 23m | Avg: 10m 23s | Max: 11m 50s | Hits:  56%/4458  
      🟩 GCC                Pass: 100%/10  | Total:  1h 41m | Avg: 10m 11s | Max: 14m 21s | Hits:  59%/5572  
      🟥 MSVC               Pass:   0%/2   | Total: 21m 25s | Avg: 10m 42s | Max: 10m 49s
      🟥 NVHPC              Pass:   0%/2   | Total: 17m 41s | Avg:  8m 50s | Max:  9m 07s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  81%/22  | Total:  3h 44m | Avg: 10m 11s | Max: 14m 21s | Hits:  57%/10030 
    🟨 ctk
      🟥 12.0               Pass:   0%/1   | Total: 10m 49s | Avg: 10m 49s | Max: 10m 49s
      🟥 12.5               Pass:   0%/2   | Total: 17m 41s | Avg:  8m 50s | Max:  9m 07s
      🟨 12.8               Pass:  94%/19  | Total:  3h 15m | Avg: 10m 17s | Max: 14m 21s | Hits:  57%/10030 
    🟨 cudacxx
      🟥 nvcc12.0           Pass:   0%/1   | Total: 10m 49s | Avg: 10m 49s | Max: 10m 49s
      🟥 nvcc12.5           Pass:   0%/2   | Total: 17m 41s | Avg:  8m 50s | Max:  9m 07s
      🟨 nvcc12.8           Pass:  94%/19  | Total:  3h 15m | Avg: 10m 17s | Max: 14m 21s | Hits:  57%/10030 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 29m 30s | Avg:  9m 50s | Max: 14m 14s | Hits:  66%/1671  
      🟩 90a                Pass: 100%/1   | Total:  7m 45s | Avg:  7m 45s | Max:  7m 45s | Hits:  49%/557   
    🟨 std
      🟨 17                 Pass:  75%/4   | Total: 33m 24s | Avg:  8m 21s | Max:  9m 07s | Hits:  49%/1671  
      🟨 20                 Pass:  83%/18  | Total:  3h 10m | Avg: 10m 35s | Max: 14m 21s | Hits:  59%/8359  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 13m 28s | Avg: 6m 44s | Max: 10m 51s | Hits: 97%/296

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 13m 28s | Avg:  6m 44s | Max: 10m 51s | Hits:  97%/296   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 13m 28s | Avg:  6m 44s | Max: 10m 51s | Hits:  97%/296   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 13m 28s | Avg:  6m 44s | Max: 10m 51s | Hits:  97%/296   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 13m 28s | Avg:  6m 44s | Max: 10m 51s | Hits:  97%/296   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 13m 28s | Avg:  6m 44s | Max: 10m 51s | Hits:  97%/296   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 13m 28s | Avg:  6m 44s | Max: 10m 51s | Hits:  97%/296   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 13m 28s | Avg:  6m 44s | Max: 10m 51s | Hits:  97%/296   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 37s | Avg:  2m 37s | Max:  2m 37s | Hits:  95%/148   
      🟩 Test               Pass: 100%/1   | Total: 10m 51s | Avg: 10m 51s | Max: 10m 51s | Hits:  98%/148   
    
  • 🟩 python: Pass: 100%/1 | Total: 30m 39s | Avg: 30m 39s | Max: 30m 39s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 30m 39s | Avg: 30m 39s | Max: 30m 39s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 30m 39s | Avg: 30m 39s | Max: 30m 39s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 30m 39s | Avg: 30m 39s | Max: 30m 39s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 30m 39s | Avg: 30m 39s | Max: 30m 39s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 30m 39s | Avg: 30m 39s | Max: 30m 39s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 30m 39s | Avg: 30m 39s | Max: 30m 39s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 30m 39s | Avg: 30m 39s | Max: 30m 39s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 30m 39s | Avg: 30m 39s | Max: 30m 39s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 158)

# Runner
111 linux-amd64-cpu16
15 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@miscco
Copy link
Collaborator

miscco commented Feb 20, 2025

/ok to test

Copy link
Contributor

🟨 CI finished in 1h 42m: Pass: 68%/158 | Total: 2d 17h | Avg: 24m 53s | Max: 1h 22m | Hits: 46%/160518
  • 🟨 thrust: Pass: 46%/45 | Total: 12h 23m | Avg: 16m 31s | Max: 1h 01m | Hits: 55%/37420

    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 28m 57s | Avg:  5m 47s | Max: 10m 14s
      🟩 12.5               Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:   0%/3562  
      🟨 12.8               Pass:  50%/38  | Total:  9h 52m | Avg: 15m 36s | Max: 34m 41s | Hits:  61%/33858 
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total:  8m 12s | Avg:  4m 06s | Max:  4m 07s
      🟥 nvcc12.0           Pass:   0%/5   | Total: 28m 57s | Avg:  5m 47s | Max: 10m 14s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:   0%/3562  
      🟨 nvcc12.8           Pass:  52%/36  | Total:  9h 44m | Avg: 16m 14s | Max: 34m 41s | Hits:  61%/33858 
    🟨 cxx
      🟥 Clang14            Pass:   0%/4   | Total: 18m 01s | Avg:  4m 30s | Max:  4m 38s
      🟥 Clang15            Pass:   0%/2   | Total:  8m 40s | Avg:  4m 20s | Max:  4m 23s
      🟥 Clang16            Pass:   0%/2   | Total:  9m 12s | Avg:  4m 36s | Max:  4m 42s
      🟥 Clang17            Pass:   0%/2   | Total:  9m 30s | Avg:  4m 45s | Max:  4m 46s
      🟥 Clang18            Pass:   0%/7   | Total: 21m 58s | Avg:  3m 08s | Max:  4m 53s
      🟨 GCC7               Pass:  50%/2   | Total: 36m 19s | Avg: 18m 09s | Max: 31m 32s | Hits:  47%/1782  
      🟩 GCC8               Pass: 100%/1   | Total: 29m 02s | Avg: 29m 02s | Max: 29m 02s | Hits:  47%/1782  
      🟨 GCC9               Pass:  50%/2   | Total: 36m 49s | Avg: 18m 24s | Max: 32m 00s | Hits:  47%/1782  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 28s | Max: 34m 00s | Hits:  47%/3564  
      🟩 GCC11              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 00s | Max: 32m 12s | Hits:  47%/3564  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 08m | Avg: 34m 15s | Max: 34m 41s | Hits:  47%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 33m | Avg: 21m 23s | Max: 32m 35s | Hits:  74%/17820 
      🟥 MSVC14.29          Pass:   0%/2   | Total: 20m 23s | Avg: 10m 11s | Max: 10m 14s
      🟥 MSVC14.42          Pass:   0%/3   | Total: 20m 42s | Avg:  6m 54s | Max: 10m 43s
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:   0%/3562  
    🟨 cxx_family
      🟥 Clang              Pass:   0%/17  | Total:  1h 07m | Avg:  3m 57s | Max:  4m 53s
      🟨 GCC                Pass:  90%/21  | Total:  8h 33m | Avg: 24m 27s | Max: 34m 41s | Hits:  61%/33858 
      🟥 MSVC               Pass:   0%/5   | Total: 41m 05s | Avg:  8m 13s | Max: 10m 43s
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 01m | Hits:   0%/3562  
    🟨 gpu
      🟩 h100               Pass: 100%/2   | Total: 32m 14s | Avg: 16m 07s | Max: 20m 40s | Hits:  73%/3564  
      🟨 rtx2080            Pass:  42%/33  | Total: 10h 11m | Avg: 18m 31s | Max:  1h 01m | Hits:  42%/24946 
      🟨 rtx4090            Pass:  50%/10  | Total:  1h 39m | Avg:  9m 57s | Max: 28m 42s | Hits:  84%/8910  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 37m 58s | Avg: 18m 59s | Max: 26m 49s | Hits:  73%/3564  
    🟨 cpu
      🟨 amd64              Pass:  46%/43  | Total: 11h 46m | Avg: 16m 26s | Max:  1h 01m | Hits:  56%/35638 
      🟨 arm64              Pass:  50%/2   | Total: 36m 27s | Avg: 18m 13s | Max: 31m 34s | Hits:  47%/1782  
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total:  8m 12s | Avg:  4m 06s | Max:  4m 07s
      🟨 nvcc               Pass:  48%/43  | Total: 12h 15m | Avg: 17m 05s | Max:  1h 01m | Hits:  55%/37420 
    🟨 jobs
      🟨 Build              Pass:  44%/38  | Total: 11h 41m | Avg: 18m 28s | Max:  1h 01m | Hits:  45%/30292 
      🟨 TestCPU            Pass:  33%/3   | Total:  7m 29s | Avg:  2m 29s | Max:  7m 29s | Hits:  99%/1782  
      🟨 TestGPU            Pass:  75%/4   | Total: 33m 51s | Avg:  8m 27s | Max: 11m 34s | Hits:  99%/5346  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 32m 14s | Avg: 16m 07s | Max: 20m 40s | Hits:  73%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 32m 12s | Avg: 32m 12s | Max: 32m 12s | Hits:  75%/1782  
    🟨 std
      🟨 17                 Pass:  40%/20  | Total:  5h 54m | Avg: 17m 43s | Max: 59m 34s | Hits:  41%/14255 
      🟨 20                 Pass:  47%/23  | Total:  5h 50m | Avg: 15m 14s | Max:  1h 01m | Hits:  62%/19601 
    
  • 🟨 libcudacxx: Pass: 65%/43 | Total: 13h 59m | Avg: 19m 31s | Max: 50m 08s | Hits: 42%/67825

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  63%/41  | Total: 13h 14m | Avg: 19m 23s | Max: 50m 08s | Hits:  43%/62152 
      🟩 arm64              Pass: 100%/2   | Total: 44m 35s | Avg: 22m 17s | Max: 22m 30s | Hits:  31%/5673  
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 17m 53s | Avg:  8m 56s | Max: 13m 22s | Hits:  94%/2926  
      🔍 rtx2080            Pass:  63%/41  | Total: 13h 41m | Avg: 20m 02s | Max: 50m 08s | Hits:  40%/64899 
    🚨 sm: 75 🚨
      🔥 75                 Pass:   0%/2   | Total: 31m 54s | Avg: 15m 57s | Max: 16m 56s
      🟩 90                 Pass: 100%/2   | Total: 17m 53s | Avg:  8m 56s | Max: 13m 22s | Hits:  94%/2926  
      🟩 90;90a;100         Pass: 100%/1   | Total: 31m 08s | Avg: 31m 08s | Max: 31m 08s | Hits:  30%/2926  
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total: 53m 51s | Avg: 10m 46s | Max: 19m 22s
      🟩 12.5               Pass: 100%/2   | Total:  1h 11m | Avg: 35m 32s | Max: 35m 47s | Hits:   2%/5618  
      🟨 12.8               Pass:  72%/36  | Total: 11h 54m | Avg: 19m 50s | Max: 50m 08s | Hits:  46%/62207 
    🟨 cudacxx
      🟥 ClangCUDA18        Pass:   0%/2   | Total: 42m 22s | Avg: 21m 11s | Max: 21m 16s
      🟥 nvcc12.0           Pass:   0%/5   | Total: 53m 51s | Avg: 10m 46s | Max: 19m 22s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 11m | Avg: 35m 32s | Max: 35m 47s | Hits:   2%/5618  
      🟨 nvcc12.8           Pass:  76%/34  | Total: 11h 12m | Avg: 19m 46s | Max: 50m 08s | Hits:  46%/62207 
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total:  1h 23m | Avg: 20m 46s | Max: 22m 57s | Hits:  32%/5630  
      🟩 Clang15            Pass: 100%/2   | Total: 46m 59s | Avg: 23m 29s | Max: 25m 55s | Hits:  32%/5630  
      🟩 Clang16            Pass: 100%/2   | Total: 43m 14s | Avg: 21m 37s | Max: 22m 23s | Hits:  32%/5630  
      🟩 Clang17            Pass: 100%/2   | Total: 45m 09s | Avg: 22m 34s | Max: 23m 45s | Hits:  32%/5630  
      🟨 Clang18            Pass:  66%/6   | Total:  2h 06m | Avg: 21m 07s | Max: 50m 08s | Hits:  73%/8466  
      🟥 GCC7               Pass:   0%/2   | Total: 24m 47s | Avg: 12m 23s | Max: 22m 47s
      🟥 GCC8               Pass:   0%/1   | Total: 22m 16s | Avg: 22m 16s | Max: 22m 16s
      🟥 GCC9               Pass:   0%/2   | Total: 25m 57s | Avg: 12m 58s | Max: 24m 01s
      🟩 GCC10              Pass: 100%/2   | Total: 44m 00s | Avg: 22m 00s | Max: 23m 53s | Hits:  32%/5636  
      🟩 GCC11              Pass: 100%/2   | Total: 27m 52s | Avg: 13m 56s | Max: 22m 54s | Hits:  62%/5632  
      🟩 GCC12              Pass: 100%/2   | Total: 29m 29s | Avg: 14m 44s | Max: 24m 30s | Hits:  62%/5632  
      🟨 GCC13              Pass:  80%/10  | Total:  3h 19m | Avg: 19m 56s | Max: 48m 59s | Hits:  44%/14321 
      🟥 MSVC14.29          Pass:   0%/2   | Total: 24m 15s | Avg: 12m 07s | Max: 12m 25s
      🟥 MSVC14.42          Pass:   0%/2   | Total: 25m 03s | Avg: 12m 31s | Max: 12m 47s
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 11m | Avg: 35m 32s | Max: 35m 47s | Hits:   2%/5618  
    🟨 cxx_family
      🟨 Clang              Pass:  75%/16  | Total:  5h 45m | Avg: 21m 34s | Max: 50m 08s | Hits:  43%/30986 
      🟨 GCC                Pass:  66%/21  | Total:  6h 13m | Avg: 17m 47s | Max: 48m 59s | Hits:  48%/31221 
      🟥 MSVC               Pass:   0%/4   | Total: 49m 18s | Avg: 12m 19s | Max: 12m 47s
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 11m | Avg: 35m 32s | Max: 35m 47s | Hits:   2%/5618  
    🟨 jobs
      🟨 Build              Pass:  64%/37  | Total: 11h 32m | Avg: 18m 42s | Max: 35m 47s | Hits:  42%/67825 
      🟥 NVRTC              Pass:   0%/2   | Total: 31m 54s | Avg: 15m 57s | Max: 16m 56s
      🟩 Test               Pass: 100%/3   | Total:  1h 52m | Avg: 37m 29s | Max: 50m 08s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 32s | Avg:  2m 32s | Max:  2m 32s
    🟨 cudacxx_family
      🟥 ClangCUDA          Pass:   0%/2   | Total: 42m 22s | Avg: 21m 11s | Max: 21m 16s
      🟨 nvcc               Pass:  68%/41  | Total: 13h 17m | Avg: 19m 26s | Max: 50m 08s | Hits:  42%/67825 
    🟨 std
      🟨 17                 Pass:  47%/21  | Total:  5h 43m | Avg: 16m 21s | Max: 35m 18s | Hits:  47%/27940 
      🟨 20                 Pass:  80%/21  | Total:  8h 13m | Avg: 23m 29s | Max: 50m 08s | Hits:  38%/39885 
    
  • 🟨 cub: Pass: 82%/45 | Total: 1d 12h | Avg: 48m 14s | Max: 1h 22m | Hits: 34%/44237

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  81%/43  | Total:  1d 10h | Avg: 47m 31s | Max:  1h 22m | Hits:  35%/41819 
      🟩 arm64              Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m | Hits:  16%/2418  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 07m | Hits:  14%/2090  
      🔍 nvcc               Pass:  81%/43  | Total:  1d 10h | Avg: 47m 27s | Max:  1h 22m | Hits:  35%/42147 
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/3   | Total:  1h 13m | Avg: 24m 27s | Max: 27m 11s | Hits:  71%/3627  
      🔍 rtx2080            Pass:  76%/34  | Total:  1d 06h | Avg: 54m 03s | Max:  1h 22m | Hits:  16%/30938 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 19m | Avg: 32m 27s | Max:  1h 03m | Hits:  78%/9672  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  78%/37  | Total:  1d 09h | Avg: 53m 49s | Max:  1h 22m | Hits:  16%/34565 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 53s | Avg: 21m 53s | Max: 21m 53s | Hits:  99%/1209  
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 18s | Avg: 19m 18s | Max: 19m 18s | Hits:  99%/1209  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 15m | Avg: 25m 16s | Max: 26m 37s | Hits:  99%/3627  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 02m | Avg: 20m 45s | Max: 21m 36s | Hits:  99%/3627  
    🟨 ctk
      🟥 12.0               Pass:   0%/5   | Total:  2h 18m | Avg: 27m 42s | Max: 33m 31s
      🟩 12.5               Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2236  
      🟨 12.8               Pass:  92%/38  | Total:  1d 07h | Avg: 49m 46s | Max:  1h 22m | Hits:  35%/42001 
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 07m | Hits:  14%/2090  
      🟥 nvcc12.0           Pass:   0%/5   | Total:  2h 18m | Avg: 27m 42s | Max: 33m 31s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2236  
      🟨 nvcc12.8           Pass:  91%/36  | Total:  1d 05h | Avg: 48m 56s | Max:  1h 22m | Hits:  36%/39911 
    🟨 cxx
      🟨 Clang14            Pass:  50%/4   | Total:  3h 10m | Avg: 47m 31s | Max:  1h 02m | Hits:  17%/2422  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 07m | Hits:  16%/2418  
      🟩 Clang16            Pass: 100%/2   | Total:  1h 59m | Avg: 59m 51s | Max:  1h 01m | Hits:  16%/2418  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 02m | Hits:  16%/2418  
      🟩 Clang18            Pass: 100%/7   | Total:  6h 03m | Avg: 51m 58s | Max:  1h 07m | Hits:  41%/8135  
      🟨 GCC7               Pass:  50%/2   | Total:  1h 39m | Avg: 49m 49s | Max:  1h 08m | Hits:  16%/1211  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m | Hits:  16%/1211  
      🟨 GCC9               Pass:  50%/2   | Total:  1h 35m | Avg: 47m 40s | Max:  1h 02m | Hits:  16%/1211  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 35s | Max: 59m 57s | Hits:  16%/2422  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m | Hits:  16%/2418  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  16%/2418  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 20m | Avg: 40m 00s | Max:  1h 22m | Hits:  61%/13299 
      🟥 MSVC14.29          Pass:   0%/2   | Total: 17m 51s | Avg:  8m 55s | Max:  9m 25s
      🟥 MSVC14.42          Pass:   0%/2   | Total: 19m 03s | Avg:  9m 31s | Max:  9m 45s
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2236  
    🟨 cxx_family
      🟨 Clang              Pass:  88%/17  | Total: 15h 23m | Avg: 54m 19s | Max:  1h 07m | Hits:  27%/17811 
      🟨 GCC                Pass:  90%/22  | Total: 17h 49m | Avg: 48m 37s | Max:  1h 22m | Hits:  41%/24190 
      🟥 MSVC               Pass:   0%/4   | Total: 36m 54s | Avg:  9m 13s | Max:  9m 45s
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 10m | Hits:  10%/2236  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 13m | Avg: 24m 27s | Max: 27m 11s | Hits:  71%/3627  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 22m | Avg:  1h 22m | Max:  1h 22m | Hits:  16%/1209  
    🟨 std
      🟨 17                 Pass:  70%/20  | Total: 16h 59m | Avg: 50m 58s | Max:  1h 10m | Hits:  16%/16681 
      🟨 20                 Pass:  92%/25  | Total: 19h 11m | Avg: 46m 02s | Max:  1h 22m | Hits:  45%/27556 
    
  • 🟨 cudax: Pass: 90%/22 | Total: 2h 14m | Avg: 6m 06s | Max: 13m 42s | Hits: 94%/10740

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  88%/18  | Total:  1h 59m | Avg:  6m 38s | Max: 13m 42s | Hits:  92%/8512  
      🟩 arm64              Pass: 100%/4   | Total: 14m 41s | Avg:  3m 40s | Max:  3m 52s | Hits:  98%/2228  
    🚨 cxx_family: MSVC 🚨
      🟩 Clang              Pass: 100%/8   | Total: 39m 57s | Avg:  4m 59s | Max: 11m 56s | Hits:  98%/4458  
      🟩 GCC                Pass: 100%/10  | Total: 56m 20s | Avg:  5m 38s | Max: 13m 42s | Hits:  98%/5572  
      🔥 MSVC               Pass:   0%/2   | Total: 19m 47s | Avg:  9m 53s | Max: 10m 01s
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 12s | Avg:  9m 06s | Max:  9m 27s | Hits:  31%/710   
    🔍 gpu: rtx2080 🔍
      🟩 h100               Pass: 100%/2   | Total: 17m 05s | Avg:  8m 32s | Max: 13m 42s | Hits:  98%/1114  
      🔍 rtx2080            Pass:  90%/20  | Total:  1h 57m | Avg:  5m 51s | Max: 12m 30s | Hits:  93%/9626  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  89%/19  | Total:  1h 36m | Avg:  5m 03s | Max: 10m 01s | Hits:  92%/9069  
      🟩 Test               Pass: 100%/3   | Total: 38m 08s | Avg: 12m 42s | Max: 13m 42s | Hits:  99%/1671  
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/4   | Total: 19m 36s | Avg:  4m 54s | Max:  8m 45s | Hits:  86%/2026  
      🔍 20                 Pass:  88%/18  | Total:  1h 54m | Avg:  6m 22s | Max: 13m 42s | Hits:  95%/8714  
    🟨 ctk
      🟥 12.0               Pass:   0%/1   | Total:  9m 46s | Avg:  9m 46s | Max:  9m 46s
      🟩 12.5               Pass: 100%/2   | Total: 18m 12s | Avg:  9m 06s | Max:  9m 27s | Hits:  31%/710   
      🟨 12.8               Pass:  94%/19  | Total:  1h 46m | Avg:  5m 35s | Max: 13m 42s | Hits:  98%/10030 
    🟨 cudacxx
      🟥 nvcc12.0           Pass:   0%/1   | Total:  9m 46s | Avg:  9m 46s | Max:  9m 46s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 12s | Avg:  9m 06s | Max:  9m 27s | Hits:  31%/710   
      🟨 nvcc12.8           Pass:  94%/19  | Total:  1h 46m | Avg:  5m 35s | Max: 13m 42s | Hits:  98%/10030 
    🟨 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s | Hits:  98%/559   
      🟩 Clang15            Pass: 100%/1   | Total:  4m 15s | Avg:  4m 15s | Max:  4m 15s | Hits:  98%/557   
      🟩 Clang16            Pass: 100%/1   | Total:  4m 24s | Avg:  4m 24s | Max:  4m 24s | Hits:  98%/557   
      🟩 Clang17            Pass: 100%/1   | Total:  4m 22s | Avg:  4m 22s | Max:  4m 22s | Hits:  98%/557   
      🟩 Clang18            Pass: 100%/4   | Total: 23m 07s | Avg:  5m 46s | Max: 11m 56s | Hits:  98%/2228  
      🟩 GCC10              Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s | Hits:  98%/559   
      🟩 GCC11              Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s | Hits:  97%/557   
      🟩 GCC12              Pass: 100%/2   | Total: 16m 42s | Avg:  8m 21s | Max: 12m 30s | Hits:  98%/1114  
      🟩 GCC13              Pass: 100%/6   | Total: 31m 47s | Avg:  5m 17s | Max: 13m 42s | Hits:  98%/3342  
      🟥 MSVC14.39          Pass:   0%/1   | Total:  9m 46s | Avg:  9m 46s | Max:  9m 46s
      🟥 MSVC14.42          Pass:   0%/1   | Total: 10m 01s | Avg: 10m 01s | Max: 10m 01s
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 12s | Avg:  9m 06s | Max:  9m 27s | Hits:  31%/710   
    🟨 cudacxx_family
      🟨 nvcc               Pass:  90%/22  | Total:  2h 14m | Avg:  6m 06s | Max: 13m 42s | Hits:  94%/10740 
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 20m 33s | Avg:  6m 51s | Max: 13m 42s | Hits:  98%/1671  
      🟩 90a                Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s | Hits:  98%/557   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 14m 05s | Avg: 7m 02s | Max: 11m 34s | Hits: 97%/296

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 14m 05s | Avg:  7m 02s | Max: 11m 34s | Hits:  97%/296   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 14m 05s | Avg:  7m 02s | Max: 11m 34s | Hits:  97%/296   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 14m 05s | Avg:  7m 02s | Max: 11m 34s | Hits:  97%/296   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 14m 05s | Avg:  7m 02s | Max: 11m 34s | Hits:  97%/296   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 14m 05s | Avg:  7m 02s | Max: 11m 34s | Hits:  97%/296   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 14m 05s | Avg:  7m 02s | Max: 11m 34s | Hits:  97%/296   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 14m 05s | Avg:  7m 02s | Max: 11m 34s | Hits:  97%/296   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 31s | Avg:  2m 31s | Max:  2m 31s | Hits:  95%/148   
      🟩 Test               Pass: 100%/1   | Total: 11m 34s | Avg: 11m 34s | Max: 11m 34s | Hits:  98%/148   
    
  • 🟩 python: Pass: 100%/1 | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 31m 13s | Avg: 31m 13s | Max: 31m 13s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 158)

# Runner
111 linux-amd64-cpu16
15 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Review
Development

Successfully merging this pull request may close these issues.

3 participants