Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix cuda::std::assume_aligned assertion #4023

Merged
merged 2 commits into from
Mar 5, 2025

Conversation

fbusato
Copy link
Contributor

@fbusato fbusato commented Mar 5, 2025

Description

static_assert needs to check the alignment not the size of the input type

@fbusato fbusato added the 3.0 Targeted for 3.0 release label Mar 5, 2025
@fbusato fbusato requested a review from miscco March 5, 2025 18:58
@fbusato fbusato self-assigned this Mar 5, 2025
@fbusato fbusato requested a review from a team as a code owner March 5, 2025 18:58
Co-authored-by: Bernhard Manfred Gruber <bernhardmgruber@gmail.com>
@fbusato fbusato enabled auto-merge (squash) March 5, 2025 19:56
Copy link
Contributor

github-actions bot commented Mar 5, 2025

🟩 CI finished in 1h 42m: Pass: 100%/158 | Total: 3d 14h | Avg: 32m 44s | Max: 1h 21m | Hits: 55%/250300
  • 🟩 cub: Pass: 100%/45 | Total: 1d 18h | Avg: 57m 13s | Max: 1h 21m | Hits: 34%/53614

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 16h | Avg: 56m 54s | Max:  1h 21m | Hits:  35%/51178 
      🟩 arm64              Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m | Hits:  21%/2436  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 15m | Avg:  1h 03m | Max:  1h 08m | Hits:  20%/5922  
      🟩 12.5               Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 14m | Hits:  17%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  1d 11h | Avg: 55m 32s | Max:  1h 21m | Hits:  37%/45438 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m | Hits:  20%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 15m | Avg:  1h 03m | Max:  1h 08m | Hits:  20%/5922  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 14m | Hits:  17%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 09h | Avg: 55m 06s | Max:  1h 21m | Hits:  38%/43334 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m | Hits:  20%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 16h | Avg: 56m 56s | Max:  1h 21m | Hits:  35%/51510 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 10m | Avg:  1h 02m | Max:  1h 12m | Hits:  21%/4880  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 04m | Hits:  21%/2436  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 05m | Hits:  21%/2436  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 05m | Hits:  21%/2436  
      🟩 Clang18            Pass: 100%/7   | Total:  6h 00m | Avg: 51m 25s | Max:  1h 05m | Hits:  44%/8194  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 06m | Hits:  21%/2440  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m | Hits:  21%/1220  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 07m | Hits:  21%/2440  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 07m | Hits:  21%/2440  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  21%/2436  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  21%/2436  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 03m | Avg: 38m 32s | Max:  1h 17m | Hits:  64%/13398 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 18m | Hits:  12%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 38m | Avg:  1h 19m | Max:  1h 21m | Hits:  12%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 14m | Hits:  17%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 33m | Avg: 58m 28s | Max:  1h 12m | Hits:  31%/20382 
      🟩 GCC                Pass: 100%/22  | Total: 18h 46m | Avg: 51m 11s | Max:  1h 17m | Hits:  42%/26810 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 05m | Avg:  1h 16m | Max:  1h 21m | Hits:  12%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 29m | Avg:  1h 14m | Max:  1h 14m | Hits:  17%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 14m | Avg: 24m 52s | Max: 27m 48s | Hits:  73%/3654  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 13h | Avg:  1h 06m | Max:  1h 21m | Hits:  20%/40216 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 09m | Avg: 31m 12s | Max:  1h 01m | Hits:  80%/9744  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 16h | Avg:  1h 04m | Max:  1h 21m | Hits:  20%/43870 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 11s | Avg: 22m 11s | Max: 22m 11s | Hits:  99%/1218  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 28s | Avg: 17m 28s | Max: 17m 28s | Hits:  99%/1218  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 09m | Avg: 23m 02s | Max: 23m 40s | Hits:  99%/3654  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 04m | Avg: 21m 35s | Max: 23m 28s | Hits:  99%/3654  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 14m | Avg: 24m 52s | Max: 27m 48s | Hits:  73%/3654  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 17m | Avg:  1h 17m | Max:  1h 17m | Hits:  21%/1218  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 22h 05m | Avg:  1h 06m | Max:  1h 18m | Hits:  20%/23591 
      🟩 20                 Pass: 100%/25  | Total: 20h 50m | Avg: 50m 01s | Max:  1h 21m | Hits:  46%/30023 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 22h 27m | Avg: 29m 57s | Max: 1h 05m | Hits: 75%/79956

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 37m 38s | Avg: 18m 49s | Max: 26m 16s | Hits:  88%/3556  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 21h 35m | Avg: 30m 07s | Max:  1h 05m | Hits:  75%/76401 
      🟩 arm64              Pass: 100%/2   | Total: 52m 32s | Avg: 26m 16s | Max: 28m 02s | Hits:  76%/3555  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 57m | Avg: 35m 24s | Max: 59m 04s | Hits:  65%/8881  
      🟩 12.5               Pass: 100%/2   | Total:  1h 40m | Avg: 50m 14s | Max: 52m 39s | Hits:  59%/3554  
      🟩 12.8               Pass: 100%/38  | Total: 17h 50m | Avg: 28m 10s | Max:  1h 05m | Hits:  77%/67521 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 58s | Avg: 25m 29s | Max: 25m 48s | Hits:  76%/3554  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 57m | Avg: 35m 24s | Max: 59m 04s | Hits:  65%/8881  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 40m | Avg: 50m 14s | Max: 52m 39s | Hits:  59%/3554  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 16h 59m | Avg: 28m 19s | Max:  1h 05m | Hits:  77%/63967 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 58s | Avg: 25m 29s | Max: 25m 48s | Hits:  76%/3554  
      🟩 nvcc               Pass: 100%/43  | Total: 21h 36m | Avg: 30m 09s | Max:  1h 05m | Hits:  75%/76402 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 52m | Avg: 28m 08s | Max: 29m 56s | Hits:  77%/7108  
      🟩 Clang15            Pass: 100%/2   | Total:  1h 00m | Avg: 30m 14s | Max: 30m 22s | Hits:  76%/3554  
      🟩 Clang16            Pass: 100%/2   | Total: 54m 34s | Avg: 27m 17s | Max: 27m 33s | Hits:  76%/3554  
      🟩 Clang17            Pass: 100%/2   | Total: 58m 00s | Avg: 29m 00s | Max: 29m 07s | Hits:  76%/3554  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 26m | Avg: 20m 59s | Max: 27m 39s | Hits:  83%/12439 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 07s | Max: 32m 27s | Hits:  63%/3556  
      🟩 GCC8               Pass: 100%/1   | Total: 28m 14s | Avg: 28m 14s | Max: 28m 14s | Hits:  76%/1778  
      🟩 GCC9               Pass: 100%/2   | Total: 59m 25s | Avg: 29m 42s | Max: 30m 42s | Hits:  76%/3556  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 40s | Max: 32m 11s | Hits:  76%/3556  
      🟩 GCC11              Pass: 100%/2   | Total: 59m 04s | Avg: 29m 32s | Max: 30m 01s | Hits:  76%/3556  
      🟩 GCC12              Pass: 100%/2   | Total: 58m 54s | Avg: 29m 27s | Max: 29m 53s | Hits:  76%/3556  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 27m | Avg: 20m 47s | Max: 31m 58s | Hits:  86%/17780 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 58m | Avg: 59m 07s | Max: 59m 10s | Hits:  51%/3542  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 39m | Avg: 53m 11s | Max:  1h 05m | Hits:  50%/5313  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 40m | Avg: 50m 14s | Max: 52m 39s | Hits:  59%/3554  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 12m | Avg: 25m 26s | Max: 30m 22s | Hits:  79%/30209 
      🟩 GCC                Pass: 100%/21  | Total:  8h 57m | Avg: 25m 34s | Max: 32m 27s | Hits:  79%/37338 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 37m | Avg: 55m 33s | Max:  1h 05m | Hits:  50%/8855  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 40m | Avg: 50m 14s | Max: 52m 39s | Hits:  59%/3554  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 30m 42s | Avg: 15m 21s | Max: 19m 11s | Hits:  88%/3556  
      🟩 rtx2080            Pass: 100%/33  | Total: 18h 07m | Avg: 32m 56s | Max:  1h 01m | Hits:  72%/58637 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 50m | Avg: 23m 00s | Max:  1h 05m | Hits:  83%/17763 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 20h 55m | Avg: 33m 02s | Max:  1h 05m | Hits:  72%/67519 
      🟩 TestCPU            Pass: 100%/3   | Total: 48m 01s | Avg: 16m 00s | Max: 32m 25s | Hits:  90%/5326  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 38s | Avg: 11m 09s | Max: 11m 33s | Hits:  99%/7111  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 30m 42s | Avg: 15m 21s | Max: 19m 11s | Hits:  88%/3556  
      🟩 90;90a;100         Pass: 100%/1   | Total: 30m 07s | Avg: 30m 07s | Max: 30m 07s | Hits:  76%/1778  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 11h 40m | Avg: 35m 02s | Max:  1h 01m | Hits:  70%/35531 
      🟩 20                 Pass: 100%/23  | Total: 10h 09m | Avg: 26m 29s | Max:  1h 05m | Hits:  79%/40869 
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 14h 16m | Avg: 19m 55s | Max: 34m 19s | Hits: 50%/104700

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total: 14h 05m | Avg: 20m 37s | Max: 34m 19s | Hits:  48%/98973 
      🟩 arm64              Pass: 100%/2   | Total: 11m 03s | Avg:  5m 31s | Max:  5m 36s | Hits:  92%/5727  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 36m | Avg: 19m 23s | Max: 26m 51s | Hits:  55%/13948 
      🟩 12.5               Pass: 100%/2   | Total:  1h 07m | Avg: 33m 54s | Max: 34m 19s | Hits:  28%/5672  
      🟩 12.8               Pass: 100%/36  | Total: 11h 31m | Avg: 19m 12s | Max: 32m 01s | Hits:  51%/85080 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 42m 44s | Avg: 21m 22s | Max: 22m 39s | Hits:  27%/5688  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 36m | Avg: 19m 23s | Max: 26m 51s | Hits:  55%/13948 
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 07m | Avg: 33m 54s | Max: 34m 19s | Hits:  28%/5672  
      🟩 nvcc12.8           Pass: 100%/34  | Total: 10h 48m | Avg: 19m 05s | Max: 32m 01s | Hits:  53%/79392 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 42m 44s | Avg: 21m 22s | Max: 22m 39s | Hits:  27%/5688  
      🟩 nvcc               Pass: 100%/41  | Total: 13h 33m | Avg: 19m 50s | Max: 34m 19s | Hits:  52%/99012 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 30m | Avg: 22m 36s | Max: 24m 12s | Hits:  31%/11344 
      🟩 Clang15            Pass: 100%/2   | Total: 53m 14s | Avg: 26m 37s | Max: 27m 49s | Hits:  31%/5684  
      🟩 Clang16            Pass: 100%/2   | Total: 48m 54s | Avg: 24m 27s | Max: 24m 29s | Hits:  31%/5684  
      🟩 Clang17            Pass: 100%/2   | Total: 47m 55s | Avg: 23m 57s | Max: 24m 44s | Hits:  31%/5684  
      🟩 Clang18            Pass: 100%/6   | Total:  1h 53m | Avg: 18m 57s | Max: 26m 33s | Hits:  42%/14235 
      🟩 GCC7               Pass: 100%/2   | Total: 11m 26s | Avg:  5m 43s | Max:  5m 55s | Hits:  92%/5622  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 41s | Avg:  5m 41s | Max:  5m 41s | Hits:  92%/2821  
      🟩 GCC9               Pass: 100%/2   | Total: 46m 22s | Avg: 23m 11s | Max: 25m 44s | Hits:  31%/5634  
      🟩 GCC10              Pass: 100%/2   | Total: 47m 35s | Avg: 23m 47s | Max: 25m 07s | Hits:  31%/5690  
      🟩 GCC11              Pass: 100%/2   | Total: 47m 02s | Avg: 23m 31s | Max: 24m 11s | Hits:  31%/5686  
      🟩 GCC12              Pass: 100%/2   | Total: 48m 23s | Avg: 24m 11s | Max: 25m 16s | Hits:  31%/5686  
      🟩 GCC13              Pass: 100%/10  | Total:  1h 51m | Avg: 11m 10s | Max: 26m 59s | Hits:  80%/14496 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 55m 28s | Avg: 27m 44s | Max: 28m 37s | Hits:  92%/5348  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 19s | Max: 32m 01s | Hits:  92%/5414  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 07m | Avg: 33m 54s | Max: 34m 19s | Hits:  28%/5672  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/16  | Total:  5h 54m | Avg: 22m 08s | Max: 27m 49s | Hits:  34%/42631 
      🟩 GCC                Pass: 100%/21  | Total:  5h 18m | Avg: 15m 09s | Max: 26m 59s | Hits:  58%/45635 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 56m | Avg: 29m 01s | Max: 32m 01s | Hits:  92%/10762 
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 54s | Max: 34m 19s | Hits:  28%/5672  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 18m 31s | Avg:  9m 15s | Max: 13m 31s | Hits:  93%/2953  
      🟩 rtx2080            Pass: 100%/41  | Total: 13h 57m | Avg: 20m 26s | Max: 34m 19s | Hits:  49%/101747
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 12h 59m | Avg: 21m 04s | Max: 34m 19s | Hits:  50%/104660
      🟩 NVRTC              Pass: 100%/2   | Total: 33m 54s | Avg: 16m 57s | Max: 18m 15s | Hits:  90%/40    
      🟩 Test               Pass: 100%/3   | Total: 40m 46s | Avg: 13m 35s | Max: 14m 53s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 10s | Avg:  2m 10s | Max:  2m 10s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 33m 54s | Avg: 16m 57s | Max: 18m 15s | Hits:  90%/40    
      🟩 90                 Pass: 100%/2   | Total: 18m 31s | Avg:  9m 15s | Max: 13m 31s | Hits:  93%/2953  
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 28s | Avg:  6m 28s | Max:  6m 28s | Hits:  93%/2953  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  7h 17m | Avg: 20m 50s | Max: 33m 30s | Hits:  52%/55967 
      🟩 20                 Pass: 100%/21  | Total:  6h 56m | Avg: 19m 50s | Max: 34m 19s | Hits:  48%/48733 
    
  • 🟩 cudax: Pass: 100%/22 | Total: 5h 16m | Avg: 14m 22s | Max: 18m 36s | Hits: 51%/11722

    🟩 cpu
      🟩 amd64              Pass: 100%/18  | Total:  4h 18m | Avg: 14m 23s | Max: 18m 36s | Hits:  54%/9406  
      🟩 arm64              Pass: 100%/4   | Total: 57m 21s | Avg: 14m 20s | Max: 15m 38s | Hits:  40%/2316  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 15m 42s | Avg: 15m 42s | Max: 15m 42s | Hits:  56%/277   
      🟩 12.5               Pass: 100%/2   | Total: 18m 47s | Avg:  9m 23s | Max:  9m 36s | Hits:  58%/742   
      🟩 12.8               Pass: 100%/19  | Total:  4h 41m | Avg: 14m 49s | Max: 18m 36s | Hits:  50%/10703 
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 15m 42s | Avg: 15m 42s | Max: 15m 42s | Hits:  56%/277   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 47s | Avg:  9m 23s | Max:  9m 36s | Hits:  58%/742   
      🟩 nvcc12.8           Pass: 100%/19  | Total:  4h 41m | Avg: 14m 49s | Max: 18m 36s | Hits:  50%/10703 
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/22  | Total:  5h 16m | Avg: 14m 22s | Max: 18m 36s | Hits:  51%/11722 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total: 14m 40s | Avg: 14m 40s | Max: 14m 40s | Hits:  41%/581   
      🟩 Clang15            Pass: 100%/1   | Total: 17m 25s | Avg: 17m 25s | Max: 17m 25s | Hits:  40%/579   
      🟩 Clang16            Pass: 100%/1   | Total: 16m 56s | Avg: 16m 56s | Max: 16m 56s | Hits:  40%/579   
      🟩 Clang17            Pass: 100%/1   | Total: 18m 23s | Avg: 18m 23s | Max: 18m 23s | Hits:  40%/579   
      🟩 Clang18            Pass: 100%/4   | Total: 56m 01s | Avg: 14m 00s | Max: 16m 52s | Hits:  55%/2316  
      🟩 GCC10              Pass: 100%/1   | Total: 17m 28s | Avg: 17m 28s | Max: 17m 28s | Hits:  40%/581   
      🟩 GCC11              Pass: 100%/1   | Total: 17m 12s | Avg: 17m 12s | Max: 17m 12s | Hits:  40%/579   
      🟩 GCC12              Pass: 100%/2   | Total: 31m 57s | Avg: 15m 58s | Max: 18m 36s | Hits:  70%/1158  
      🟩 GCC13              Pass: 100%/6   | Total:  1h 19m | Avg: 13m 11s | Max: 15m 38s | Hits:  50%/3474  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 42s | Avg: 15m 42s | Max: 15m 42s | Hits:  56%/277   
      🟩 MSVC14.42          Pass: 100%/1   | Total: 12m 35s | Avg: 12m 35s | Max: 12m 35s | Hits:  56%/277   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 47s | Avg:  9m 23s | Max:  9m 36s | Hits:  58%/742   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total:  2h 03m | Avg: 15m 25s | Max: 18m 23s | Hits:  48%/4634  
      🟩 GCC                Pass: 100%/10  | Total:  2h 25m | Avg: 14m 34s | Max: 18m 36s | Hits:  52%/5792  
      🟩 MSVC               Pass: 100%/2   | Total: 28m 17s | Avg: 14m 08s | Max: 15m 42s | Hits:  56%/554   
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 47s | Avg:  9m 23s | Max:  9m 36s | Hits:  58%/742   
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 26m 13s | Avg: 13m 06s | Max: 14m 19s | Hits:  70%/1158  
      🟩 rtx2080            Pass: 100%/20  | Total:  4h 50m | Avg: 14m 30s | Max: 18m 36s | Hits:  49%/10564 
    🟩 jobs
      🟩 Build              Pass: 100%/19  | Total:  4h 37m | Avg: 14m 34s | Max: 18m 36s | Hits:  42%/9985  
      🟩 Test               Pass: 100%/3   | Total: 39m 14s | Avg: 13m 04s | Max: 14m 19s | Hits:  99%/1737  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total: 37m 01s | Avg: 12m 20s | Max: 14m 19s | Hits:  60%/1737  
      🟩 90a                Pass: 100%/1   | Total: 12m 24s | Avg: 12m 24s | Max: 12m 24s | Hits:  40%/579   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 47m 14s | Avg: 11m 48s | Max: 14m 08s | Hits:  43%/2108  
      🟩 20                 Pass: 100%/18  | Total:  4h 29m | Avg: 14m 56s | Max: 18m 36s | Hits:  53%/9614  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 24s | Avg: 7m 42s | Max: 12m 57s | Hits: 97%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 24s | Avg:  7m 42s | Max: 12m 57s | Hits:  97%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 24s | Avg:  7m 42s | Max: 12m 57s | Hits:  97%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 24s | Avg:  7m 42s | Max: 12m 57s | Hits:  97%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 24s | Avg:  7m 42s | Max: 12m 57s | Hits:  97%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 24s | Avg:  7m 42s | Max: 12m 57s | Hits:  97%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 24s | Avg:  7m 42s | Max: 12m 57s | Hits:  97%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 24s | Avg:  7m 42s | Max: 12m 57s | Hits:  97%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 27s | Avg:  2m 27s | Max:  2m 27s | Hits:  95%/154   
      🟩 Test               Pass: 100%/1   | Total: 12m 57s | Avg: 12m 57s | Max: 12m 57s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 01m | Avg: 1h 01m | Max: 1h 01m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 158)

# Runner
111 linux-amd64-cpu16
15 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
5 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1

@fbusato fbusato merged commit 9908bf5 into NVIDIA:main Mar 5, 2025
169 of 172 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
3.0 Targeted for 3.0 release
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

3 participants