Fix non-release kernel builds via CudaBuilder #322

LegNeato · 2025-11-26T04:17:44Z

First, we weren't handling all the types.
After fixing that, it exposed a libnvvm crash.
Also saw a type issue in one of the warp APIs used by vecadd so fixed that.

Fixes #320

First, we weren't handling all the types. After fixing that, it exposed a `libnvvm` crash. Also saw a type issue in one of the warp APIs used by vecadd so fixed that. Fixes Rust-GPU#320

nnethercote

One problem, one question. It would be nice to have some kind of test added to avoid regressing, too, though I'm not sure what that would look like.

nnethercote · 2025-11-30T22:12:19Z

crates/cuda_std/src/warp.rs

    extern "C" {
        #[link_name = "llvm.nvvm.match.any.sync.i64"]
-        fn __nvvm_warp_match_any_64(mask: u32, value: u64) -> u32;
+        fn __nvvm_warp_match_any_64(mask: u32, value: u64) -> u64;


This looks wrong. https://docs.nvidia.com/cuda/nvvm-ir-spec/index.html has this:

declare i32 @llvm.nvvm.match.any.sync.i32(i32 %membermask, i32 %value) declare i32 @llvm.nvvm.match.any.sync.i64(i32 %membermask, i64 %value) declare {i32, i1} @llvm.nvvm.match.all.sync.i32(i32 %membermask, i32 %value) declare {i32, i1} @llvm.nvvm.match.all.sync.i64(i32 %membermask, i64 %value)

Not sure about the signed/unsigned mismatches, but the return value is definitely 32-bits.

Aside: The match_all_{32,64} functions below don't have link_name attributes the way the match_any_{32,64} functions do. Not sure if this is valid. I suspect these functions aren't tested at all!

Anyway, I think this change should be reverted.

Oh, hm. Ok, will revert.

nnethercote · 2025-11-30T22:20:36Z

crates/rustc_codegen_nvvm/src/ty.rs

+            TypeKind::Vector | TypeKind::ScalableVector => {
+                // Recurse on element type for vector floats
+                self.float_width(self.element_type(ty))
+            }


Are all of Half/BFloat/Vector/ScalableVector needed to fix the issue? I see that rustc_codegen_llvm only has Half. Seems wise to only add code that's necessary for the fix (and thus has some level of testing).

I think we need at least BFloat as well but I'll double check.

LegNeato · 2025-12-01T00:15:32Z

What do you think about changing the default here to be keyed off of #[cfg(debug_assertions)]? then we can just test the examples in debug and release modes...though I guess it won't test the manual override / host and device split case so we wouldn't be any better off test matrix-wise.

nnethercote · 2025-12-01T02:11:56Z

What do you think about changing the default here to be keyed off of #[cfg(debug_assertions)]?

sounds ok

Fix non-release kernel builds via CudaBuilder

ca2eed6

First, we weren't handling all the types. After fixing that, it exposed a `libnvvm` crash. Also saw a type issue in one of the warp APIs used by vecadd so fixed that. Fixes Rust-GPU#320

LegNeato requested a review from nnethercote November 29, 2025 03:27

nnethercote requested changes Nov 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix non-release kernel builds via CudaBuilder #322

Fix non-release kernel builds via CudaBuilder #322

Uh oh!

LegNeato commented Nov 26, 2025

Uh oh!

nnethercote left a comment

Uh oh!

nnethercote Nov 30, 2025

Uh oh!

LegNeato Dec 1, 2025

Uh oh!

nnethercote Nov 30, 2025

Uh oh!

LegNeato Dec 1, 2025

Uh oh!

LegNeato commented Dec 1, 2025

Uh oh!

nnethercote commented Dec 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix non-release kernel builds via CudaBuilder #322

Are you sure you want to change the base?

Fix non-release kernel builds via CudaBuilder #322

Uh oh!

Conversation

LegNeato commented Nov 26, 2025

Uh oh!

nnethercote left a comment

Choose a reason for hiding this comment

Uh oh!

nnethercote Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

LegNeato Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

nnethercote Nov 30, 2025

Choose a reason for hiding this comment

Uh oh!

LegNeato Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

LegNeato commented Dec 1, 2025

Uh oh!

nnethercote commented Dec 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants