
Conversation

Contributor

@abhilash1910 abhilash1910 commented Aug 26, 2025

Description

Abstract: The cuda.bindings backend of CUDA Python has NVVM support through the libNVVM API. However, the frontend of CUDA Python does not accept NVVM IR as an input source. Since CUDA Python lets users write host code in a "pythonic DSL" (taking care of launch parameters, etc.), it makes sense to also allow NVVM IR as an alternative input alongside the already supported formats (PTX, C++, LTO IR, etc.).
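As a rough illustration of what this enables, here is a hedged sketch (not code from this PR) of passing NVVM IR to cuda.core's Program; the "nvvm" code-type string and the exact IR accepted depend on the installed CUDA toolkit, and the snippet is guarded so it degrades gracefully without a CUDA environment:

```python
# Hedged sketch: feed NVVM IR (the LLVM IR dialect accepted by libNVVM)
# to cuda.core's Program frontend. The code_type value "nvvm" and the
# minimal IR below are assumptions for illustration; real IR must match
# the NVVM IR version of the installed libNVVM.
nvvm_ir = r"""
target datalayout = "e-i64:64-i128:128-v16:16-v32:32-n16:32:64"
target triple = "nvptx64-nvidia-cuda"

define void @kernel() {
  ret void
}

!nvvm.annotations = !{!0}
!0 = !{void ()* @kernel, !"kernel", i32 1}
"""

try:
    from cuda.core.experimental import Program

    prog = Program(nvvm_ir, code_type="nvvm")  # assumed code-type name
    ptx = prog.compile("ptx")  # lower NVVM IR to PTX via libNVVM
except Exception:
    # cuda.core not installed, or no compatible CUDA toolchain available;
    # the sketch is only meant to show the intended call shape.
    pass
```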

Discussion Link: #906

Fix #452

Changes made (or to be made) in this PR:

  • Added cuda.core linkage to the NVVM counterpart in cuda.bindings
  • Cosmetic changes in the user interface to use the existing NVVM backend of cuda.bindings.

Checklist

  • [ TBD ] New tests need to be added to cover these changes.
  • [ TBD ] The documentation needs to be updated with these changes.

cc @leofang

Contributor

copy-pr-bot bot commented Aug 26, 2025

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@abhilash1910 abhilash1910 marked this pull request as draft August 26, 2025 13:51
Collaborator

@rwgk rwgk left a comment


Low-level review: Apart from the bare except, this looks good to me.

I defer to @leofang for the high-level take.

@leofang leofang self-requested a review August 26, 2025 17:48
@leofang leofang added P0 High priority - Must do! feature New feature or request cuda.core Everything related to the cuda.core module labels Aug 26, 2025
@leofang leofang added this to the cuda.core beta 7 milestone Aug 26, 2025
Member

@leofang leofang left a comment


Thanks, @abhilash1910, left some quick comments, will circle back later.

@abhilash1910 abhilash1910 marked this pull request as ready for review September 1, 2025 17:34
Member

@leofang leofang left a comment


Thanks a lot @abhilash1910! I have reviewed the PR including the tests.

btw please also fix the linter errors. You can check them locally via pre-commit run -a.

Member

@leofang leofang left a comment


Thanks @abhilash1910! Looks very good! A few minor comments for completeness. Let me trigger the CI in the meanwhile.

@abhilash1910
Contributor Author

pre-commit.ci autofix

@leofang
Member

leofang commented Sep 17, 2025

/ok to test e5b5ea4

@leofang leofang enabled auto-merge (squash) September 17, 2025 21:54
Collaborator

@rwgk rwgk left a comment


I only looked at the high-level structure; based on that, and given that we want to do #980: looks good to me.

@leofang
Member

leofang commented Sep 17, 2025

All CI is green except for H100, which is known to have an unusually long queue currently (see the nv-gha-runners discussion). I am impatient, so let me admin-merge before calling it a day.

@leofang leofang disabled auto-merge September 17, 2025 22:38
@leofang leofang merged commit fcfeba0 into NVIDIA:main Sep 17, 2025
48 checks passed
@leofang
Member

leofang commented Sep 17, 2025

Thanks a lot, @abhilash1910, and also @gmarkall @kkraus14 @rwgk !

@abhilash1910
Contributor Author

Thanks a lot @leofang , and @gmarkall @rwgk @kkraus14 for all the reviews. Will follow-up on #981.

@joker-eph
Member

Using textual LLVM IR as an input to libNVVM is documented as deprecated, so I'm quite concerned that cuda-python is adding a new usage of this.

Another issue is that the LLVM textual assembly format is more unstable than the bitcode and has no backward-compatibility guarantee (contrary to LLVM bitcode), which is also likely why this was all deprecated in libNVVM.
I would think this should be documented as, and restricted to, LLVM bitcode input only here.

Even with LLVM bitcode, there is quite a large issue of underlying compatibility with the libNVVM version: contrary to the analogy with C++ and NVRTC or PTX, the LLVM IR isn't versioned in the same way across CUDA versions.

@leofang
Member

leofang commented Sep 25, 2025

Hi @joker-eph:

Using textual LLVM IR as an input to libNVVM is documented as deprecated, so I'm quite concerned that cuda-python is adding a new usage of this.

  1. libNVVM has been stating for several years that the text IR is deprecated, but we have not received any notice that it will actually be removed
  2. if it is removed now, numba-cuda will break right away
  3. I think both bc and text formats are already supported by Program through this PR, because the underlying binding does not care which format a sequence of Python bytes contains. I did tell Abhilash offline that it'd be better to get bc tested as well. I think this ball was dropped along the way.

I would think that this would be documented as, and restricted, to LLVM bitcode input only here.

I am unable to parse this sentence 😛 Could you elaborate?

Even with LLVM bitcode, there is a quite large issue of underlying compatibility with the libNVVM version: contrary to the analogy with C++ and NVRTC or PTX: the LLVM IR isn't versioned in the same way across cuda versions.

This is understood. See how we generate compatible IR in the test and also this thread.
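For context, generating compatible IR starts with asking the installed libNVVM which IR version it accepts. A hedged sketch using the cuda.bindings NVVM module (assuming the binding returns the four components of nvvmIRVersion as a tuple; guarded so it degrades without the bindings installed):

```python
# Hedged sketch: query the NVVM IR version accepted by the installed
# libNVVM (wraps nvvmIRVersion), so IR generators can target it.
try:
    from cuda.bindings import nvvm

    # Assumed to return (majorIR, minorIR, majorDbg, minorDbg).
    major_ir, minor_ir, major_dbg, minor_dbg = nvvm.ir_version()
    print(f"libNVVM accepts IR {major_ir}.{minor_ir}, "
          f"debug metadata {major_dbg}.{minor_dbg}")
except Exception:
    # cuda.bindings not installed or libNVVM not loadable in this
    # environment; the sketch only shows the intended query.
    major_ir = None
```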

@joker-eph
Member

libNVVM has been stating that the text IR is deprecated for several years, but we have not received any notice that it'd be actually removed

I know that, I'm not sure how that addresses my comment though.

if it is removed now, numba-cuda will break right away

Why is numba-cuda using textual IR instead of encoding to bitcode?

I am unable to parse this sentence 😛 Could you elaborate?

Two parts to my sentence:

  1. The documentation for this API should say "This is expected to be used with bitcode."
  2. The code could enforce that we only accept bitcode.

This is understood. See how we generate compatible IR in the test and also #907 (comment).

This is understood by you, maybe, but what about the user who gets exposed to some unsafe APIs that we may break in very subtle ways with future updates?
My concern is that there is a huge footgun hidden in there, and that it isn't a good API to add at all.

@leofang
Member

leofang commented Oct 2, 2025

if it is removed now, numba-cuda will break right away

Why is numba-cuda using textual IR instead of encoding to bitcode?

The short answer is that it needs to patch LLVM text IR. It'd be better if we move this conversation to either the NVIDIA/numba-cuda repo or the internal numba dev channel; happy to continue elsewhere. It is irrelevant here.

  1. The documentation for this API should be "This is expected to use with bitcode"
  2. The code could enforce that we only use bitcode.

Ah ok, thanks. 1 can be added, I think, with a note that the text IR is deprecated upstream. 2 is not possible, as already explained earlier (we can't tell whether the user provides text or bitcode IR without leaking the magic header in public).

This is understood by you maybe, what about the user that gets exposed to some unsafe APIs and that we may break in very subtle ways with future updates? My concerns is that there is huge footgun hidden in there, and that it isn't a good API to add at all.

Our mission is to offer pythonic access to all CUDA components so that whatever users can do in C/C++, they can also do without leaving Python. Unless I misunderstood what you meant, it sounds to me like the concern is "we should not make it easy to access libNVVM in Python"; if so, I'd wholeheartedly disagree 🙂

@joker-eph
Member

with a note that the text IR is deprecated upstream.

Why "upstream"? I generally use "upstream" to refer to the LLVM codebase, but are you referring to libNVVM? It seems weird to me to refer to the underlying library exposed here as "upstream": it is the same product we ship, and cuda-python should just expose it to Python, IMO.

(we can't tell if the user provides text or bitcode IR, without leaking the magic header in the public).

I don't quite understand what you mean by "leaking the magic header"? Checking whether the input is bitcode seems like a trivial check to me: https://github.com/llvm/llvm-project/blob/04c01ff144a172230c053d73eb15831a4120db81/llvm/include/llvm/Bitcode/BitcodeReader.h#L244-L274
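The check being referred to can indeed be sketched in a few lines of pure Python. Hedged: the magic values come from the LLVM bitcode format (raw magic 'BC' 0xC0 0xDE, and the bitcode-wrapper magic 0x0B17C0DE); the function name and structure are mine, not from the PR:

```python
# Hedged sketch: distinguish LLVM bitcode from textual IR by magic number,
# mirroring LLVM's isBitcode/isBitcodeWrapper checks.
def looks_like_bitcode(buf: bytes) -> bool:
    # Raw bitcode starts with 'B' 'C' 0xC0 0xDE.
    if buf[:4] == b"BC\xc0\xde":
        return True
    # Bitcode wrapper files start with the little-endian magic 0x0B17C0DE.
    if len(buf) >= 4 and int.from_bytes(buf[:4], "little") == 0x0B17C0DE:
        return True
    return False
```

Anything that fails this check could then be treated (or rejected) as textual IR.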

Our mission is to offer pythonic access to all CUDA components such that whatever users can do in C/C++, then can also do so without leaving Python. Unless I misunderstood what you meant, to me it sounds like the concern is "we should not make it easy to access libNVVM in Python," if so I'd wholeheartedly disagree 🙂

You are clearly misrepresenting what I wrote: there is a difference between "exposing Python access to all CUDA components" and handing users direct footguns. An important part of API design is to understand these footguns and shape the API to avoid them. Just the fact that you use "pythonic" shows that you're already prepared to deviate from directly binding and exposing "raw" access to everything: this should be about "features" instead.
More importantly, you're again putting aside the wrinkle that this is deprecated (and it was introduced at a time when libNVVM had a single version of LLVM as input; the fact that it is now a moving target is entirely new).


Labels

cuda.core Everything related to the cuda.core module feature New feature or request P0 High priority - Must do!

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Support NVVM IRs as input to Program

6 participants