-
Notifications
You must be signed in to change notification settings - Fork 191
Closed
2 / 22 of 2 issues completedLabels
P0High priority - Must do!High priority - Must do!cuda.coreEverything related to the cuda.core moduleEverything related to the cuda.core moduleenhancementAny code-related improvementsAny code-related improvements
Milestone
Description
This issue tracks an internal discussion with QA. This simple snippet shows why using cuda.core
today on Windows might fail, depending on if it's TCC or WDDM mode:
>>> from cuda import cuda, cudart
>>> print(cudart.cudaGetDevice())
(<cudaError_t.cudaSuccess: 0>, 0)
>>> print(cuda.cuDeviceGetMemPool(0))
(<CUresult.CUDA_ERROR_NOT_SUPPORTED: 801>, <CUmemoryPool 0x0>)
cuda.core
currently assumes the stream-ordered memory allocator is available. However, CUDA on Windows is a bit more complicated than on Linux, since there are two operation modes:
- In the WDDM mode (which is the case during
cuda.core
development), things should work just fine. - In the TCC mode (as reported by QA), this is unsupported
We need some treatments to make it usable on TCC.
Sub-issues
Metadata
Metadata
Assignees
Labels
P0High priority - Must do!High priority - Must do!cuda.coreEverything related to the cuda.core moduleEverything related to the cuda.core moduleenhancementAny code-related improvementsAny code-related improvements