tilelang.carver.arch.driver.cuda_driver
Module Contents
- class tilelang.carver.arch.driver.cuda_driver.cudaDeviceAttrNames
- cudaDevAttrMaxThreadsPerBlock: int = 1
- cudaDevAttrMaxRegistersPerBlock: int = 12
- cudaDevAttrMaxPersistingL2CacheSize: int = 108
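The three integer values above coincide with the corresponding members of the CUDA runtime's `cudaDeviceAttr` enumeration. The base class of `cudaDeviceAttrNames` is not shown on this page, so the following is only a minimal stand-alone sketch that mirrors the constants with a standard-library `IntEnum`; the name `DeviceAttrSketch` is hypothetical and not part of tilelang.

```python
from enum import IntEnum


class DeviceAttrSketch(IntEnum):
    """Hypothetical mirror of the attribute IDs listed above (not the tilelang class)."""

    cudaDevAttrMaxThreadsPerBlock = 1
    cudaDevAttrMaxRegistersPerBlock = 12
    cudaDevAttrMaxPersistingL2CacheSize = 108


# IntEnum members behave like plain ints, so they can be used wherever
# an integer attribute ID is expected.
attr_id = int(DeviceAttrSketch.cudaDevAttrMaxRegistersPerBlock)
print(attr_id)
```

Because `get_device_attribute` is documented as taking `attr (int)`, an integer ID of this kind is presumably what it expects as its first argument.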
- tilelang.carver.arch.driver.cuda_driver.get_cuda_device_properties(device_id=0)
- Parameters:
device_id (int)
- Return type:
torch.cuda._CudaDeviceProperties | None
- tilelang.carver.arch.driver.cuda_driver.get_device_name(device_id=0)
- Parameters:
device_id (int)
- Return type:
str | None
- Parameters:
device_id (int)
format (str)
- Return type:
int | None
- tilelang.carver.arch.driver.cuda_driver.get_device_attribute(attr, device_id=0)
- Parameters:
attr (int)
device_id (int)
- Return type:
int
Get the maximum dynamic shared memory size in bytes, kilobytes, or megabytes.
- Parameters:
device_id (int)
format (str)
- Return type:
int | None
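The `format (str)` parameter selects the unit of the returned size. The accepted strings are not listed on this page, so the sketch below assumes the spellings `"bytes"`, `"kb"`, and `"mb"` and integer (floor) division; `convert_size` is a hypothetical helper written for illustration, not a tilelang function.

```python
# Assumed unit spellings; the real function may accept different strings.
_UNIT_DIVISORS = {"bytes": 1, "kb": 1024, "mb": 1024 * 1024}


def convert_size(size_bytes: int, fmt: str = "bytes") -> int:
    """Convert a byte count to the requested unit using floor division."""
    try:
        divisor = _UNIT_DIVISORS[fmt]
    except KeyError:
        raise ValueError(f"unsupported format: {fmt!r}") from None
    return size_bytes // divisor


# 101376 bytes is exactly 99 KiB.
print(convert_size(101376, "kb"))
```

Floor division keeps the result an `int`, matching the documented return type; a caller that needs fractional megabytes would have to work from the byte value instead.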
- tilelang.carver.arch.driver.cuda_driver.get_persisting_l2_cache_max_size(device_id=0)
- Parameters:
device_id (int)
- Return type:
int
- tilelang.carver.arch.driver.cuda_driver.get_num_sms(device_id=0)
Get the number of streaming multiprocessors (SMs) on the CUDA device.
- Parameters:
device_id (int, optional) -- The CUDA device ID. Defaults to 0.
- Returns:
The number of SMs on the device.
- Return type:
int
- Raises:
RuntimeError -- If the device properties cannot be retrieved.
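Since `get_num_sms` is documented to raise `RuntimeError` when the device properties are unavailable, callers may want a guarded wrapper. The following is a minimal sketch under stated assumptions: `safe_num_sms` and `clamp_grid` are hypothetical helpers, the fallback value is arbitrary, and the broad `except` is used only so the example also runs on machines without tilelang or a CUDA device.

```python
def safe_num_sms(device_id: int = 0, fallback: int = 1) -> int:
    """Return the SM count, or `fallback` if it cannot be queried."""
    try:
        # Imported lazily so this sketch still runs without tilelang installed.
        from tilelang.carver.arch.driver.cuda_driver import get_num_sms

        return get_num_sms(device_id)
    except Exception:
        # Covers the documented RuntimeError as well as a missing
        # tilelang installation or an unusable CUDA runtime.
        return fallback


def clamp_grid(num_tiles: int, num_sms: int) -> int:
    """Launch at most one block per SM, but never more blocks than tiles."""
    return max(1, min(num_tiles, num_sms))


num_sms = safe_num_sms()
print(clamp_grid(1000, num_sms))
```

Clamping the grid to the SM count is one common way to size a persistent-style launch; whether it suits a given kernel depends on occupancy and is outside what this page documents.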
- tilelang.carver.arch.driver.cuda_driver.get_registers_per_block(device_id=0)
Get the maximum number of 32-bit registers available per block.
- Parameters:
device_id (int)
- Return type:
int