tilelang.layout.gemm_sp¶
Wrapping Layouts.
Functions¶
|
|
|
Make a layout of metadata that is compatible with cutlass sm90 compression kernel. Note that layout atom is the same for smem and gmem. |
|
Make a layout of metadata that is compatible with cutlass sm8x compression kernel. Note that layout atom is the same for smem and gmem. |
|
Module Contents¶
- tilelang.layout.gemm_sp.decompose_col_major(index_1d, basis)¶
- Parameters:
index_1d (int)
basis (list[int])
- Return type:
list[int]
- tilelang.layout.gemm_sp.make_cutlass_metadata_layout_sm90(buffer, mma_dtype, block_k)¶
Make a layout of metadata that is compatible with cutlass sm90 compression kernel. Note that layout atom is the same for smem and gmem.
- Parameters:
buffer (tvm.tir.Buffer) – metadata buffer shape, for sm90 it should be a 8-bit type
mma_dtype (str) – dtype of mma operand A, different dtypes result in different layout atom
block_k (int) – tiling size along K dim, different block_ks results in different layout atom.
- tilelang.layout.gemm_sp.make_cutlass_metadata_layout_sm8x(buffer, mma_dtype)¶
- Make a layout of metadata that is compatible with cutlass sm8x compression kernel. Note that layout atom is the same for smem and gmem.
- Parameters:
buffer (tvm.tir.Buffer) – metadata buffer shape, for sm80 it should be a 16bit type
mma_dtype (str)
- tilelang.layout.gemm_sp.make_cutlass_metadata_layout(buffer, mma_dtype='float16', arch=None, **extra_args)¶
- Parameters:
buffer (tvm.tir.Buffer)
mma_dtype (str)
arch (str | None)