Programming Guides Overview¶
This section provides a practical guide to writing high‑performance kernels with Tile Language (tile‑lang). It mirrors the structure of a similar guide in another project and adapts it to tile‑lang concepts and APIs.
Audience: Developers implementing custom GPU/CPU kernels with tile‑lang
Prereqs: Basic Python, NumPy/Tensor concepts, and familiarity with GPU programming notions
Scope: Language basics, control flow, instructions, autotuning, and type system
What You’ll Learn¶
How to structure kernels with TileLang’s core DSL constructs
How to move data across global/shared/fragment and pipeline compute
How to apply autotuning to tile sizes and schedules
How to specify and work with dtypes in kernels
Suggested Reading Order¶
Language Basics
Control Flow
Instructions
Autotuning
Type System