cuTile is a programming model for writing parallel kernels for NVIDIA GPUs
copied from cf-post-staging / cutile-pythoncuTile Python is a programming language for NVIDIA GPUs that generates kernels based on Tile IR. It requires NVIDIA Driver r580+ and CUDA Toolkit 13.1+.