cutlass

Anaconda Verified

CUDA Templates for Linear Algebra Subroutines

Versions

Installation

To install this package, run one of the following:

Conda

$conda install anaconda::cutlass

Usage Tracking

Version

4 / 8 versions selected

Downloads (Last 6 months): 0

Description

CUTLASS is a collection of abstractions for implementing high-performance matrix-matrix multiplication (GEMM) and related computations at all levels and scales within CUDA. It incorporates strategies for hierarchical decomposition and data movement. CUTLASS decomposes these "moving parts" into reusable, modular software components and abstractions.

About

Summary

CUDA Templates for Linear Algebra Subroutines

Last Updated

May 29, 2026 at 15:41

License

BSD-3-Clause

Supported Platforms

linux-aarch64

linux-64

win-64

Home

https://github.com/NVIDIA/cutlass

GitHub Repository

https://github.com/NVIDIA/cutlass

Documentation

https://docs.nvidia.com/cutlass