
conda-forge / packages / cosma-scalapack 2.8.4

Communication-Optimal Matrix-Multiplication (COSMA)

copied from cf-post-staging / cosma-scalapack

Installers

  • linux-64 v2.8.4
  • osx-64 v2.8.4
  • linux-aarch64 v2.8.4
  • linux-ppc64le v2.8.4
  • osx-arm64 v2.8.4

conda install

To install this package, run the following command:
conda install conda-forge::cosma-scalapack

Description

COSMA is a parallel, high-performance, GPU-accelerated matrix-matrix multiplication algorithm that is communication-optimal for all combinations of matrix dimensions, processor counts, and memory sizes, without any parameter tuning. The key idea behind COSMA is to first derive a tight, optimal sequential schedule and only then parallelize it, preserving I/O optimality between processes. This contrasts with 2D and 3D algorithms, which fix the process domain decomposition upfront and then map it to the matrix dimensions, an approach that may result in asymptotically more communication. The final design of COSMA facilitates the overlap of computation and communication, yielding speedups and allowing the use of modern mechanisms such as RDMA. COSMA may also leave some processors unused in order to optimize the processor grid, which reduces the communication volume even further and increases the computation volume per processor.
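The point about leaving processors unused can be illustrated with a small Python sketch. This is not COSMA's actual cost model or API. It assumes a deliberately simplified model in which the m x n x k iteration space is split over a pm x pn x pk grid and each process communicates roughly the three faces of its block. The function names and the `max_idle_fraction` parameter are hypothetical and exist only for this illustration.

```python
import math


def grids(q):
    """Yield every (pm, pn, pk) factorization with pm * pn * pk == q."""
    for pm in range(1, q + 1):
        if q % pm:
            continue
        rest = q // pm
        for pn in range(1, rest + 1):
            if rest % pn == 0:
                yield pm, pn, rest // pn


def comm_per_proc(m, n, k, grid):
    """Toy per-process communication volume (assumed model, not COSMA's):
    the sizes of the local A, B and C blocks added together."""
    pm, pn, pk = grid
    a_block = (m / pm) * (k / pk)
    b_block = (k / pk) * (n / pn)
    c_block = (m / pm) * (n / pn)
    return a_block + b_block + c_block


def best_grid(m, n, k, p, max_idle_fraction=0.0):
    """Search all grids that use between ceil(p * (1 - max_idle_fraction))
    and p processes; return (cost, processes_used, grid) with minimal cost."""
    min_used = max(1, math.ceil(p * (1 - max_idle_fraction)))
    best = None
    for q in range(min_used, p + 1):
        for grid in grids(q):
            cost = comm_per_proc(m, n, k, grid)
            if best is None or cost < best[0]:
                best = (cost, q, grid)
    return best


if __name__ == "__main__":
    # 17 is prime, so using every process forces a 1D decomposition.
    print(best_grid(1000, 1000, 1000, 17))
    # Allowing up to 10% of the processes to idle admits 16-process grids.
    print(best_grid(1000, 1000, 1000, 17, max_idle_fraction=0.1))
```

Under this toy model, a 1000 x 1000 x 1000 product on 17 processes can only use a 1D grid, whereas allowing one process to idle admits a 2 x 2 x 4 grid on 16 processes with less than half the per-process communication. Each active process then performs 1/16 rather than 1/17 of the multiplications, which is the trade-off the description mentions.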

