ScaLAPACK is a library of high-performance linear algebra routines for parallel distributed memory machines