Flash Attention: Fast and Memory-Efficient Exact Attention
conda install conda-forge::flash-attn-layer-norm
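
"Exact" in the title means the memory-efficient computation produces the same output as standard softmax attention rather than an approximation. The following is a minimal NumPy sketch of that idea, assuming a single head and no masking or dropout. It does not use this package's API and is not the CUDA implementation; the function names `naive_attention` and `tiled_attention` and the `block_size` parameter are hypothetical, chosen for illustration only. It processes keys and values one block at a time with a running softmax, so the full score matrix is never stored.

```python
import numpy as np


def naive_attention(q, k, v):
    """Reference attention that materializes the full (n_q, n_k) score matrix."""
    scores = q @ k.T / np.sqrt(q.shape[-1])
    scores = scores - scores.max(axis=-1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v


def tiled_attention(q, k, v, block_size=4):
    """Illustrative block-wise attention using a running (online) softmax.

    Hypothetical helper, not part of any library. Only one
    (n_q, block_size) tile of scores exists at any time.
    """
    n_q, d = q.shape
    scale = 1.0 / np.sqrt(d)
    out = np.zeros((n_q, v.shape[-1]))    # unnormalized output accumulator
    row_max = np.full((n_q, 1), -np.inf)  # running max of scores per query
    row_sum = np.zeros((n_q, 1))          # running softmax denominator

    for start in range(0, k.shape[0], block_size):
        k_blk = k[start:start + block_size]
        v_blk = v[start:start + block_size]
        s = (q @ k_blk.T) * scale

        new_max = np.maximum(row_max, s.max(axis=-1, keepdims=True))
        # Rescale earlier partial results to the new max.
        # On the first block this factor is exp(-inf) = 0.
        correction = np.exp(row_max - new_max)
        p = np.exp(s - new_max)

        row_sum = row_sum * correction + p.sum(axis=-1, keepdims=True)
        out = out * correction + p @ v_blk
        row_max = new_max

    return out / row_sum
```

On random inputs the two functions should agree to floating-point tolerance, for example when compared with `np.allclose`, including when the number of keys is not a multiple of `block_size`.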