CD-HIT is a program for clustering DNA or protein sequence databases at high identity with tolerance.
anaconda login
conda install ostrokach::cd-hit
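
Once the package is installed, a first clustering run might look like the sketch below. The wrapper `cluster_fasta`, its argument order, and the 0.9 default threshold are hypothetical choices made for illustration; only the `cd-hit` options `-i`, `-o`, `-c`, and `-n` come from the tool itself. The function first checks that the executable is on `PATH`, so it fails with a clear message if the install did not succeed.

```shell
# Hypothetical helper (not part of CD-HIT): cluster a FASTA file.
# Usage: cluster_fasta INPUT.fasta OUTPUT_PREFIX [IDENTITY]
cluster_fasta() {
    input="$1"
    output="$2"
    identity="${3:-0.9}"   # assumed default for this sketch

    # Fail early if the install step did not put cd-hit on PATH.
    if ! command -v cd-hit >/dev/null 2>&1; then
        echo "cd-hit not found on PATH" >&2
        return 127
    fi

    # -i  input FASTA file
    # -o  output file for representative sequences
    # -c  sequence identity threshold
    # -n  word length (5 suits protein thresholds from 0.7 to 1.0)
    cd-hit -i "$input" -o "$output" -c "$identity" -n 5
}
```

For example, `cluster_fasta proteins.fasta proteins_nr 0.9` would write the representative sequences to `proteins_nr` and the cluster listing to `proteins_nr.clstr`, where `proteins.fasta` is a placeholder for your own input file.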