CD-HIT is a program for clustering DNA/protein sequence database at high identity with tolerance.
conda install agbiome::cdhit