Fast and accurate set similarity estimation via containment min hash (for genomic datasets).
conda install bioconda::cmash