bioconductor-sitepath
Phylogeny-based sequence clustering with site polymorphism
Site polymorphism is one way to cluster DNA or protein sequences, but sequences that share the same polymorphism at a single site can still be genetically distant. This package clusters sequences using site polymorphism together with their corresponding phylogenetic tree. Because each sequence's location on the tree is taken into account, only structurally adjacent sequences are clustered together. Adjacent sequences do not necessarily share the same polymorphism, however, so a branch-and-bound-like algorithm is used to minimize the entropy, which represents the purity of site polymorphism within each cluster.
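To make the entropy criterion concrete, here is a minimal Python sketch of how purity at a single site might be scored. It is not the package's API or its actual algorithm: the function names `site_entropy` and `weighted_entropy`, the use of Shannon entropy in bits, the size-weighted average, and the example residues are all assumptions made for illustration.

```python
from collections import Counter
from math import log2


def site_entropy(residues):
    """Shannon entropy (in bits) of the residues seen at one site.

    Hypothetical helper: 0 means every sequence in the cluster carries
    the same residue (a pure cluster); larger values mean more mixing.
    """
    counts = Counter(residues)
    total = sum(counts.values())
    # p * log2(1 / p) summed over the observed residues
    return sum((n / total) * log2(total / n) for n in counts.values())


def weighted_entropy(clusters):
    """Size-weighted mean entropy over a list of clusters (assumed score)."""
    total = sum(len(cluster) for cluster in clusters)
    return sum(len(c) / total * site_entropy(c) for c in clusters)


# Made-up residues at one site, listed in the order the tips appear on a tree.
tips = ["A", "A", "A", "T", "T", "A", "T", "T"]

# Two candidate groupings of tree-adjacent tips.
one_cluster = [tips]
two_clusters = [tips[:3], tips[3:]]

if __name__ == "__main__":
    print("one cluster: ", weighted_entropy(one_cluster))
    print("two clusters:", weighted_entropy(two_clusters))
```

Under these assumptions, splitting the tips into two adjacent groups gives a lower score than keeping them together, which is the kind of comparison an entropy-minimizing search would make when deciding where to place cluster boundaries along the tree.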
Summary
Phylogeny-based sequence clustering with site polymorphism
Last Updated
Dec 14, 2024 at 16:46
License
MIT + file LICENSE
Total Downloads
32.9K
Supported Platforms