Clumps: A Sequential Clustering Approach to Partitioning Sets of Phylogenies with Non-Identical Leaf Sets
Clumps: A Sequential Clustering Approach to Partitioning Sets of Phylogenies with Non-Identical Leaf Sets
Serra Silva, A.; Wilkinson, M.
AbstractPost-processing of sets of inferred phylogenetic trees often focuses on the canonical case of consensus of (multi)sets of phylogenetic trees on the same leaf set. However, with growing numbers of phylogenomic studies resorting to summary and/or supertree methods to obtain a phylogeny, the amount of (multi)sets of trees with non-identical leaf sets also increases. In an attempt identify trees with non-identical leaf sets that are topologically similar, we define a new sequential subsetting approach, \"clumps of trees\", based on the distance between any tree in a set and the set\'s supertree. While clumps were developed with trees with non-identical leaf sets in mind, they can be applied to (multi)sets of trees with identical taxonomic sampling. Unlike islands of trees, clumps will not always be mutually exclusive, thus making them more similar to families of trees.