Sound Colless-like balance indices for multifurcating trees
Abstract
The Colless index is one of the most popular and natural balance indices for bifurcating phylogenetic trees, but it makes no sense for multifurcating trees. In this paper we propose a family of Colless-like balance indices CD,f, which depend on a dissimilarity D and a function f:N R≥ 0, that generalize the Colless index to multifurcating phylogenetic trees. We provide two functions f such that the most balanced phylogenetic trees according to the corresponding indices CD,f are exactly the fully symmetric ones. Next, for each one of these two functions f and for three popular dissimilarities D (the variance, the standard deviation, and the mean deviation from the median), we determine the range of values of CD,f on the sets of phylogenetic trees with a given number n of leaves. We end the paper by assessing the performance of one of these indices on TreeBASE and using it to show that the trees in this database do not seem to follow either the uniform model for multifurcating trees or the α-γ-model, for any values of α and γ.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.