Probabilities of unranked and ranked anomaly zones under birth-death models
Abstract
A labeled gene tree topology that is more probable than the labeled gene tree topology matching a species tree is called anomalous. Species trees that can generate such anomalous gene trees are said to be in the anomaly zone. Here, probabilities of unranked and ranked gene tree topologies under the multispecies coalescent are considered. A ranked tree depicts not only the topological relationship among gene lineages, as an unranked tree does, but also the sequence in which the lineages coalesce. In this article, we study how the parameters of a species tree simulated under a constant rate birth-death process can affect the probability that the species tree lies in the anomaly zone. We find that with more than five taxa, it is possible for species trees have both AGTs and ARGTs. The probability of being in either type of anomaly zones increases with more taxa. The probability of AGTs also increases with higher speciation rates. We observe that the probabilities of unranked anomaly zones are higher and grow much faster than those of ranked anomaly zones as the speciation rate increases. Our simulation shows that the most probable ranked gene tree is likely to have the same unranked topology as the species tree. We design the software PRANC which computes probabilities of ranked gene tree topologies given a species tree under the coalescent model.
Turn this paper into a lesson
ArcXiv compiles a structured reading guide from this paper's metadata: plain-English importance, contributions, prerequisite concepts, which sections to read first, flashcards, and a quiz. Grounded in the abstract, never invented.