Point estimates in phylogenetic reconstructions
نویسندگان
چکیده
MOTIVATION The construction of statistics for summarizing posterior samples returned by a Bayesian phylogenetic study has so far been hindered by the poor geometric insights available into the space of phylogenetic trees, and ad hoc methods such as the derivation of a consensus tree makeup for the ill-definition of the usual concepts of posterior mean, while bootstrap methods mitigate the absence of a sound concept of variance. Yielding satisfactory results with sufficiently concentrated posterior distributions, such methods fall short of providing a faithful summary of posterior distributions if the data do not offer compelling evidence for a single topology. RESULTS Building upon previous work of Billera et al., summary statistics such as sample mean, median and variance are defined as the geometric median, Fréchet mean and variance, respectively. Their computation is enabled by recently published works, and embeds an algorithm for computing shortest paths in the space of trees. Studying the phylogeny of a set of plants, where several tree topologies occur in the posterior sample, the posterior mean balances correctly the contributions from the different topologies, where a consensus tree would be biased. Comparisons of the posterior mean, median and consensus trees with the ground truth using simulated data also reveals the benefits of a sound averaging method when reconstructing phylogenetic trees. AVAILABILITY AND IMPLEMENTATION We provide two independent implementations of the algorithm for computing Fréchet means, geometric medians and variances in the space of phylogenetic trees. TFBayes: https://github.com/pbenner/tfbayes, TrAP: https://github.com/bacak/TrAP.
منابع مشابه
Do complex population histories drive higher estimates of substitution rate in phylogenetic reconstructions?
Our curiosity about biodiversity compels us to reconstruct the evolutionary past of species. Molecular evolutionary theory now allows parameterization of mathematically sophisticated and detailed models of DNA evolution, which have resulted in a wealth of phylogenetic histories. But reconstructing how species and population histories have played out is critically dependent on the assumptions we...
متن کاملProblems and Cautions With Sequence Mismatch Analysis and Bayesian Skyline Plots to Infer Historical Demography.
Sequence mismatch analysis (MMA) and Bayesian skyline plots (BSP) are commonly used to reconstruct historical demography. A survey of 173 research articles (2009-2014), which included estimates of historical population sizes from mtDNA or cpDNA, shows a widespread genetic signature of demographic or spatial population expansion in species of all major taxonomic groups. Associating these expansi...
متن کاملCounting ancestral reconstructions in a fixed phylogeny
We give formulas for calculating in polynomial time the number of ancestral reconstructions for a tree with binary leafand root labels for each number of 0 → 1 and 1 → 0 arcs. For trees of fixed degree, the corresponding numbers of 0 → 0 and 1 → 1 arcs can be deduced. We calculate intervals for the relative cost of 0 → 1 and 1 → 0 transitions over which the same labelings remain the cheapest.
متن کاملWhich came first: The lizard or the egg? Robustness in phylogenetic reconstruction of ancestral states.
Changes in parity mode between egg-laying (oviparity) and live-bearing (viviparity) have occurred repeatedly throughout vertebrate evolution. Oviparity is the ancestral amniote state, and viviparity has evolved many times independently within amniotes (especially in lizards and snakes), with possibly a few reversions to oviparity. In amniotes, the shelled egg is considered a complex structure t...
متن کاملTimetrees: beyond cladograms, phenograms, and phylograms
For several historical reasons discussed herein, until recently the absolute temporal dimension of many phylogenetic trees has been relatively ignored whereas the branching (cladistic) aspect typically has been the focus of most phylogeny-reconstruction efforts. This unfortunate neglect of “timetrees” is now being remedied, as this book will attest. Many scientifi c benefi ts can emerge from su...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 30 شماره
صفحات -
تاریخ انتشار 2014