Phase Transition in Distance-Based Phylogeny Reconstruction

نویسنده

  • Sébastien Roch
چکیده

We introduce a new distance-based phylogeny reconstruction technique which provably achieves, at sufficiently short branch lengths, a logarithmic sequence-length requirement—improving significantly over previous polynomial bounds for distance-based methods and matching existing results for general methods. The technique is based on an averaging procedure that implicitly reconstructs ancestral sequences. In the same token, we extend previous results on phase transitions in phylogeny reconstruction to general time-reversible models. More precisely, we show that in the so-called Kesten-Stigum zone (roughly, a region of the parameter space where ancestral sequences are well approximated by “linear combinations” of the observed sequences) sequences of length O(log n) suffice for reconstruction when branch lengths are discretized. Here n is the number of extant species. Our results challenge, to some extent, the conventional wisdom that estimates of evolutionary distances alone carry significantly less information about phylogenies than full sequence datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On simulated annealing phase transitions in phylogeny reconstruction

Phylogeny reconstruction with global criteria is NP-complete or NP-hard, hence in general requires a heuristic search. We investigate the powerful, physically inspired, general-purpose heuristic simulated annealing, applied to phylogeny reconstruction. Simulated annealing mimics the physical process of annealing, where a liquid is gently cooled to form a crystal. During the search, periods of e...

متن کامل

A stochastic local search algorithm for distance-based phylogeny reconstruction.

In many interesting cases, the reconstruction of a correct phylogeny is blurred by high mutation rates and/or horizontal transfer events. As a consequence, a divergence arises between the true evolutionary distances and the differences between pairs of taxa as inferred from available data, making the phylogenetic reconstruction a challenging problem. Mathematically, this divergence translates i...

متن کامل

Improved Bayesian Phylogenetic Inference in a Statistical Alignment Framework Advanced Software Design for StatAlign

Long-term trends in computational phylogenetics show a steady transition of focus from traditional tree reconstruction methods towards Bayesian approaches. The early distance based techniques such as UPGMA and Neighbour Joining are today considered less accurate primarily due to the loss of information when condensing sequence data into a distance matrix. Maximum parsimony is fast but suffers f...

متن کامل

Genes order and phylogenetic reconstruction: application to γ-Proteobacteria

We study the problem of phylogenetic reconstruction based on gene order for whole genomes. We define three genomic distances between whole genomes represented by signed sequences, based on the matching of similar segments of genes and on the notions of breakpoints, conserved intervals and common intervals. We use these distances and distance based phylogenetic reconstruction methods to compute ...

متن کامل

Distance-Based Phylogeny Reconstruction: Safety and Edge Radius

HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1108.5781  شماره 

صفحات  -

تاریخ انتشار 2009