Success of Phylogenetic Methods in the Four - Taxon Case
نویسندگان
چکیده
?The success of 16 methods of phylogenetic inference was examined using consis? tency and simulation analysis. Success?the frequency with which a tree-making method cor? rectly identified the true phylogeny?was examined for an unrooted four-taxon tree. In this study, tree-making methods were examined under a large number of branch-length conditions and under three models of sequence evolution. The results are plotted to facilitate comparisons among the methods. The consistency analysis indicated which methods converge on the correct tree given infinite sample size. General parsimony, transversion parsimony, and weighted par? simony are inconsistent over portions of the graph space examined, although the area of incon? sistency varied. Lake's method of invariants consistently estimated phylogeny over all of the graph space when the model of sequence evolution matched the assumptions of the invariants method. However, when one of the assumptions of the invariants method was violated, Lake's method of invariants became inconsistent over a large portion of the graph space. In general, the distance methods (neighbor joining, weighted least squares, and unweighted least squares) consistently estimated phylogeny over all of the graph space examined when the assumptions of the distance correction matched the model of evolution used to generate the model trees. When the assumptions of the distance methods were violated, the methods became inconsistent over portions of the graph space. UPGMA was inconsistent over a large area of the graph space, no matter which distance was used. The simulation analysis showed how tree-making methods perform given limited numbers of character data. In some instances, the simulation results differed quantitatively from the consistency analysis. The consistency analysis indicated that Lake's method of invariants was consistent over all of the graph space under some conditions, whereas the simulation analysis showed that Lake's method of invariants performs poorly over most of the graph space for up to 500 variable characters. Parsimony, neighbor-joining, and the least-squares methods performed well under conditions of limited amount of character change and branch-length variation. By weighting the more slowly evolving characters or using dis? tances that correct for multiple substitution events, the area in which tree-making methods are misleading can be reduced. Good performance at high rates of change was obtained only by giving increased weight to slowly evolving characters (e.g., transversion parsimony, weighted parsimony). UPGMA performed well only when branch lengths were close in length. [Phylogeny estimation; simulation; parsimony; Lake's invariants; UPGMA; neighbor joining; weighted least squares; unweighted least squares; tree space.] Molecular systematists may choose among over 100 methods of phylogenetic estimation (Swofford and Olsen, 1990; Hil? lis et al., 1993). One of the goals of system? atics research is to winnow this pool of methods, separating those that perform well from those that perform poorly. This testing procedure forms the basis for im? proving the trees that systematists pro? duce; poor methods are discarded during this procedure, and better methods of phy? logeny estimation can be incrementally improved in subsequent cycles. In this pa? per, we compare the effectiveness of meth? ods of phylogenetic inference for molec? ular data under a wide variety of conditions and identify those conditions under which particular methods perform well or poorly. Computer simulations of the efficiency of tree-making methods have become more sophisticated over the past two decades. In general, more recent computer simulations have examined a larger number of meth? ods of phylogenetic inference under a larg? er number of evolutionary models of se? quence evolution (Peacock and Boulter, 1975; Blanken et al., 1982; Tateno et al., 1982; Saitou, 1988; Sourdis and Nei, 1988; Jin and Nei, 1990; Nei, 1991). Previous computer simulations that have examined the performance of phylogenetic methods, however, have explored only a few model phylogenies and branch-length variations. This limitation in those studies is impor? tant because the relative performance of methods depends on the conditions under
منابع مشابه
Molecular phylogeny of Ephippidae (Actinopteri: Acanthuriformes)
The family Ephippidae consists of eight genera and 15 species. The genus Platax with five species distributed throughout Indo-Pacific, seemingly has more diversity than other genera of the family. Phylogenetic relationships among species and genera of the family, however, are not fully resolved. 48 sequences of eight species (four genera) of batfishes are available in the GenBank and have been ...
متن کاملPhylogenetic analysis and intraspecific variation: performance of parsimony, likelihood, and distance methods.
Intraspecific variation is abundant in all types of systematic characters but is rarely addressed in simulation studies of phylogenetic method performance. We compared the accuracy of 15 phylogenetic methods using simulations to (1) determine the most accurate method(s) for analyzing polymorphic data (under simplified conditions) and (2) test if generalizations about the performance of phylogen...
متن کاملPhylogeny of Astragalus sect. Alopecuroidei based on the combined nrDNA ITS and morphology
A phylogenetic analysis of Astragalus sect. Alopecuroidei based on nrDNA ITS as well as morphological character data is presented. A total of fifty informative morphological characters were analyzed to reconstruct phylogenetic relationships for 23 taxa of the sect. Alopecuroidei and two species of sect. Laxiflori plus A. sieversianus and A. caryolobus as outgroups. The present analysis revealed...
متن کاملDistinguishing Convergence on Phylogenetic Networks
In phylogenetics, the evolutionary history of a group of taxa, for example, groups of species, genera or subspecies, can be modelled using a phylogenetic tree. Alternatively, we can model evolutionary history with a phylogenetic network. On phylogenetic networks, edges that have previously evolved independently from a common ancestor may subsequently converge for a period of time. Examples of p...
متن کاملComparing bootstrap and posterior probability values in the four-taxon case.
Assessment of the reliability of a given phylogenetic hypothesis is an important step in phylogenetic analysis. Historically, the nonparametric bootstrap procedure has been the most frequently used method for assessing the support for specific phylogenetic relationships. The recent employment of Bayesian methods for phylogenetic inference problems has resulted in clade support being expressed i...
متن کاملPolyhedral geometry of phylogenetic rogue taxa.
It is well known among phylogeneticists that adding an extra taxon (e.g. species) to a data set can alter the structure of the optimal phylogenetic tree in surprising ways. However, little is known about this "rogue taxon" effect. In this paper we characterize the behavior of balanced minimum evolution (BME) phylogenetics on data sets of this type using tools from polyhedral geometry. First we ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010