MOSAIC: segmenting multiple aligned DNA sequences
نویسندگان
چکیده
منابع مشابه
MOSAIC: segmenting multiple aligned DNA sequences
UNLABELLED MOSAIC is a set of tools for the segmentation of multiple aligned DNA sequences into homogeneous zones. The segmentation is based on the distribution of mutational events along the alignment. As an example, the analysis of one repeated sequence belonging to the subtelomeric regions of the yeast genome is presented. AVAILABILITY Free access from ftp://ftp.biomath.jussieu.fr/pub/pape...
متن کاملNew stopping criteria for segmenting DNA sequences.
We propose a solution on the stopping criterion in segmenting inhomogeneous DNA sequences with complex statistical patterns. This new stopping criterion is based on Bayesian information criterion in the model selection framework. When this criterion is applied to telomere of S. cerevisiae and the complete sequence of E. coli, borders of biologically meaningful units were identified, and a more ...
متن کاملSimplifying the mosaic description of DNA sequences.
By using the Jensen-Shannon divergence, genomic DNA can be divided into compositionally distinct domains through a standard recursive segmentation procedure. Each domain, while significantly different from its neighbors, may, however, share compositional similarity with one or more distant (non-neighboring) domains. We thus obtain a coarse-grained description of the given DNA string in terms of...
متن کاملRevTrans: multiple alignment of coding DNA from aligned amino acid sequences
The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit from the information that is implicit in empirical substitution matrices such as BLOSUM-62. Taken t...
متن کاملMaximum Entropy Weighting of Aligned Sequences of Proteins or DNA
In a family of proteins or other biological sequences like DNA the various subfamilies are often very unevenly represented. For this reason a scheme for assigning weights to each sequence can greatly improve performance at tasks such as database searching with profiles or other consensus models based on multiple alignments. A new weighting scheme for this type of database search is proposed. In...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bioinformatics
سال: 2001
ISSN: 1367-4803,1460-2059
DOI: 10.1093/bioinformatics/17.2.196