Sublinear growth of information in DNA sequences
Similar resources
Sublinear growth of information in DNA sequences.
We introduce a novel method to analyse complete genomes and recognise some distinctive features by means of an adaptive compression algorithm, which is not DNA-oriented, based on the Lempel-Ziv scheme. We study the Information Content as a function of the number of symbols encoded by the algorithm and we analyse the dictionary created by the algorithm. Preliminary results are shown concerning r...
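As a rough illustration of the idea in this abstract, the sketch below parses a sequence with a textbook LZ78 scheme and tracks an estimated code length as more symbols are read. It is not the authors' algorithm: the abstract only says the method is "based on the Lempel-Ziv scheme", so the choice of LZ78, the function names `lz78_phrases` and `information_content_curve`, and the cost estimate of about log2(c) + 2 bits per phrase (for c phrases over a 4-letter alphabet) are all assumptions made for this example.

```python
import math


def lz78_phrases(seq):
    """Split seq into LZ78 phrases (assumed variant, not the paper's)."""
    dictionary = set()
    phrases = []
    current = ""
    for symbol in seq:
        current += symbol
        if current not in dictionary:
            # First time this string is seen: it becomes a new dictionary entry.
            dictionary.add(current)
            phrases.append(current)
            current = ""
    if current:
        # Leftover tail that matches an existing dictionary entry.
        phrases.append(current)
    return phrases


def information_content_curve(seq, step):
    """Return (n, bits) pairs for prefixes of length step, 2*step, ...

    bits is a rough estimate: c phrases, each costing about
    log2(c) bits for a dictionary index plus 2 bits for one of 4 nucleotides.
    """
    curve = []
    for n in range(step, len(seq) + 1, step):
        c = len(lz78_phrases(seq[:n]))
        bits = c * (math.log2(c) + 2) if c > 0 else 0.0
        curve.append((n, bits))
    return curve
```

Calling `information_content_curve(genome, 1000)` on a nucleotide string gives points that can be plotted against n. For a highly repetitive sequence the phrase count, and with it the estimate, grows more slowly than n, which is the kind of sublinear behaviour the title refers to.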
Information Analysis of DNA Sequences
The problem of differentiating the informational content of coding (exons) and noncoding (introns) regions of a DNA sequence is one of the central problems of genomics. The introns are estimated to be nearly 95% of the DNA and since they do not seem to participate in the process of transcription of amino-acids, they have been termed “junk DNA.” Although it is believed that the non-coding region...
Information weights of nucleotides in DNA sequences
The coding sequence in a DNA molecule is considered as a message to be transferred to a receiver, the proteins, through a noisy information channel, and each nucleotide is assigned a respective information weight. With the help of the nucleotide substitution matrix we estimated the lower bound of the amount of information carried by nucleotides which is not subject to mutations. We use...
Sublinear Time Motif Discovery from Multiple Sequences
In this paper, a natural probabilistic model for motif discovery has been used to experimentally test the quality of motif discovery programs. In this model, there are k background sequences, and each character in a background sequence is a random character from an alphabet, Σ. A motif G = g1g2 . . . gm is a string of m characters. In each background sequence is implanted a probabilistically-ge...
Mutual Information Content of Homologous DNA Sequences
The information necessary to reproduce and maintain an organism is encoded in nucleic acid molecules. Deepening the knowledge about how the information is stored in these bio-sequences can lead to more efficient methods of comparing genomic sequences. In the present study, we analyzed the quantity of information contained in a DNA sequence that can be useful to identify sequences homologous to it....
Journal
Journal title: Bulletin of Mathematical Biology
Year: 2005
ISSN: 0092-8240
DOI: 10.1016/j.bulm.2004.10.005