A formalism for studying long-range correlations in many-alphabets sequences

Authors

  • S. L. Narasimhan
  • Joseph A. Nathan
  • K. P. N. Murthy
Abstract

S. L. Narasimhan, Joseph A. Nathan, P. S. R. Krishna and K. P. N. Murthy. Solid State Physics Division, Reactor Physics Design Division, Bhabha Atomic Research Centre, Mumbai-400085, India. Materials Science Division, Indira Gandhi Centre for Atomic Research, Kalpakkam 603102, Tamilnadu, India.

We formulate a mean-field-like theory of long-range correlated L-alphabets sequences, which are actually systems with (L − 1) independent parameters. Depending on the values of these parameters, the variance of the average number of any given symbol in the sequence shows a linear or a superlinear dependence on the total length of the sequence. We present exact solutions for the four-alphabets and three-alphabets sequences. We also demonstrate that a mapping of the given sequence into a smaller-alphabets sequence (namely, a coarse-graining process) does not necessarily imply that long-range correlations found in the latter would correspond to those of the former.
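The two quantities the abstract refers to, the variance of a symbol's count as a function of length and the coarse-graining of a sequence into a smaller alphabet, can be illustrated with a minimal empirical sketch. This is not the authors' mean-field formalism: the helper names `count_variance` and `coarse_grain`, the use of non-overlapping windows, the four-letter alphabet "ACGT", and the two-letter mapping are all assumptions made for illustration only.

```python
import random
from statistics import pvariance


def count_variance(seq, symbol, window):
    """Variance of the number of `symbol` across non-overlapping windows.

    Hypothetical helper: the sequence is cut into consecutive windows of
    length `window` (a trailing partial window is dropped) and the
    population variance of the per-window counts is returned.
    """
    counts = [
        seq[i:i + window].count(symbol)
        for i in range(0, len(seq) - window + 1, window)
    ]
    return pvariance(counts)


def coarse_grain(seq, mapping):
    """Map every symbol to a symbol of a smaller alphabet (hypothetical helper)."""
    return [mapping[s] for s in seq]


if __name__ == "__main__":
    random.seed(0)
    # Uncorrelated four-letter test sequence (an assumed example, not data
    # from the paper). For such a sequence the count variance should grow
    # roughly linearly with the window length.
    seq = random.choices("ACGT", k=20000)
    for window in (10, 100, 1000):
        print(window, count_variance(seq, "A", window))

    # Assumed two-letter coarse-graining of the same sequence.
    two_letter = coarse_grain(seq, {"A": "R", "G": "R", "C": "Y", "T": "Y"})
    for window in (10, 100, 1000):
        print(window, count_variance(two_letter, "R", window))
```

Plotting the variance against the window length on log-log axes gives a slope near 1 for linear growth and a larger slope for superlinear growth. Comparing the slopes before and after `coarse_grain` is one way to see that the two sequences need not show the same behaviour.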

Similar resources

Long range correlations in DNA sequences

The so-called long-range correlation properties of DNA sequences are studied using variance analyses of the density distribution of a single nucleotide or a group of nucleotides in a model-independent way. This new method, which was suggested earlier, has been applied to extract slope parameters that characterize the correlation properties for several intron-containing and intron-less DNA sequences. An ...

Reduced amino acid alphabets exhibit an improved sensitivity and selectivity in fold assignment

Motivation: Many proteins with vastly dissimilar sequences are found to share a common fold, as evidenced in the wealth of structures now available in the Protein Data Bank. One idea that has found success in various applications is the concept of a reduced amino acid alphabet, wherein similar amino acids are clustered together. Given the structural similarity exhibited by many apparently dissim...

P87: The Role of the Long Non-Coding RNA Sequences (LncRNAs) in Neurological Disorders

Precise interpretation of the transcriptome sequences in several species showed that the major part of the genome is transcribed; however, only a small fraction of the transcribed sequences have open reading frames that are conserved during evolution. So, it is unlikely that many of the transcribed sequences code for proteins. Among all human non-coding transcripts, at least 10000...

Understanding Long-range Correlations in DNA Sequences

In this paper, we review the literature on statistical long-range correlation in DNA sequences. We examine the current evidence for these correlations, and conclude that a mixture of many length scales (including some relatively long ones) in DNA sequences is responsible for the observed 1/f-like spectral component. We note the complexity of the correlation structure in DNA sequences. The obse...

High energy factorization in nucleus-nucleus collisions III. Long range rapidity correlations

We obtain a novel result in QCD for long range rapidity correlations between gluons produced in the collision of saturated high energy hadrons or nuclei. This result, obtained in a high energy factorization framework, provides strong justification for the Glasma flux tube picture of coherent strong color fields. Our formalism can be applied to “near side ridge” events at RHIC and in future stud...

Publication date: 2005