A filtering strategy to reduce reference bias in measurements of allele-specific expression

نویسندگان

  • Joshua Bradley
  • Zia Khan
چکیده

Understanding how genetic variation leads to phenotypic variation is a fundamental goal in genetics. One of the challenges of this goal is to distinguish variation in a genome that has an effect on phenotype from variation that has no effect. One way to determine if a genetic variant is likely to have a phenotypic effect is to determine if that variant affects gene regulation. Presumably through their effect on gene regulation these variants affect organismal phenotypes. Examining heterozygous sites in a genome can identify the presence of one class of variants, called cisregulatory variants. In the absence of a functional variant, sequencing based measurements of gene expression should show no bias toward one allele. The presence of allelic bias implies that functional genetic variation affects gene regulation. One key issue in identifying these allele-specific events is a systematic bias toward the reference allele, an artifact of read alignment that creates false positive allele-specific expression (ASE). While N masking is one popular method used to reduce reference bias, it does not eliminate all sources of reference bias. We present a novel metric, the homology score, to characterize Single Nucleotide Polymorphisms (SNPs) based on the homology of their surrounding genomic region. When combined with N-masking, we are able to identify and filter out sources of reference bias not accounted for by use of N-masking alone. In general, this metric can assist in removing SNPs likely to yield false positive ASE.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A new strategy to reduce allelic bias in RNA-Seq readmapping

Accurate estimation of expression levels from RNA-Seq data entails precise mapping of the sequence reads to a reference genome. Because the standard reference genome contains only one allele at any given locus, reads overlapping polymorphic loci that carry a non-reference allele are at least one mismatch away from the reference and, hence, are less likely to be mapped. This bias in read mapping...

متن کامل

Comparison of Two Veterinary Blood Glucose Meters and One Human-Based Glucose Meter for Use in Dogs

Background: Recently, tendency to use veterinary specific Portable blood glucose meter (PBGMs) has increased. However, assessment of their analytical and clinical accuracy is a matter of concern. OBJECTIVES: To assess accuracy of two veterinary (AlphaTRAK2 and CERA-PET) and one human-based (Bionime) PBGMs for canine blood samples. METHODS: In this cross-sectional study, a total of 160 client-ow...

متن کامل

Allelic imbalance metre (Allim), a new tool for measuring allele-specific gene expression with RNA-seq data

Estimating differences in gene expression among alleles is of high interest for many areas in biology and medicine. Here, we present a user-friendly software tool, Allim, to estimate allele-specific gene expression. Because mapping bias is a major problem for reliable estimates of allele-specific gene expression using RNA-seq, Allim combines two different strategies to account for the mapping b...

متن کامل

Adaptive Filtering Strategy to Remove Noise from ECG Signals Using Wavelet Transform and Deep Learning

Introduction: Electrocardiogram (ECG) is a method to measure the electrical activity of the heart which is performed by placing electrodes on the surface of the body. Physicians use observation tools to detect and diagnose heart diseases, the same is performed on ECG signals by cardiologists. In particular, heart diseases are recognized by examining the graphic representation of heart signals w...

متن کامل

Independence of color intensity variation in red flesh apples from the number of repeat units in promoter region of the MdMYB10 gene as an allele to MdMYB1 and MdMYBA

MdMYB10 gene expression results in accumulation of anthocyanin in many tissues including flesh of applefruit. The MdMYB1 and MdMYBA genes are close homologues to MdMYB10 gene and both are responsiblefor red color phenotype in apple fruit skin. In the current study, an apple genome sequence draft analysisindicated that these three genes are located in a unique contig. Further a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015