Error detection in SNP data by considering the likelihood of recombinational history implied by three-site combinations

نویسندگان

  • Donna M. Toleno
  • Peter L. Morrell
  • Michael T. Clegg
چکیده

MOTIVATION Errors in nucleotide sequence and SNP genotyping data are problematic when inferring haplotypes. Previously published methods for error detection in haplotype data make use of pedigree information; however, for many samples, individuals are not related by pedigree. This article describes a method for detecting errors in haplotypes by considering the recombinational history implied by the patterns of variation, three SNPs at a time. RESULTS Coalescent simulations provide evidence that the method is robust to high levels of recombination as well as homologous gene conversion, indicating that patterns produced by both proximate and distant SNPs may be useful for detecting unlikely three-site haplotypes. AVAILABILITY The perl script implementing the described method is called EDUT (Error Detection Using Triplets) and is available on request from the authors. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

مطالعات درخت تصمیم در برآورد ریسک ابتلا به سرطان سینه با استفاده از چند شکلی‌های تک نوکلوئیدی

Abstract Introduction:   Decision tree is the data mining tools to collect, accurate prediction and sift information from massive amounts of data that are used widely in the field of computational biology and bioinformatics. In bioinformatics can be predict on diseases, including breast cancer. The use of genomic data including single nucleotide polymorphisms is a very important ...

متن کامل

P-244: Analysis of Genomic and Cell Free DNA of A let-7 microRNA Binding Site of KRAS Gene Polymorphisms in Endometriosis

Background: Endometriosis is one of the most common benign gynecological diseases which is characterized by endometriallike tissue growing outside the uterine cavity. Although the pathology of endometriosis remains unknown, the genetic predisposition plays an apparent role. Several genes have been contributed to endometriosis, but it seems KRAS has a crucial role, because its activation results...

متن کامل

Finding type 2 diabetes causal single nucleotide polymorphism combinations and functional modules from genome-wide association data

BACKGROUND Due to the low statistical power of individual markers from a genome-wide association study (GWAS), detecting causal single nucleotide polymorphisms (SNPs) for complex diseases is a challenge. SNP combinations are suggested to compensate for the low statistical power of individual markers, but SNP combinations from GWAS generate high computational complexity. METHODS We aim to dete...

متن کامل

Genotyping common SNP and a microsatellite sequence closely linked to waxy gene in rice by DNA based markers

The potential of different DNA based molecular markers was examined for the detection of single nucleotide polymorphism (SNP) in the waxy gene and a microsatellite (SSR) sequence closely linked to it in a collection of rice varieties. DNA was extracted from leaf samples of 68 different rice cultivars by the CTAB method and specific primers were designed for the amplification of waxy gene and SS...

متن کامل

vegetation change detection using multi-temporal remotly sensed data during recent three decades by artificial intelligence technique (Case study: protected area of Bashgol)

Quantitative and qualitative information of vegetation and its changes in duration of time as a basic foundation of determination of  habitat quality, priority of protected area and also determination of price of ecosystem services in order to optimum management of natural resources and sustainable development is a very important technical point. In other hand, researchers are interested in rem...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 23 14  شماره 

صفحات  -

تاریخ انتشار 2007