HAPGEN2: simulation of multiple disease SNPs

نویسندگان

  • Zhan Su
  • Jonathan Marchini
  • Peter Donnelly
چکیده

MOTIVATION Performing experiments with simulated data is an inexpensive approach to evaluating competing experimental designs and analysis methods in genome-wide association studies. Simulation based on resampling known haplotypes is fast and efficient and can produce samples with patterns of linkage disequilibrium (LD), which mimic those in real data. However, the inability of current methods to simulate multiple nearby disease SNPs on the same chromosome can limit their application. RESULTS We introduce a new simulation algorithm based on a successful resampling method, HAPGEN, that can simulate multiple nearby disease SNPs on the same chromosome. The new method, HAPGEN2, retains many advantages of resampling methods and expands the range of disease models that current simulators offer. AVAILABILITY HAPGEN2 is freely available from http://www.stats.ox.ac.uk/~marchini/software/gwas/gwas.html. CONTACT [email protected] SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Association study of four polymorphisms in the interleukin-7 receptor alpha gene with multiple sclerosis in Eastern Iran

Objective(s): Multiple sclerosis (MS) is an autoimmune demyelinating disease of the central nervous system (CNS) with unknown etiology. Various genetics and environmental factors contribute to the pathogenesis of the disease. The interleukin-7 receptor alpha chain (IL-7Ra) was identified as the first non-major histocompatibility complex (non-MHC) MS susceptibility locus. In this study we are tr...

متن کامل

Single Nucleotide Polymorphisms and Association Studies: A Few Critical Points

Uncovering DNA sequence variations that correlate with phenotypic changes, e.g., diseases, is the aim of sequence variation studies. Common types sequence variations are Single nucleotide polymorphism (SNP, pronounced snip).SNPs are the third-generation molecular marker. SNP represents a DNA sequence variant of a single base pair with the minor allele occurring in more than 1% of a given popula...

متن کامل

A Bayesian Hierarchical Model for Relating Multiple SNPs within Multiple Genes to Disease Risk

A variety of methods have been proposed for studying the association of multiple genes thought to be involved in a common pathway for a particular disease. Here, we present an extension of a Bayesian hierarchical modeling strategy that allows for multiple SNPs within each gene, with external prior information at either the SNP or gene level. The model involves variable selection at the SNP leve...

متن کامل

Power estimation of multiple SNP association test of case-control study and application.

At the current stage, a large number of single nucleotide polymorphisms (SNPs) have been deployed in searching for genes underlying complex diseases. A powerful method is desirable for efficient analysis of SNP data. Recently, a novel method for multiple SNP association test using a combination of allelic association (AA) and Hardy-Weinberg disequilibrium (HWD) has been proposed. However, the p...

متن کامل

بررسی پلی مورفیسم ژن رسپتور اینترلوکین VIIدر بیماران مبتلا به ام اس

Introduction: Multiple Sclerosis is a chronic disease of central nervous system. Disease is more common in young adults and females and causes neurologic symptoms and signs. Cytokine IL-7 is a 25– kDa glycoprotein that has an important role in Lymphopoiesis. Interleukin VII receptor gene has been identified to be associated with multiple sclerosis, so its assessment is important. Methods: We i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 27 16  شماره 

صفحات  -

تاریخ انتشار 2011