Boosting EM for Radiation Hybrid and Genetic Mapping

Authors

  • Thomas Schiex
  • Patrick Chabrier
  • Martin Bouchez
  • Denis Milan
Abstract

Radiation hybrid (RH) mapping is a somatic cell technique used to order markers along a chromosome and to estimate the physical distances between them. It complements genetic mapping by allowing finer resolution. Like genetic mapping, RH mapping consists of finding a marker ordering that maximizes a given criterion. Several software packages have recently been proposed to solve RH mapping problems, each offering its own criteria and ordering techniques. The most general packages look for maximum likelihood maps and can cope with errors, unknowns and polyploid hybrids, at the cost of limited computational efficiency. More efficient packages look for minimum-break maps or two-point approximations of maximum likelihood maps, but ignore errors, unknowns and polyploid hybrids. In this paper, we present a simple improvement of the EM algorithm [5] that makes maximum likelihood estimation much more efficient (in practice and, to some extent, in theory too). The boosted EM algorithm can deal with unknowns in both error-free haploid data and error-free backcross data. Unknowns are usually quite limited in RH mapping, but they cannot be ignored when dealing with genetic data or with consensus mapping over multiple populations or panels, where markers are not necessarily typed in every panel or population. These improved EM algorithms have been implemented in the CarthaGène software. We conclude with a comparison against similar packages (RHMAP and MapMaker) on simulated data sets, and present preliminary results on mixed simultaneous RH/genetic mapping on pig data.
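To make the idea of an ordering criterion concrete, the following minimal sketch scores a marker order by its number of obligate breaks, the simpler "minimum breaks" criterion mentioned above. It is not the boosted EM algorithm of the paper and is not taken from CarthaGène, RHMAP or MapMaker. The function name `obligate_breaks`, the data encoding (1 = retained, 0 = absent, `None` = unknown) and the toy panel are assumptions made for illustration. Unknowns are simply skipped, which is one common convention rather than the only one.

```python
from typing import Optional, Sequence

# Hypothetical encoding: one retention vector per hybrid, indexed by marker.
# 1 = marker retained, 0 = marker absent, None = unknown (not typed).
Hybrid = Sequence[Optional[int]]


def obligate_breaks(hybrids: Sequence[Hybrid], order: Sequence[int]) -> int:
    """Count retention changes between consecutive typed markers along `order`."""
    total = 0
    for hybrid in hybrids:
        previous = None  # last known observation seen along the order
        for marker in order:
            observed = hybrid[marker]
            if observed is None:
                continue  # assumption: an unknown never forces a break
            if previous is not None and observed != previous:
                total += 1
            previous = observed
    return total


# Toy panel of three hybrids typed on four markers (illustrative data only).
hybrids = [
    [1, 1, 0, 0],
    [0, 1, 1, 0],
    [1, None, 1, 1],
]

for candidate in ([0, 1, 2, 3], [0, 2, 1, 3]):
    print(candidate, obligate_breaks(hybrids, candidate))
```

Under this criterion an order with fewer obligate breaks is preferred. A maximum likelihood approach instead estimates breakage and retention probabilities for each candidate order, which is where an EM procedure comes in.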




Publication date: 2001