Sparse representation and Bayesian detection of genome copy number alterations from microarray data

نویسندگان

Roger Pique-Regi

Jordi Monso-Varona

Antonio Ortega

Robert C. Seeger

Timothy Triche

Shahab Asgharzadeh

چکیده

MOTIVATION Genomic instability in cancer leads to abnormal genome copy number alterations (CNA) that are associated with the development and behavior of tumors. Advances in microarray technology have allowed for greater resolution in detection of DNA copy number changes (amplifications or deletions) across the genome. However, the increase in number of measured signals and accompanying noise from the array probes present a challenge in accurate and fast identification of breakpoints that define CNA. This article proposes a novel detection technique that exploits the use of piece wise constant (PWC) vectors to represent genome copy number and sparse Bayesian learning (SBL) to detect CNA breakpoints. METHODS First, a compact linear algebra representation for the genome copy number is developed from normalized probe intensities. Second, SBL is applied and optimized to infer locations where copy number changes occur. Third, a backward elimination (BE) procedure is used to rank the inferred breakpoints; and a cut-off point can be efficiently adjusted in this procedure to control for the false discovery rate (FDR). RESULTS The performance of our algorithm is evaluated using simulated and real genome datasets and compared to other existing techniques. Our approach achieves the highest accuracy and lowest FDR while improving computational speed by several orders of magnitude. The proposed algorithm has been developed into a free standing software application (GADA, Genome Alteration Detection Algorithm). AVAILABILITY http://biron.usc.edu/~piquereg/GADA

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Hidden Markov Models and Sparse Bayesian Learning for Detection of Copy Number Alterations

Abnormal genome copy number alterations (CNAs) are associated with many severe diseases. Advances in microarray technology have greatly improved the resolution of detection of DNA copy number changes. This poses a challenge to existing computational methods to process the data accurately and efficiently. We compare two approaches to CNA detection for speed and accuracy. The first is a modified ...

متن کامل

SPARSE REPRESENTATION MODELS AND APPLICATIONS TO BIOINFORMATICS by Roger Pique - Regi

Microarrays and new sequencing techniques offer a high throughput platform to study the whole genome with the unprecedented capability of measuring millions of genomic features on a single essay. This massive parallel measurement power has an enormous potential for research in Biology and Medicine with the ultimate objective of identifying and learning the biological processes occurring in diff...

متن کامل

I-37: Establishing High Resolution Genomic Profiles of Single Cells Using Microarray and Next-Generation Sequencing Technologies

The nature and pace of genome mutation is largely unknown. Standard methods to investigate DNA-mutation rely on arraying or sequencing DNA from a population of cells, hence the genetic composition of individual cells is lost and de novo mutation in cell(s) is concealed within the bulk signal. We developed methods based on (SNP-) arraying and next-generation sequencing of single-cell whole-genom...

متن کامل

Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA

MOTIVATION The complexity of a large number of recently discovered copy number polymorphisms is much higher than initially thought, thus making it more difficult to detect them in the presence of significant measurement noise. In this scenario, separate normalization and segmentation is prone to lead to many false detections of changes in copy number. New approaches capable of jointly modeling ...

متن کامل

O-38: Concurrent Whole-Genome Haplotyping and Copy-Number Profiling of Single Cells

Background Methods for haplotyping and DNA copynumber typing of single cells are paramount for studying genomic heterogeneity and enabling genetic diagnosis. Before analyzing the DNA of a single cell by microarray or next-generation sequencing, a whole-genome amplification (WGA) process is required, but it substantially distorts the frequency and composition of the cell’s alleles. As a conseque...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Bioinformatics

دوره 24 3 شماره

صفحات -

تاریخ انتشار 2008

Sparse representation and Bayesian detection of genome copy number alterations from microarray data

نویسندگان

چکیده

منابع مشابه

Comparison of Hidden Markov Models and Sparse Bayesian Learning for Detection of Copy Number Alterations

SPARSE REPRESENTATION MODELS AND APPLICATIONS TO BIOINFORMATICS by Roger Pique - Regi

I-37: Establishing High Resolution Genomic Profiles of Single Cells Using Microarray and Next-Generation Sequencing Technologies

Joint estimation of copy number variation and reference intensities on multiple DNA arrays using GADA

O-38: Concurrent Whole-Genome Haplotyping and Copy-Number Profiling of Single Cells

عنوان ژورنال:

اشتراک گذاری