Supervised feature selection in mass spectrometry-based proteomic profiling by blockwise boosting
نویسندگان
چکیده
When feature selection in mass spectrometry is based on single m/z values, problems arise from the fact that variability is not only in vertical but also in horizontal direction, i.e. also slightly differing m/z values may correspond to the same feature. Hence, we propose to use the full spectra as input to a classifier, but to select small groups -- or blocks -- of adjacent m/z values, instead of single m/z values only. For that purpose we modify the LogitBoost to obtain a version of the so-called blockwise boosting procedure for classification. It is shown that blockwise boosting has high potential in predictive proteomics.
منابع مشابه
A New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum
Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...
متن کاملComparison of Supervised Classification Methods for Protein Profiling in Cancer Diagnosis
A key challenge in clinical proteomics of cancer is the identification of biomarkers that could allow detection, diagnosis and prognosis of the diseases. Recent advances in mass spectrometry and proteomic instrumentations offer unique chance to rapidly identify these markers. These advances pose considerable challenges, similar to those created by microarray-based investigation, for the discove...
متن کاملA Multi-objective Genetic Programming Biomarker Detection Approach in Mass Spectrometry Data
Mass spectrometry is currently the most commonly used technology in biochemical research for proteomic analysis. The main goal of proteomic profiling using mass spectrometry is the classification of samples from different clinical states. This requires the identification of proteins or peptides (biomarkers) that are expressed differentially between different clinical states. However, due to the...
متن کاملMachine learning methods for predictive proteomics
The search for predictive biomarkers of disease from high-throughput mass spectrometry (MS) data requires a complex analysis path. Preprocessing and machine-learning modules are pipelined, starting from raw spectra, to set up a predictive classifier based on a shortlist of candidate features. As a machine-learning problem, proteomic profiling on MS data needs caution like the microarray case. T...
متن کاملMathematical Framework and Wavelets Applications in Proteomics for Cancer Study
Cancer is a proteomic disease. Though MALDI-TOF mass spectrometry allows direct measurement of the protein signature of tissue, blood, or their biological samples, and holds tremendous potential for disease diagnosis and treatment, key challenges remain in the processing of proteomic data. In this chapter, we will introduce a wavelet based mathematical framework and computational tools for prot...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 25 8 شماره
صفحات -
تاریخ انتشار 2009