A New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum

نویسندگان

  • H. Montazery Kordy Ph.D. Student in Biomedical engineering, Tarbiat Modarres University, Tehran, Iran
  • M. H. Miran Baygi Assistant Professor, Biomedical Engineering Group, Tarbiat Modarres University, Tehran, Iran
  • M. H. Moradi Associate Professor, Faculty of Biomedical Engineering, Amir Kabir University of Technology, Tehran, Iran
چکیده مقاله:

Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such patterns is the optimum selection of feature subset from mass spectrum data. Materials and Methods: In this research, the data corresponding to proteomic patterns of serum from patients with ovarian cancer was analyzed in two independent groups. Using a mathematical model, the baseline and electrical noises were eliminated in the preprocessing stage with subsequent normalization of mass spectra. The proposed method uses a hybrid algorithm based on a statistical test and Bhattacharyya distance measure. Using the final prediction error criteria, the best feature subset was selected from 15154 data points while maintaining the resolution and the valuable information. The selected feature subset was then used for the detection of biomarkers within the mass spectrum. Results: Using the method of k-fold cross validation, the samples under study were divided into two sets called the learning and test. Using the least threshold value, the points having significance difference (p-value < 0.05) were selected. The best subset was then extracted from the remaining points such that it would have the maximum information content. By doing so, the number of input variables was reduced from 15154 to 80 points. In the next step, 16 and 6 biomarkers were selected for the two independent dataset. The obtained results show accuracy, specificity as well as sensitivity of 100%. Discussion and Conclusion: To diagnose a disease in medicine is an example of pattern recognition in engineering and physical science. In this paper, a filter approach is introduced for feature subset selection which extracts appropriate features in the input space by using the combination of statistical method and distance measure based on information criteria. The result of this study emphasizes that the use of combination approach in feature extraction and selection in high dimensional data can appropriately separate the pattern classes in addition to maintaining the information content. 

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

a new hybrid feature subset selection algorithm for the analysis of ovarian cancer data using laser mass spectrum

introduction: amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. the chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. a major challenge in extracting such ...

متن کامل

a hybrid feature subset selection algorithm for analysis of high correlation proteomic data

pathological changes within an organ can be reflected as proteomic patterns in biological fluids such as plasma, serum, and urine. the surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (seldi-tof ms) has been used to generate proteomic profiles from biological fluids. mass spectrometry yields redundant noisy data that the most data points are irrelevant features ...

متن کامل

Feature Subset Selection Using a Genetic Algorithm Feature Subset Selection Using a Genetic Algorithm

Practical pattern classiication and knowledge discovery problems require selection of a subset of attributes or features (from a much larger set) to represent the patterns to be classiied. This paper presents an approach to the multi-criteria optimization problem of feature subset selection using a genetic algorithm. Our experiments demonstrate the feasibility of this approach for feature subse...

متن کامل

A Parallel Genetic Algorithm Based Method for Feature Subset Selection in Intrusion Detection Systems

Intrusion detection systems are designed to provide security in computer networks, so that if the attacker crosses other security devices, they can detect and prevent the attack process. One of the most essential challenges in designing these systems is the so called curse of dimensionality. Therefore, in order to obtain satisfactory performance in these systems we have to take advantage of app...

متن کامل

A Hybrid Feature Subset Selection Algorithm for Analysis of High Correlation Proteomic Data

Pathological changes within an organ can be reflected as proteomic patterns in biological fluids such as plasma, serum, and urine. The surface-enhanced laser desorption and ionization time-of-flight mass spectrometry (SELDI-TOF MS) has been used to generate proteomic profiles from biological fluids. Mass spectrometry yields redundant noisy data that the most data points are irrelevant features ...

متن کامل

A Binary Pso-aco Hybrid Algorithm for Feature Subset Selection

Feature Selection is the process of selecting a subset of features available, allowing a certain objective function to be optimized, from the data containing noisy,irrelevant and redundant features. This paper presents a novel feature selection method that is based on hybridization of ACO with a binary PSO to obtain excellent properties of two algorithms by synthesizing them and aims at achievi...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}


عنوان ژورنال

دوره 4  شماره Issue 1,2

صفحات  83- 96

تاریخ انتشار 2007-06-01

با دنبال کردن یک ژورنال هنگامی که شماره جدید این ژورنال منتشر می شود به شما از طریق ایمیل اطلاع داده می شود.

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023