Application of Dimensionality Reduction techniques in Real time Dataset
نویسندگان
چکیده
Data pre-processing in data mining refers to transforming the raw data into understandable format for further analysis. The real time data is incomplete, robust and unorganised which should be cleaned and transformed to make it efficient for preprocessing. In this paper, we have discussed about three dimensionality reduction techniques namely Principal Component Analysis (PCA), Singular Value decomposition and Learning Vector Quantization applied to solar irradiance dataset. The dataset consists of temperature, solar irradiance, and humidity data for 25 years for selected eight south Indian cities. The dimensionality reduction was done by applying the above mentioned three algorithms and their efficiency was evaluated, to find the best suiting algorithm to apply for the dataset. Index Terms Data Mining, Dimensionality Reduction, Learning Vector Quantization, Principal Component Analysis (PCA), Singular Value Decomposition.
منابع مشابه
2D Dimensionality Reduction Methods without Loss
In this paper, several two-dimensional extensions of principal component analysis (PCA) and linear discriminant analysis (LDA) techniques has been applied in a lossless dimensionality reduction framework, for face recognition application. In this framework, the benefits of dimensionality reduction were used to improve the performance of its predictive model, which was a support vector machine (...
متن کاملImpact of linear dimensionality reduction methods on the performance of anomaly detection algorithms in hyperspectral images
Anomaly Detection (AD) has recently become an important application of hyperspectral images analysis. The goal of these algorithms is to find the objects in the image scene which are anomalous in comparison to their surrounding background. One way to improve the performance and runtime of these algorithms is to use Dimensionality Reduction (DR) techniques. This paper evaluates the effect of thr...
متن کاملDiagnosis of Diabetes Using an Intelligent Approach Based on Bi-Level Dimensionality Reduction and Classification Algorithms
Objective: Diabetes is one of the most common metabolic diseases. Earlier diagnosis of diabetes and treatment of hyperglycemia and related metabolic abnormalities is of vital importance. Diagnosis of diabetes via proper interpretation of the diabetes data is an important classification problem. Classification systems help the clinicians to predict the risk factors that cause the diabetes or pre...
متن کاملDeveloping a Filter-Wrapper Feature Selection Method and its Application in Dimension Reduction of Gen Expression
Nowadays, increasing the volume of data and the number of attributes in the dataset has reduced the accuracy of the learning algorithm and the computational complexity. A dimensionality reduction method is a feature selection method, which is done through filtering and wrapping. The wrapper methods are more accurate than filter ones but perform faster and have a less computational burden. With ...
متن کاملStudy on Dimensionality Reduction Techniques and Applications
Data is not collected only for data mining. Data accumulates in an unprecedented speed. Data preprocessing is an important part for effective machine learning and data mining. Data mining is discovering interesting knowledge from large amounts of data, which is the integral part of the KDD (Knowledge Discovery in Databases), which is the overall process of converting raw data into useful inform...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016