An Efficient Hybrid Comparative Study Based on Aco, Pso, K-means with K-medoids for Cluster Analysis

نویسنده

  • S. Akila
چکیده

S.Keerthana1, Mrs. S. Akila2 1Research Scholar, Department of Computer Science, Vellalar College for Women, Erode, Tamilnadu, India 2Assistant Professor, Dept. of Computer Science, Vellalar College for Women, Erode, Tamilnadu, India ---------------------------------------------------------------------***--------------------------------------------------------------------Abstract Clustering is a popular data analysis and mining technique. A popular technique for clustering is based on k-means such that the data is partitioned into K clusters. However, the k-means algorithm highly depends on the initial state and converges to local optimum. The existing work presents a hybrid evolutionary algorithm to solve nonlinear partitional clustering problem. The evolutionary algorithm is the combination of FAPSO (fuzzy adaptive particle swarm optimization), ACO (ant colony optimization) and k-means algorithms, called FAPSO-ACO– K, which can find better cluster partition. Then k-means clustering is applied to get cluster results. K-means clustering is sensitive to the outliers and a set of objects closest to a centroid may be empty, in which case centroids cannot be updated. In k-means difficult to predict K-Value and different initial partitions can result in different final clusters. The objective of the proposed work is to overcome these problems, the K-medoids clustering algorithm where representative objects called medoids are considered instead of centroids. Because it uses the most centre located object in a cluster. The algorithm has excellent feature which requires the distance between every pairs of objects only once and uses this distance at every iterative step. It is less sensitive to outliers compared with the K-means clustering. It gives better performance than K-means clustering. Minimize the sensitivity of k-means to outliers. Pick the actual objects to represent clusters instead of mean values. Each remain object is clustered with the representative object (Medoid) to which is the most similar. The performance of the proposed work is evaluated through several benchmark data sets. The simulation result shows that the performance of the proposed work is better than the existing algorithm in terms of accuracy, recall, precision and F-measure.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Hybrid Data Clustering Algorithm Using Modified Krill Herd Algorithm and K-MEANS

Data clustering is the process of partitioning a set of data objects into meaning clusters or groups. Due to the vast usage of clustering algorithms in many fields, a lot of research is still going on to find the best and efficient clustering algorithm. K-means is simple and easy to implement, but it suffers from initialization of cluster center and hence trapped in local optimum. In this paper...

متن کامل

An efficient hybrid algorithm based on modified imperialist competitive algorithm and K-means for data clustering

Clustering techniques have received attention in many fields of study such as engineering, medicine, biology and data mining. The aim of clustering is to collect data points. The K-means algorithm is one of the most common techniques used for clustering. However, the results of K-means depend on the initial state and converge to local optima. In order to overcome local optima obstacles, a lot o...

متن کامل

Intrusion Detection based on a Novel Hybrid Learning Approach

Information security and Intrusion Detection System (IDS) plays a critical role in the Internet. IDS is an essential tool for detecting different kinds of attacks in a network and maintaining data integrity, confidentiality and system availability against possible threats. In this paper, a hybrid approach towards achieving high performance is proposed. In fact, the important goal of this paper ...

متن کامل

A Comparative Analysis of Particle Swarm Optimization and K-means Algorithm For Text Clustering Using Nepali Wordnet

The volume of digitized text documents on the web have been increasing rapidly. As there is huge collection of data on the web there is a need for grouping(clustering) the documents into clusters for speedy information retrieval. Clustering of documents is collection of documents into groups such that the documents within each group are similar to each other and not to documents of other groups...

متن کامل

A hybrid DEA-based K-means and invasive weed optimization for facility location problem

In this paper, instead of the classical approach to the multi-criteria location selection problem, a new approach was presented based on selecting a portfolio of locations. First, the indices affecting the selection of maintenance stations were collected. The K-means model was used for clustering the maintenance stations. The optimal number of clusters was calculated through the Silhou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017