A Detailed Study and Analysis of different Partitional Data Clustering Techniques

نویسنده

  • Mydhili K Nair
چکیده

The concept of Data Clustering is considered to be very significant in various application areas like text mining, fraud detection, health care, image processing, bioinformatics etc. Due to its application in a variety of domains, various techniques are presented by many research domains in the literature. Data Clustering is one of the important tasks that make up Data Mining. Clustering can be classified into different types such as partitional, hierarchical, spectral, density-based, grid-based, model based etc. Among the different types of clustering available, partitional clustering is the most widely used in most of the applications since the computation involved is not very complex. Hence lot of research has been carried out in clustering using partitional method. In this paper, it is proposed to do a comprehensive study of the different partitional clustering techniques used in the literature which will also provide an insight into the recent problems in the same area. In this paper, sixteen research articles have been taken which are published by different publishers between the years 2005 and 2013. Various algorithms come under partitional clustering among which Bisecting K-Means is an excellent one that gives a good quality output for clustering large number of data. Also a broad analysis is carried out to provide an insight into the importance of the various approaches which can in turn throw light to developments in the same area.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

C ONSTRAINT BASED P ARTITIONAL C LUSTERING – A C OMPREHENSIVE S TUDY AND A NALYSIS Aparna

Data clustering is the concept of forming predefined number of clusters where the data points within each cluster are very similar to each other and the data points between clusters are dissimilar to each other. The concept of clustering is widely used in various domains like bioinformatics, medical data, imaging, marketing study and crime analysis. The popular types of clustering techniques ar...

متن کامل

Partitional Clustering Experiments on Document Datasets

The purpose of this study is evaluation and comparison of some criterion functions used for document clustering. Each function is evaluated by using different clustering methods and different datasets. Detailed experiments show that some clustering criterion functions perform better than rest. Results of experiments are also consistent with previous works which compares same criterion functions.

متن کامل

Shared farthest neighbor approach to clustering of high dimensionality, low cardinality data

Clustering algorithms are routinely used in biomedical disciplines, and are a basic tool in bioinformatics. Depending on the task at hand, there are two most popular options, the central partitional techniques and the Agglomerative Hierarchical Clustering techniques and their derivatives. These methods are well studied and well established. However, both categories have some drawbacks related t...

متن کامل

PSO based Multidimensional Data Clustering: A Survey

Data clustering is considered as one of the most promising data analysis methods in data mining and on the other side KMeans is the well known partitional clustering technique. Nevertheless, K-Means and other partitional clustering techniques struggle with some challenges where dimension is the core concern. The different challenges associated with clustering techniques are preknowledge of init...

متن کامل

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014