kmeans clustering

نتایج جستجو برای: kmeans clustering

تعداد نتایج: 103000 فیلتر نتایج به سال:

Frequency Sensitive Competitive Learning for Balanced Clustering on High-dimensional Hyperspheres

2004

Arindam Banerjee

Competitive learning mechanisms for clustering in general suffer from poor performance for very high dimensional ( ) data because of “curse of dimensionality” effects. In applications such as document clustering, it is customary to normalize the high dimensional input vectors to unit length, and it is sometimes also desirable to obtain balanced clusters, i.e., clusters of comparable sizes. The ...

متن کامل

Clustering with Bregman Divergences

Journal: :Journal of Machine Learning Research 2004

Arindam Banerjee Srujana Merugu Inderjit S. Dhillon Joydeep Ghosh

A wide variety of distortion functions, such as squared Euclidean distance, Mahalanobis distance, Itakura-Saito distance and relative entropy, have been used for clustering. In this paper, we propose and analyze parametric hard and soft clustering algorithms based on a large class of distortion functions known as Bregman divergences. The proposed algorithms unify centroid-based parametric clust...

متن کامل

Measuring unsupervised acoustic clustering through phoneme pair merge-and-split tests

2005

John Kominek Alan W. Black

Subphonetic discovery through segmental clustering is a central step in building a corpus-based synthesizer. To help decide what clustering algorithm to use we employed mergeand-split tests on English fricatives. Compared to reference of 2%, Gaussian EM achieved a misclassification rate of 6%, Kmeans 10%, while predictive CART trees performed poorly.

متن کامل

Actionable Clustering

2014

Amit Dhurandhar Xiang Wang

In this paper, we define a new notion for a clustering to be useful, called actionable clustering. This notion is motivated by applications across various domains such as in business, education, public policy and healthcare. We formalize this notion by adding a novel constraint to traditional unsupervised clustering. We argue that this notion is different from semi-supervised clustering, superv...

متن کامل

Optimizing K-Means by Fixing Initial Cluster Centers

2014

Neeti Arora Mahesh Motwani

Data mining techniques help in business decision making and predicting behaviors and future trends. Clustering is a data mining technique used to make groups of objects that are somehow similar in characteristics. Clustering analyzes data objects without consulting a known class label or category i.e. it is an unsupervised data mining technique. Kmeans is a widely used partitional clustering al...

متن کامل

Clustering Methods without Given Number of Clusters

2013

Peng Xu Fei Liu

As we know, kmeans method is a very effective algorithm of clustering. Its most powerful feature is the scalability and simplicity. However, the most disadvantage is that we must know the number of clusters in the first place, which is usually a difficult problem in practice. In this paper, we propose a new approach– peak-searching clustering– to realize clustering without given the number of c...

متن کامل

Active Learning with Clustering

2011

Zalán Bodó Zsolt Minier Lehel Csató

Active learning is an important field of machine learning and it is becoming more widely used in case of problems where labeling the examples in the training data set is expensive. In this paper we present a clustering-based algorithm used in the Active Learning Challenge. The algorithm is based on graph clustering with normalized cuts, and uses kmeans to extract representative points from the ...

متن کامل

A Distributed Agent Implementation of Multiple Species Flocking Model for Document Partitioning Clustering

2006

Xiaohui Cui Thomas E. Potok

The Flocking model, first proposed by Craig Reynolds, is one of the first bio-inspired computational collective behavior models that has many popular applications, such as animation. Our early research has resulted in a flock clustering algorithm that can achieve better performance than the Kmeans or the Ant clustering algorithms for data clustering. This algorithm generates a clustering of a g...

متن کامل

A Text Clustering System based on k-means Type Subspace Clustering and Ontology

2006

Liping Jing Michael K. Ng Xinhua Yang Joshua Zhexue Huang

This paper presents a text clustering system developed based on a k-means type subspace clustering algorithm to cluster large, high dimensional and sparse text data. In this algorithm, a new step is added in the k-means clustering process to automatically calculate the weights of keywords in each cluster so that the important words of a cluster can be identified by the weight values. For unders...

متن کامل

Breast Cancer Mass Detection in Mammograms using Kmeans and Fuzzy Cmeans Clustering

Journal: :International Journal of Computer Applications 2011

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید