k means cluster

نتایج جستجو برای: k means cluster

تعداد نتایج: 880962 فیلتر نتایج به سال:

CNAK: Cluster number assisted K-means

Journal: :Pattern Recognition 2021

Determining the number of clusters present in a dataset is an important problem cluster analysis. Conventional clustering techniques generally assume this parameter to be provided up front. %user supplied. %Recently, robustness any given algorithm analyzed measure stability/instability which turn determines number. In paper, we propose method analyzes stability for predicting Under same computa...

متن کامل

An efficient K-means algorithm for Massive Data

Journal: :CoRR 2016

Marco Capó Aritz Pérez Martínez José Antonio Lozano

Due to the progressive growth of the amount of data available in a wide variety of scientific fields, it has become more difficult to manipulate and analyze such information. Even though datasets have grown in size, the K-means algorithm remains as one of the most popular clustering methods, in spite of its dependency on the initial settings and high computational cost, especially in terms of d...

متن کامل

Streaming k-means approximation

2009

Nir Ailon Ragesh Jaiswal Claire Monteleoni

We provide a clustering algorithm that approximately optimizes the k-means objective, in the one-pass streaming setting. We make no assumptions about the data, and our algorithm is very light-weight in terms of memory, and computation. This setting is applicable to unsupervised learning on massive data sets, or resource-constrained devices. The two main ingredients of our theoretical work are: ...

متن کامل

Notes on using Determinantal Point Processes for Clustering with Applications to Text Clustering

Journal: :CoRR 2014

Apoorv Agarwal Anna Choromanska Krzysztof Choromanski

In this paper, we compare three initialization schemes for the KMEANS clustering algorithm: 1) random initialization (KMEANSRAND), 2) KMEANS++, and 3) KMEANSD++. Both KMEANSRAND and KMEANS++ have a major that the value of k needs to be set by the user of the algorithms. (Kang 2013) recently proposed a novel use of determinantal point processes for sampling the initial centroids for the KMEANS a...

متن کامل

On Clustering Histograms with k-Means by Using Mixed α-Divergences

Journal: :Entropy 2014

Frank Nielsen Richard Nock Shun-ichi Amari

Clustering sets of histograms has become popular thanks to the success of the generic method of bag-of-X used in text categorization and in visual categorization applications. In this paper, we investigate the use of a parametric family of distortion measures, called the α-divergences, for clustering histograms. Since it usually makes sense to deal with symmetric divergences in information retr...

متن کامل

On the Communication Complexity of Distributed Clustering

Journal: :CoRR 2015

Qin Zhang

In this paper we give a first set of communication lower bounds for distributed clustering problems, in particular, for k-center, k-median and k-means. When the input is distributed across a large number of machines and the number of clusters k is small, our lower bounds match the current best upper bounds up to a logarithmic factor. We have designed a new composition framework in our proofs fo...

متن کامل

K-maximin clustering: a maximin correlation approach to partition-based clustering

Journal: :IEICE Electronic Express 2009

Taehoon Lee Seung Jean Kim Eui-Young Chung Sungroh Yoon

We propose a new clustering algorithm based upon the maximin correlation analysis (MCA), a learning technique that can minimize the maximum misclassification risk. The proposed algorithm resembles conventional partition clustering algorithms such as k-means in that data objects are partitioned into k disjoint partitions. On the other hand, the proposed approach is unique in that an MCA-based ap...

متن کامل

On the Consistency of k-means++ algorithm

Journal: :CoRR 2017

Mieczyslaw A. Klopotek

We prove in this paper that the expected value of the objective function of the k-means++ algorithm for samples converges to population expected value. As k-means++, for samples, provides with constant factor approximation for k-means objectives, such an approximation can be achieved for the population with increase of the sample size. This result is of potential practical relevance when one is...

متن کامل

The k-means-u* algorithm: non-local jumps and greedy retries improve k-means++ clustering

Journal: :CoRR 2017

Bernd Fritzke

We present a new clustering algorithm called k-means-u* which in many cases is able to significantly improve the clusterings found by k-means++, the current de-facto standard for clustering in Euclidean spaces. First we introduce the k-means-u algorithm which starts from a result of k-means++ and attempts to improve it with a sequence of non-local “jumps” alternated by runs of standard k-means....

متن کامل

K - Means Clustering for Automatic Image Segmentation

2008

Xiaoyi Jiang

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید