نتایج جستجو برای: k means cluster

تعداد نتایج: 880962  

2005
Alfred Ultsch

A new clustering algorithm based on grid projections is proposed. This algorithm, called U*C, uses distance information together with density structures. The number of clusters is determined automatically. The validity of the clusters found can be judged by the U*-Matrix visualization on top of the grid. A U*-Matrix gives a combined visualization of distance and density structures of a high dim...

2013
Przemyslaw Spurek Jacek Tabor Krzysztof Misztal

k-means is the basic method applied in many data clustering problems. As is known, its natural modification can be applied to projection clustering by changing the cost function from the squared-distance from the point to the squared distance from the affine subspace. However, to apply thus approach we need the beforehand knowledge of the dimension. In this paper we show how to modify this appr...

2004
Sanjiv K. Bhatia

Clustering is used to organize data for efficient retrieval. One of the problems in clustering is the identification of clusters in given data. A popular technique for clustering is based on K-means such that the data is partitioned into K clusters. In this method, the number of clusters is predefined and the technique is highly dependent on the initial identification of elements that represent...

2014
Raheela Asif Agathe Merceron Mahmood K. Pathan

This paper investigates how performance of students progresses during their studies. Progression of a student is defined as a tuple that shows how a year average stays the same, increases or decreases compared to first year. Taking the data of two consecutive cohorts and using k-means clustering, five meaningful types of progressions are put in evidence and intuitively visualized with a deviati...

2015
Leszek J. Chmielewski Maciej Janowicz Arkadiusz Orlowski

K-means clustering algorithm has been used to classify patterns of Japanese candlesticks which accompany the prices of several assets registered in the Warsaw stock exchange (GPW). It has been found that the trend reversals seem to be preceded by specific combinations of candlesticks with notable frequency. Surprisingly, the same patterns appear in both bullish and bearish trend reversals. The ...

1999
Clara Pizzuti Domenico Talia Giorgio Vonella

A method for the initialisation step of clustering algorithms is presented. It is based on the concept of cluster as a high density region of points. The search space is modelled as a set of d-dimensional cells. A sample of points is chosen and located into the appropriate cells. Cells are iteratively split as the number of points they receive increases. The regions of the search space having a...

2004
Yuji Kaneda Naonori Ueda Kazumi Saito

In this paper, we propose a new document clustering method based on the K-means method (kmeans). In our method, we allow only finite candidate vectors to be representative vectors of kmeans. We also propose a method for constructing these candidate vectors using documents that have the same word. We participated in NTCIR-4 WEB Task D (Topic Classification Task) and experimentally compared our m...

2016
Chaoyue Liu Mikhail Belkin

Clustering, in particular k-means clustering, is a central topic in data analysis. Clustering with Bregman divergences is a recently proposed generalization of k-means clustering which has already been widely used in applications. In this paper we analyze theoretical properties of Bregman clustering when the number of the clusters k is large. We establish quantization rates and describe the lim...

2004
Guihong Cao Dawei Song Peter Bruza

One way of representing semantics could be via a high dimensional conceptual space constructed by certain lexical semantic space models. Concepts (words), represented as a vector of other words in the semantic space, can be categorized via clustering techniques into a number of regions reflecting different contexts. The conventional clustering algorithms, e.g., K-means method, however, normally...

Journal: :J. Classification 2010
Mark Ming-Tso Chiang Boris G. Mirkin

The issue of determining “the right number of clusters” in K-Means has attracted considerable interest, especially in the recent years. Cluster overlap appears to be a factor most affecting the clustering results. This paper proposes an experimental setting for comparison of different approaches at data generated from Gaussian clusters with the controlled parameters of betweenand within-cluster...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید