Amplifying the Block Matrix Structure for Spectral Clustering
نویسنده
چکیده
Spectral clustering methods perform well in cases where classical methods (K-means, single linkage, etc.) fail. However, for very non-compact clusters, they also tend to have problems. In this paper, we propose three improvements which we show that perform better in such cases. We suggest that spectral decomposition is merely a method for determining the block structure of the affinity matrix. Consequently, it is advantageous for clustering techniques if the affinity matrix has a clear block structure. We propose two independent steps to achieve this goal. In the first, which we term context-dependent affinity, we compute point affinities by taking their neighborhoods into account. In the second, the conductivity method, we aim at amplifying the block structure of the affinity matrix. Combining these two enables us to achieve a clear block-diagonal structure, despite starting with very weak affinities. For the last step, clustering spectral images, K-means is commonly used. Instead, as a third improvement, we suggest using our K-lines algorithm. When compared to other clustering algorithms, our methods display promising performance on both artificial and real-world data sets.
منابع مشابه
On the Remarkable Formula for Spectral Distance of Block Southeast Submatrix
This paper presents a remarkable formula for spectral distance of a given block normal matrix $G_{D_0} = begin{pmatrix} A & B \ C & D_0 end{pmatrix} $ to set of block normal matrix $G_{D}$ (as same as $G_{D_0}$ except block $D$ which is replaced by block $D_0$), in which $A in mathbb{C}^{ntimes n}$ is invertible, $ B in mathbb{C}^{ntimes m}, C in mathbb{C}^{mti...
متن کاملNew Methods for Spectral Clustering
Analyzing the affinity matrix spectrum is an increasingly popular data clustering method. We propose three new algorithmic components which are appropriate for improving performance of spectral clustering. First, observing the eigenvectors suggests to use a K-lines algorithm instead of the commonly applied K-means. Second, the clustering works best if the affinity matrix has a clear block struc...
متن کاملConsistent parameter estimation in general stochastic block models with overlaps
This paper considers the parameter estimation problem in Stochastic Block Model with Overlaps (SBMO), which is a quite general instance of random graph model allowing for overlapping community structure. We present the new algorithm successive projection overlapping clustering (SPOC) which combines the ideas of spectral clustering and geometric approach for separable non-negative matrix factori...
متن کاملOn the Spectrum of a Family of Preconditioned Block Toeplitz Matrices
Abstract. Research on preconditioning Toeplitz matrices with circulant matrices has been active recently. The preconditioning technique can be easily generalized to block Toeplitz matrices. That is, for a block Toeplitz matrix T consisting of N N blocks with M M elements per block, a block circulant matrix R is used with the same block structure as its preconditioner. In this research, the spec...
متن کاملSpectral clustering and the high-dimensional Stochastic Block Model
Networks or graphs can easily represent a diverse set of data sources that are characterized by interacting units or actors. Social networks, representing people who communicate with each other, are one example. Communities or clusters of highly connected actors form an essential feature in the structure of several empirical networks. Spectral clustering is a popular and computationally feasibl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005