Amplifying the Block Matrix Structure for Spectral Clustering

نویسنده

  • Igor Fischer
چکیده

Spectral clustering methods perform well in cases where classical methods (K-means, single linkage, etc.) fail. However, for very non-compact clusters, they also tend to have problems. In this paper, we propose three improvements which we show that perform better in such cases. We suggest that spectral decomposition is merely a method for determining the block structure of the affinity matrix. Consequently, it is advantageous for clustering techniques if the affinity matrix has a clear block structure. We propose two independent steps to achieve this goal. In the first, which we term context-dependent affinity, we compute point affinities by taking their neighborhoods into account. In the second, the conductivity method, we aim at amplifying the block structure of the affinity matrix. Combining these two enables us to achieve a clear block-diagonal structure, despite starting with very weak affinities. For the last step, clustering spectral images, K-means is commonly used. Instead, as a third improvement, we suggest using our K-lines algorithm. When compared to other clustering algorithms, our methods display promising performance on both artificial and real-world data sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Remarkable Formula for Spectral Distance of Block Southeast Submatrix

‎‎‎This paper presents a remarkable formula for spectral distance of a given block normal matrix $G_{D_0} = begin{pmatrix}‎ ‎A & B \‎ ‎C & D_0‎ ‎end{pmatrix} $ to set of block normal matrix $G_{D}$ (as same as $G_{D_0}$ except block $D$ which is replaced by block $D_0$)‎, ‎in which $A in mathbb{C}^{ntimes n}$ is invertible‎, ‎$ B in mathbb{C}^{ntimes m}‎, ‎C in mathbb{C}^{mti...

متن کامل

New Methods for Spectral Clustering

Analyzing the affinity matrix spectrum is an increasingly popular data clustering method. We propose three new algorithmic components which are appropriate for improving performance of spectral clustering. First, observing the eigenvectors suggests to use a K-lines algorithm instead of the commonly applied K-means. Second, the clustering works best if the affinity matrix has a clear block struc...

متن کامل

Consistent parameter estimation in general stochastic block models with overlaps

This paper considers the parameter estimation problem in Stochastic Block Model with Overlaps (SBMO), which is a quite general instance of random graph model allowing for overlapping community structure. We present the new algorithm successive projection overlapping clustering (SPOC) which combines the ideas of spectral clustering and geometric approach for separable non-negative matrix factori...

متن کامل

On the Spectrum of a Family of Preconditioned Block Toeplitz Matrices

Abstract. Research on preconditioning Toeplitz matrices with circulant matrices has been active recently. The preconditioning technique can be easily generalized to block Toeplitz matrices. That is, for a block Toeplitz matrix T consisting of N N blocks with M M elements per block, a block circulant matrix R is used with the same block structure as its preconditioner. In this research, the spec...

متن کامل

Spectral clustering and the high-dimensional Stochastic Block Model

Networks or graphs can easily represent a diverse set of data sources that are characterized by interacting units or actors. Social networks, representing people who communicate with each other, are one example. Communities or clusters of highly connected actors form an essential feature in the structure of several empirical networks. Spectral clustering is a popular and computationally feasibl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005