semi supervised clustering

نتایج جستجو برای: semi supervised clustering

تعداد نتایج: 271512 فیلتر نتایج به سال:

Fuzzy semi-supervised co-clustering for text documents

Journal: :Fuzzy Sets and Systems 2013

Yang Yan Lihui Chen William-Chandra Tjhi

In this paper we propose a new heuristic semi-supervised fuzzy co-clustering algorithm (SS-HFCR) for categorization of large web documents. In this approach, the clustering process is carried out by incorporating some prior knowledge in the form of pair-wise constraints provided by users into the fuzzy co-clustering framework. Each constraint specifies whether a pair of documents “must” or “can...

متن کامل

Semi-supervised spectral clustering with application to detect population stratification

2013

Binghui Liu Xiaotong Shen Wei Pan

In genetic association studies, unaccounted population stratification can cause spurious associations in a discovery process of identifying disease-associated genetic markers. In such a situation, prior information is often available for some subjects' population identities. To leverage the additional information, we propose a semi-supervised clustering approach for detecting population stratif...

متن کامل

Deep Clustering using Auto-Clustering Output Layer

Journal: :CoRR 2017

Ozsel Kilinc Ismail Uysal

In this paper, we propose a novel method to enrich the representation provided to the output layer of feedforward neural networks in the form of an auto-clustering output layer (ACOL) which enables the network to naturally create sub-clusters under the provided main class labels. In addition, a novel regularization term is introduced which allows ACOL to encourage the neural network to reveal i...

متن کامل

Hierarchical and Reweighting Cluster Kernels for Semi-Supervised Learning

2010

Z. Bodó L. Csató Zalán Bodó Lehel Csató

Abstract: Recently semi-supervised methods gained increasing attention and many novel semi-supervised learning algorithms have been proposed. These methods exploit the information contained in the usually large unlabeled data set in order to improve classification or generalization performance. Using data-dependent kernels for kernel machines one can build semi-supervised classifiers by buildin...

متن کامل

Semi-Supervised Clustering via Matrix Factorization

2008

Fei Wang Tao Li Changshui Zhang

The recent years have witnessed a surge of interests of semi-supervised clustering methods, which aim to cluster the data set under the guidance of some supervisory information. Usually those supervisory information takes the form of pairwise constraints that indicate the similarity/dissimilarity between the two points. In this paper, we propose a novel matrix factorization based approach for s...

متن کامل

Semi-Supervised Learning Using Kernel Spectral Clustering Core Model

2015

Siamak Mehrkanoon Johan A.K. Suykens

A multi-class semi-supervised learning algorithm formulated as a regularized kernel spectral clustering (KSC) approach is proposed. The method is bale to address both semi-supervised classification and clustering. In addition a low embedding dimension is utilized to reveal the existing number of clusters. Thanks to the Nyström approximation technique, the approach can be scaled up for analyzing...

متن کامل

Semi-Supervised Learning for Web Text Clustering

2006

Bingru Yang Wei Song Zhangyan Xu

Supervised learning algorithms usually require large amounts of training data to learn reasonably accurate classifiers. Yet, for many text classification tasks, providing labeled training documents is expensive, while unlabeled documents are readily available in large quantities. Learning from both, labeled and unlabeled documents, in a semi-supervised framework is a promising approach to reduc...

متن کامل

Semi-Supervised Clustering Using Genetic Algorithms

1999

Ayhan Demiriz Kristin P. Bennett Mark J. Embrechts

A semi-supervised clustering algorithm is proposed that combines the benefits of supervised and unsupervised learning methods. Data are segmented/clustered using an unsupervised learning technique that is biased toward producing segments or clusters as pure as possible in terms of class distribution. These clusters can then be used to predict the class of future points. For example in database ...

متن کامل

Semi-supervised Clustering using Combinatorial MRFs

2006

Ron Bekkerman Mehran Sahami

A combinatorial random variable is a discrete random variable defined over a combinatorial set (e.g., a power set of a given set). In this paper we introduce combinatorial Markov random fields (Comrafs), which are Markov random fields where some of the nodes are combinatorial random variables. We argue that Comrafs are powerful models for unsupervised learning by showing their relationship with...

متن کامل

Relaxed Oracles for Semi-Supervised Clustering

Journal: :CoRR 2017

Taewan Kim Joydeep Ghosh

Pairwise “same-cluster” queries are one of the most widely used forms of supervision in semi-supervised clustering. However, it is impractical to ask human oracles to answer every query correctly. In this paper, we study the influence of allowing “not-sure” answers from a weak oracle and propose an effective algorithm to handle such uncertainties in query responses. Two realistic weak oracle mo...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید