نتایج جستجو برای: semi supervised clustering

تعداد نتایج: 271512  

Journal: :Fuzzy Sets and Systems 2013
Yang Yan Lihui Chen William-Chandra Tjhi

In this paper we propose a new heuristic semi-supervised fuzzy co-clustering algorithm (SS-HFCR) for categorization of large web documents. In this approach, the clustering process is carried out by incorporating some prior knowledge in the form of pair-wise constraints provided by users into the fuzzy co-clustering framework. Each constraint specifies whether a pair of documents “must” or “can...

2013
Binghui Liu Xiaotong Shen Wei Pan

In genetic association studies, unaccounted population stratification can cause spurious associations in a discovery process of identifying disease-associated genetic markers. In such a situation, prior information is often available for some subjects' population identities. To leverage the additional information, we propose a semi-supervised clustering approach for detecting population stratif...

Journal: :CoRR 2017
Ozsel Kilinc Ismail Uysal

In this paper, we propose a novel method to enrich the representation provided to the output layer of feedforward neural networks in the form of an auto-clustering output layer (ACOL) which enables the network to naturally create sub-clusters under the provided main class labels. In addition, a novel regularization term is introduced which allows ACOL to encourage the neural network to reveal i...

2010
Z. Bodó L. Csató Zalán Bodó Lehel Csató

Abstract: Recently semi-supervised methods gained increasing attention and many novel semi-supervised learning algorithms have been proposed. These methods exploit the information contained in the usually large unlabeled data set in order to improve classification or generalization performance. Using data-dependent kernels for kernel machines one can build semi-supervised classifiers by buildin...

2008
Fei Wang Tao Li Changshui Zhang

The recent years have witnessed a surge of interests of semi-supervised clustering methods, which aim to cluster the data set under the guidance of some supervisory information. Usually those supervisory information takes the form of pairwise constraints that indicate the similarity/dissimilarity between the two points. In this paper, we propose a novel matrix factorization based approach for s...

2015
Siamak Mehrkanoon Johan A.K. Suykens

A multi-class semi-supervised learning algorithm formulated as a regularized kernel spectral clustering (KSC) approach is proposed. The method is bale to address both semi-supervised classification and clustering. In addition a low embedding dimension is utilized to reveal the existing number of clusters. Thanks to the Nyström approximation technique, the approach can be scaled up for analyzing...

2006
Bingru Yang Wei Song Zhangyan Xu

Supervised learning algorithms usually require large amounts of training data to learn reasonably accurate classifiers. Yet, for many text classification tasks, providing labeled training documents is expensive, while unlabeled documents are readily available in large quantities. Learning from both, labeled and unlabeled documents, in a semi-supervised framework is a promising approach to reduc...

1999
Ayhan Demiriz Kristin P. Bennett Mark J. Embrechts

A semi-supervised clustering algorithm is proposed that combines the benefits of supervised and unsupervised learning methods. Data are segmented/clustered using an unsupervised learning technique that is biased toward producing segments or clusters as pure as possible in terms of class distribution. These clusters can then be used to predict the class of future points. For example in database ...

2006
Ron Bekkerman Mehran Sahami

A combinatorial random variable is a discrete random variable defined over a combinatorial set (e.g., a power set of a given set). In this paper we introduce combinatorial Markov random fields (Comrafs), which are Markov random fields where some of the nodes are combinatorial random variables. We argue that Comrafs are powerful models for unsupervised learning by showing their relationship with...

Journal: :CoRR 2017
Taewan Kim Joydeep Ghosh

Pairwise “same-cluster” queries are one of the most widely used forms of supervision in semi-supervised clustering. However, it is impractical to ask human oracles to answer every query correctly. In this paper, we study the influence of allowing “not-sure” answers from a weak oracle and propose an effective algorithm to handle such uncertainties in query responses. Two realistic weak oracle mo...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید