نتایج جستجو برای: word clustering

تعداد نتایج: 205729  

2008
Mohammad Bahrani Hossein Sameti Nazila Hafezi Saeedeh Momtazi

In this paper a new method for automatic word clustering is presented. We used this method for building n-gram language models for Persian continuous speech recognition (CSR) systems. In this method, each word is specified by a feature vector that represents the statistics of parts of speech (POS) of that word. The feature vectors are clustered by k-means algorithm. Using this method causes a r...

Journal: :Journal of Clinical and Experimental Neuropsychology 2013

2017
Dmitry Ustalov Alexander Panchenko Christian Biemann

This paper presents a new graph-based approach that induces synsets using synonymy dictionaries and word embeddings. First, we build a weighted graph of synonyms extracted from commonly available resources, such as Wiktionary. Second, we apply word sense induction to deal with ambiguous words. Finally, we cluster the disambiguated version of the ambiguous input graph into synsets. Our meta-clus...

2007
Lean YU Shouyang WANG Kin Keung LAI

In this study, a multistage modular self-organizing map (SOM) model is proposed for parallel web text clustering. In the first stage, the large textual datasets are divided into some small disjoint datasets (i.e., task decomposition). In the second stage, each small data set is input into different unitary SOM models for word clustering map (i.e., modularization learning). In this stage, differ...

Journal: :CoRR 2015
Manan Mohan Goyal Neha Agrawal Manoj Kumar Sarma Nayan Jyoti Kalita

Keeping in consideration the high demand for clustering, this paper focuses on understanding and implementing K-means clustering using two different similarity measures. We have tried to cluster the documents using two different measures rather than clustering it with Euclidean distance. Also a comparison is drawn based on accuracy of clustering between fuzzy and cosine similarity measure. The ...

Journal: :Speech Communication 1995
Sven C. Martin Jörg Liermann Hermann Ney

CLUSTERING Sven Martin, J org Liermann, Hermann Ney Lehrstuhl f ur Informatik VI, RWTH Aachen, University of Technology, D-52056 Aachen, Germany ABSTRACT. This paper presents and analyzes improved algorithms for clustering bigram and trigram word equivalence classes, and their respective results: 1) We give a detailed time complexity analysis of bigram clustering algorithms. 2) We present an ...

Journal: :Cognitive science 2010
Kit Ying Chan Michael S. Vitevitch

Network science provides a new way to look at old questions in cognitive science by examining the structure of a complex system, and how that structure might influence processing. In the context of psycholinguistics, clustering coefficient-a common measure in network science-refers to the extent to which phonological neighbors of a target word are also neighbors of each other. The influence of ...

2016
Taeho Jo

In this research, we propose the string vector based AHC (Agglomerative Hierarchical Clustering) algorithm as the approach to the word clustering. In the previous works on text clustering, it was successful to encode texts into string vectors by improving the performance of text clustering; it provided the motivation of doing this research. In this research, we encode words into string vectors,...

2017
Jianbo Ye Yanran Li Zhaohui Wu James Zijun Wang Wenjie Li Jia Li

Word embeddings have become widelyused in document analysis. While a large number of models for mapping words to vector spaces have been developed, it remains undetermined how much net gain can be achieved over traditional approaches based on bag-of-words. In this paper, we propose a new document clustering approach by combining any word embedding with a state-of-the-art algorithm for clusterin...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید