text similarity

نتایج جستجو برای: text similarity

تعداد نتایج: 268086 فیلتر نتایج به سال:

Text-Based User-kNN: Measuring User Similarity Based on Text Reviews

2014

Maria Terzi Matthew Rowe Maria Angela Ferrario Jon Whittle

This article reports on a modification of the user-kNN algorithm that measures the similarity between users based on the similarity of text reviews, instead of ratings. We investigate the performance of text semantic similarity measures and we evaluate our text-based user-kNN approach by comparing it to a range of ratings-based approaches in a ratings prediction task. We do so by using datasets...

متن کامل

Semantic Similarity Measures for Malay Sentences

2007

Shahrul Azman Mohd. Noah Amru Yusrin Amruddin Nazlia Omar

The concept of semantic similarity is an important element in many applications such as information extraction, information retrieval, document clustering and ontology learning. Most of the previous works regarding semantic similarity measures have been traditionally defined between words or concepts (i.e. word-to-word similarity), thus ignoring the text or sentence that the concepts participat...

متن کامل

A Fuzzy Similarity Approach in Text Classification Task

2004

Dwi H. Widyantoro John Yen

We present a fuzzy similarity approach to solve a text categorization problem. The effectiveness of various fuzzy conjunction and disjunction operators used in fuzzy similarity formula and several document representations were evaluated using test sets from three text document collections. Based on empirical results obtained from using these collections, a special case of the fuzzy similarity f...

متن کامل

A German Corpus for Text Similarity Detection Tasks

Journal: :CoRR 2017

Juan-Manuel Torres-Moreno Gerardo Sierra Peter Peinl

Text similarity detection aims at measuring the degree of similarity between a pair of texts. Corpora available for text similarity detection are designed to evaluate the algorithms to assess the paraphrase level among documents. In this paper we present a textual German corpus for similarity detection. The purpose of this corpus is to automatically assess the similarity between a pair of texts...

متن کامل

Detecting Text Similarity over Short Passages: Exploring Linguistic Feature Combinations via Machine Learning

1999

Vasileios Hatzlvassiloglou Judith L. Klavans Eleazar Eskin

We present a new composite similarity metric that combines information from multiple linguistic indicators to measure semantic distance between pairs of small textual units. Several potential features are investigated and an optireal combination is selected via machine learning. We discuss a more restrictive definition of similarity than traditional, document-level and information retrieval-ori...

متن کامل

The Similarity of Essay Examination Results using Preprocessing Text Mining with Cosine Similarity and Nazief-Adriani Algorithms

Journal: :Turkish Journal of Computer and Mathematics Education (TURCOMAT) 2021

متن کامل

Similarity Measures for Short Segments of Text

2007

Donald Metzler Susan T. Dumais Christopher Meek

Measuring the similarity between documents and queries has been extensively studied in information retrieval. However, there are a growing number of tasks that require computing the similarity between two very short segments of text. These tasks include query reformulation, sponsored search, and image retrieval. Standard text similarity measures perform poorly on such tasks because of data spar...

متن کامل

Short Text Similarity Measure Based on Double Vector Space Model

2016

Ying Liu Dongmei Li

Short text similarity measure is the basis of classification and duplicate checking of the short texts. Allowing for the insufficient consideration of the sentence semantic and structure information in similarity calculation between two short texts, we propose a novel method of short text similarity calculation based on double vector space model on the basis of traditional vector space model. C...

متن کامل

Discriminative Topic Segmentation of Text and Speech

2010

Mehryar Mohri Pedro J. Moreno Eugene Weinstein

We explore automated discovery of topicallycoherent segments in speech or text sequences. We give two new discriminative topic segmentation algorithms which employ a new measure of text similarity based on word co-occurrence. Both algorithms function by finding extrema in the similarity signal over the text, with the latter algorithm using a compact support-vector based description of a window ...

متن کامل

Calculating Statistical Similarity between Sentences

2011

Junsheng Zhang Yunchuan Sun Huilin Wang Yanqing He

Sentence similarity plays an important role in text-related research and applications. It is closely related to word similarity and document similarity. The statistical similarity measures between sentences, based on symbolic characteristics and structural information, could measure the similarity between sentences without any prior knowledge but only on the statistical information of sentences...

متن کامل

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید