نتایج جستجو برای: edit distance

تعداد نتایج: 242096  

2009
Timo Mertens Daniel Schneider Joachim Köhler

We describe how complementary search spaces, addressed by two different methods used in Spoken Term Detection (STD), can be merged for German subword STD. We propose fuzzysearch techniques on lattices to narrow the gap between subword and word retrieval. The first technique is based on an edit-distance, where no a priori knowledge about confusions is employed. Additionally, we propose a weighti...

2004
Maarten Grachten Josep Lluís Arcos

The concept of melodic similarity has become increasingly relevant in the light of music retrieval and music content processing systems. We propose a new way of measuring melodic similarity, based on analyses of the melody according to the Implication/Realization (I/R) model [7] for melodic structure and cognition. The similarity is assessed as the edit-distance between these I/R analyses. We p...

2011
Zhixu Li Laurianne Sitbon Xiaofang Zhou

This paper introduces PartSS, a new partition-based filtering for tasks performing string comparisons under edit distance constraints. PartSS offers improvements over the state-of-the-art method NGPP with the implementation of a new partitioning scheme and also improves filtering abilities by exploiting theoretical results on shifting and scaling ranges, thus accelerating the rate of calculatin...

Journal: :CoRR 2015
Roger Bilisoly

Walter Skeat published his critical edition of William Langland’s 14 century alliterative poem, Piers Plowman, in 1886. In preparation for this he located forty-five manuscripts, and to compare dialects, he published excerpts from each of these. This paper does three statistical analyses using these excerpts, each of which mimics a task he did in writing his critical edition. First, he combined...

Journal: :Pattern Recognition Letters 2003
Andrea Torsello Edwin R. Hancock

This paper presents a new method for computing the tree edit distance problem with uniform edit cost. We commence by showing that any tree obtained with a sequence of cut operations is a subtree of the transitive closure of the original tree, we show that the necessary condition for any subtree to be a solution can be reduced to a clique problem in a derived structure. Using this idea we transf...

2014
Ryan Cotterell Nanyun Peng Jason Eisner

String similarity is most often measured by weighted or unweighted edit distance d(x, y). Ristad and Yianilos (1998) defined stochastic edit distance—a probability distribution p(y | x) whose parameters can be trained from data. We generalize this so that the probability of choosing each edit operation can depend on contextual features. We show how to construct and train a probabilistic finite-...

Journal: :Procesamiento del Lenguaje Natural 2012
Marta Vila Mark Dras

Finding an adequate paraphrase representation formalism is a challenging issue in Natural Language Processing. In this paper, we analyse the performance of Tree Edit Distance as a paraphrase representation baseline. Our experiments using Edit Distance Textual Entailment Suite show that, as Tree Edit Distance consists of a purely syntactic approach, paraphrase alternations not based on structura...

2012
Marc Schoolderman Kees Koster Marc Seutter

Being able to automatically correct spelling errors is useful in cases where the set of documents is too vast to involve human interaction. In this bachelor's thesis, we investigate an implementation that attempts to perform such corrections using a lexicon and edit distance measure. We compare the familiar Levenshtein and Damerau-Levenshtein distances to modi cations where each edit operation ...

Journal: :CoRR 2012
Hicham Gueddah

In this paper, we present a new approach dedicated to correcting the spelling errors of the Arabic language. This approach corrects typographical errors like inserting, deleting, and permutation. Our method is inspired from the Levenshtein algorithm, and allows a finer and better scheduling than Levenshtein. The results obtained are very satisfactory and encouraging, which shows the interest of...

نمودار تعداد نتایج جستجو در هر سال

با کلیک روی نمودار نتایج را به سال انتشار فیلتر کنید