Search results for: Levenshtein algorithm

Number of results: 22948

Journal: Theor. Comput. Sci. 1995
Marie-France Sagot, Alain Viari, Henry Soldano

We present in this paper a peptide matching approach to the multiple comparison of a set of protein sequences. This approach consists in looking for all the words that are common to q of these sequences, where q is a parameter. The comparison between words is done by using as reference an object called a model. In the case of proteins, a model is a product of subsets of the alphabet Σ of the am...

Journal: Lontar Komputer : Jurnal Ilmiah Teknologi Informasi 2019

Journal: CoRR 2009
Filippo Petroni, Maurizio Serva

The idea of measuring distance between languages seems to have its roots in the work of the French explorer Dumont D’Urville (2). He collected comparative word lists of various languages during his voyages aboard the Astrolabe from 1826 to 1829 and, in his work about the geographical division of the Pacific, he proposed a method to measure the degree of relation among languages. The method use...

Journal: CoRR 2009
Filippo Petroni, Maurizio Serva

Phylogenetic trees can be reconstructed from the matrix which contains the distances between all pairs of languages in a family. Recently, we proposed a new method which uses normalized Levenshtein distances among words with the same meaning and averages over all the items of a given list. Decisions about the number of items in the input lists for language comparison have been debated since the begin...

Journal: Polibits 2012
Iulia Danaila, Liviu P. Dinu, Vlad Niculae, Octavia-Maria Sulea

Near-duplicate detection is important when dealing with large, noisy databases in data mining tasks. In this paper, we present the results of applying the Rank distance and the Smith-Waterman distance, along with more popular string similarity measures such as the Levenshtein distance, together with a disjoint set data structure, for the problem of near-duplicate detection.
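
The combination described in this abstract, a string distance plus a disjoint-set structure, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the sample records, and the `max_distance` threshold are hypothetical, only the Levenshtein distance is shown (not the Rank or Smith-Waterman distances), and the all-pairs comparison is quadratic, so it only suits small inputs.

```python
from itertools import combinations


def levenshtein(a: str, b: str) -> int:
    """Edit distance with unit-cost insertion, deletion and substitution."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


class DisjointSet:
    """Union-find over the indices 0..n-1."""

    def __init__(self, n: int) -> None:
        self.parent = list(range(n))

    def find(self, x: int) -> int:
        while self.parent[x] != x:
            self.parent[x] = self.parent[self.parent[x]]  # path halving
            x = self.parent[x]
        return x

    def union(self, x: int, y: int) -> None:
        root_x, root_y = self.find(x), self.find(y)
        if root_x != root_y:
            self.parent[root_y] = root_x


def near_duplicate_groups(records, max_distance=2):
    """Group records linked by chains of pairs within max_distance (assumed threshold)."""
    sets = DisjointSet(len(records))
    for i, j in combinations(range(len(records)), 2):
        if levenshtein(records[i], records[j]) <= max_distance:
            sets.union(i, j)
    groups = {}
    for i, record in enumerate(records):
        groups.setdefault(sets.find(i), []).append(record)
    return list(groups.values())


if __name__ == "__main__":
    sample = ["John Smith", "Jon Smith", "Jane Doe", "John Smyth"]  # made-up data
    print(near_duplicate_groups(sample))
```

Because the disjoint set merges transitively, two records can end up in the same group even when their own distance exceeds the threshold, as long as a chain of close pairs connects them.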

Journal: Des. Codes Cryptography 2017
Peter Boyvalenkov, P. D. Dragnev, Douglas P. Hardin, Edward B. Saff, Maya Stoyanova

We obtain universal bounds on the energy of codes and for designs in Hamming spaces. Our bounds hold for a large class of potential functions, allow unified treatment, and can be viewed as a generalization of the Levenshtein bounds for maximal codes.

2011
B. P. Pande, H. S. Dhami

In the present work, an attempt is made to develop a novel conflation method that exploits the phonetic quality of words and uses standard NLP tools such as LD (Levenshtein Distance) and LCS (Longest Common Subsequence) for the stemming process. General Terms: Information Retrieval (IR), Stemming.

1997
Jan Holub

We present a detailed description of the simulation of nondeterministic finite automata (NFA) for approximate string matching. This simulation uses bit parallelism, and the algorithm used is called the Shift-Or algorithm. Using knowledge of the simulation of NFA by the Shift-Or algorithm, we design a modification of the Shift-Or algorithm for approximate string matching using generalized Levenshtein distance and a modification for...
