Longest Common Extensions with Recompression
نویسنده
چکیده
Given two positions i and j in a string T of length N , a longest common extension (LCE) query asks for the length of the longest common prefix between suffixes beginning at i and j. A compressed LCE data structure is a data structure that stores T in a compressed form while supporting fast LCE queries. In this article we show that the recompression technique is a powerful tool for compressed LCE data structures. We present a new compressed LCE data structure of size O(z lg(N/z)) that supports LCE queries in O(lgN) time, where z is the size of Lempel-Ziv 77 factorization without self-reference of T. Given T as an uncompressed form, we show how to build our data structure in O(N) time and space. Given T as a grammar compressed form, i.e., an straight-line program of size n generating T, we show how to build our data structure in O(n lg(N/n)) time and O(n + z lg(N/z)) space. Our algorithms are deterministic and always return correct answers.
منابع مشابه
Modifications of the Landau-Vishkin Algorithm Computing Longest Common Extensions via Suffix Arrays and Efficient RMQ computations
Approximate string matching is an important problem in Computer Science. The standard solution for this problem is an O(mn) running time and space dynamic programming algorithm for two strings of length m and n. Landau and Vishkin developed an algorithm which uses suffix trees for accelerating the computation along the dynamic programming table and reaching space and running time in O(nk), wher...
متن کاملA Modification of the Landau-Vishkin Algorithm Computing Longest Common Extensions via Suffix Arrays
Approximate string matching is an essential problem in many areas related to Computer Science including biological sequence processing. The standard solution of this problem is an O(mn) running time and space dynamic programming algorithm for two strings of length m and n. Landau and Vishkin developed an algorithm which uses suffix trees for accelerating the computation along the dynamic progra...
متن کاملFinding All Solutions of Equations in Free Groups and Monoids with Involution
The aim of this paper is to present a PSPACE algorithm which yields a finite graph of exponential size and which describes the set of all solutions of equations in free groups as well as the set of all solutions of equations in free monoids with involution in the presence of rational constraints. This became possible due to the recently invented recompression technique of the second author. He ...
متن کاملLongest Common Extensions in Sublinear Space
The longest common extension problem (LCE problem) is to construct a data structure for an input string T of length n that supports LCE(i, j) queries. Such a query returns the length of the longest common prefix of the suffixes starting at positions i and j in T . This classic problem has a well-known solution that uses O(n) space and O(1) query time. In this paper we show that for any trade-of...
متن کاملDecompression Illness: Clinical Aspects of 5278 Consecutive Cases Treated in a Single Hyperbaric Unit
BACKGROUND Decompression illness (DCI) is a major concern in pressure-related activities. Due to its specific prerequisite conditions, DCI is rare in comparison with other illnesses and most physicians are inexperienced in treatment. In a fishery area in northern China, during the past decade, tens of thousands of divers engaged in seafood harvesting and thousands suffered from DCI. We establis...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017