Efficient and Compact Representations of Some Non-canonical Prefix-Free Codes
نویسندگان
چکیده
For many kinds of prefix-free codes there are efficient and compact alternatives to the traditional tree-based representation. Since these put the codes into canonical form, however, they can only be used when we can choose the order in which codewords are assigned to characters. In this paper we first show how, given a probability distribution over an alphabet of σ characters, we can store a nearly optimal alphabetic prefix-free code in o(σ) bits such that we can encode and decode any character in constant time. We then consider a kind of code introduced recently to reduce the space usage of wavelet matrices (Claude, Navarro, and Ordóñez, Information Systems, 2015). They showed how to build an optimal prefix-free code such that the codewords’ lengths are non-decreasing when they are arranged such that their reverses are in lexicographic order. We show how to store such a code in O (
منابع مشابه
Canonical Trees, Compact Prefix-Free Codes, and Sums of Unit Fractions: A Probabilistic Analysis
For fixed t ≥ 2, we consider the class of representations of 1 as sum of unit fractions whose denominators are powers of t or equivalently the class of canonical compact t-ary Huffman codes or equivalently rooted t-ary plane “canonical” trees. We study the probabilistic behaviour of the height (limit distribution is shown to be normal), the number of distinct summands (normal distribution), the...
متن کاملFast Construction of Disposable Prefix-Free Codes
Some data compression techniques use large numbers of prefix-free codes. The following two techniques do so: adaptive Huffman encoding and bit recycling. Adaptive Huffman encoding allows successive symbols to be encoded where each one is encoded according to the statistics of the symbols seen so far. Bit recycling, on the other hand, is a technique that is designed to improve the efficiency of ...
متن کامل-
Consider the semidirect product group H ×? K, where H and K are two arbitrary locally compact groups and K is also abelian. We introduce the continuous wavelet transform associated to some square integrable representations H ×? K. Moreover, we try to obtain a concrete form for admissible vectors of these integrable representations.
متن کاملFinite State Models in the Study of Comma-Free Codes
The definition of comma-free codes is extended to be more comprehensive and include codes with variable length codewords and codes with an unlimited number of codewords. Finite state recognizers are used to represent commafree codes in a compact form, making available the well established methods of manipulating and analysing finite state models. An efficient algorithm is presented to test the ...
متن کاملFaster Implementation of Canonical Minimum Redundancy Prefix Codes
Huffman coding [2] has been widely used in data, image, and video compression. Based on the single-side growing Huffman tree (SHT), Moffat and Turpin [3] recently presented an efficient data structure for representing the SHT. It requires (n + 2d)w bits, where n denotes the number of source symbols; d denotes the depth of the SHT; and w bits are required to save one source symbol or one integer...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016