Dynamic Nonuniform Data Approximation in Databases with Haar Wavelet

نویسندگان

  • Su Chen
  • Antonio Nucci
چکیده

Data synopsis is a lossy compressed representation of data stored into databases that helps the query optimizer to speed up the query process, e.g. time to retrieve the data from the database. An efficient data synopsis must provide accurate information about the distribution of data to the query optimizer at any point in time. Due to the fact that some data will be queried more often than others, a good data synopsis should consider the use of nonuniform accuracy, e.g. provide better approximation of data that are queried the most. Although, the generation of data synopsis is a critical step to achieve a good approximation of the initial data representation, data synopsis must be updated over time when dealing with time varying data. In this paper, we introduce new Haar wavelet synopses for nonuniform accuracy and time-varying data that can be generated in linear time and space, and updated in sublinear time. We further introduce two linear algorithms, called 2-Step and M-Step for the Point-wise approximation problem that clearly outperforms previous algorithms known in literature, and two new algorithm called, Data Mapping and Weight Mapping for the Range-sum approximation problem that, to the best of our knowledge, represent a key research milestone as being the first linear algorithm for arbitrary weights. For both scenarios, we focus not only on the generation of the data synopsis but also on their updates over time. The efficiency of our new data synopses is validated against other linear methods by using both synthetic and real data sets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Solving infinite horizon optimal control problems of nonlinear interconnected large-scale dynamic systems via a Haar wavelet collocation scheme

We consider an approximation scheme using Haar wavelets for solving a class of infinite horizon optimal control problems (OCP's) of nonlinear interconnected large-scale dynamic systems. A computational method based on Haar wavelets in the time-domain is proposed for solving the optimal control problem. Haar wavelets integral operational matrix and direct collocation method are utilized to find ...

متن کامل

APPROXIMATION SOLUTION OF TWO-DIMENSIONAL LINEAR STOCHASTIC FREDHOLM INTEGRAL EQUATION BY APPLYING THE HAAR WAVELET

In this paper, we introduce an efficient method based on Haar wavelet to approximate a solutionfor the two-dimensional linear stochastic Fredholm integral equation. We also give an example to demonstrate the accuracy of the method.  

متن کامل

A Fast Approximation Scheme for Probabilistic Wavelet Synopses

Several studies have demonstrated the effectiveness of Haar wavelets in reducing large amounts of data down to compact wavelet synopses that can be used to obtain fast, accurate approximate query answers. While Haar wavelets were originally designed for minimizing the overall root-mean-squared (i.e., L2-norm) error in the data approximation, the recently-proposed idea of probabilistic wavelet s...

متن کامل

BFT: A Relational-based Bit Filtration Technique for Efficient Approximate String Joins in Biological Databases

Joining massive tables in relational databases have received substantial attention in the past decade. Numerous filtration and indexing techniques have been proposed to reduce the curse of dimensionality. This paper proposes a novel approach to map the problem of pairwise whole genome comparison into an approximate join operation in the wellestablished relational database context. We propose a ...

متن کامل

Modified Wavelet Method for Solving Two-dimensional Coupled System of Evolution Equations

As two-dimensional coupled system of nonlinear partial differential equations does not give enough smooth solutions, when approximated by linear, quadratic and cubic polynomials and gives poor convergence or no convergence. In such cases, approximation by zero degree polynomials like Haar wavelets (continuous functions with finite jumps) are most suitable and reliable. Therefore, modified numer...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JCP

دوره 2  شماره 

صفحات  -

تاریخ انتشار 2007