Concentric Hyperspaces and Disk Allocation for Fast Parallel Range Searching
نویسندگان
چکیده
Data partitioning and declustering have been extensively used in the past to parallelize I/O for range queries. Numerous declustering and disk allocation techniques have been proposed in the literature. However, most of these techniques were primarily designed for two-dimensional data and for balanced partitioning of the data space. As databases increasingly integrate multimedia information in the form of image, video, and audio data, it is necessary to extend the declustering techniques for multidimensional data. In this paper, we first establish that traditional declustering techniques do not scale for high-dimensional data. We then propose several new partitioning schemes based on concentric hyperspaces. We then develop disk allocation methods for each of the proposed schemes. We conclude with an evaluation of range queries based on these schemes and show that partitioning based on concentric hyperspaces has a significant advantage over balanced partitioning approach for parallel I/O.
منابع مشابه
Circular Data-space Partitioning for Similarity Queries and Parallel Disk Allocation
In a multiple disk environment it is desirable to have techniques for efficient parallel execution of similarity queries. Usually many buckets that may have the query result are needed to be retrieved from secondary storage, which is a costly operation. To achieve efficiency, there are two major factors that need to be considered. These are the number of buckets retrieved by the query, and the ...
متن کاملcient Disk Allocation for Fast Similarity Searching
As databases increasingly integrate non-textual information it is becoming necessary to support eecient similarity searching in addition to range searching. Recently, declustering techniques have been proposed for improving the performance of similarity searches through parallel I/O. In this paper, we propose a new scheme which provides good declus-tering for similarity searching. In particular...
متن کاملDisk Allocation Methods for Parallelizing Grid Files
The grid file [1] is a well known access method for multi-dimensional and spatial data. The response time needed to process path and range queries on the grid file access method can be improved significantly by distributing the data pages over multiple disks. This paper explores the disk allocation methods used to allocate the data pages of grid file among a set of disks, which can be accessed ...
متن کاملEvaluation of Disk Allocation Methods for Parallelizing Spatial Queries on Grid Files‡
Spatial Database Systems are characterized by large amounts of geometric and geographic data. Query response times in these systems are crucial, since these systems are often used interactively for decision support systems. The Grid file[1] is a well-known spatial access method that has great potential for parallelism, which reduces the response time of spatial queries for time-critical on-line...
متن کاملPerfect Allocation Methods for Spatial Queries in Parallel Disk Systems
A disk-allocation method assigns a disk-id to each unit of spatial data. Allocating spatial data over multiple disks to distribute the I/O cost of query processing uniformly over available disks can tremendously speed up the processing. An allocation method is called perfect for a query set if it balances the I/O load on each disk in processing any query in a query set. Some of the interesting ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999