Automating Physical Database Design: An Extensible Approach
Abstract
In a high-level query language such as SQL, queries yield the same result no matter how the logical schema is physically implemented. Nevertheless, a query's cost can vary by orders of magnitude among different physical implementations of the same logical schema, even with the most modern query optimizers. Therefore, designing a low-cost physical implementation is an important pragmatic problem, one that requires a sophisticated understanding of physical design options and query strategies, and that involves estimating query costs, a tedious and error-prone process when done manually.

We have devised a simple framework for automating physical design in relational or post-relational DBMSs and in database programming languages. Within this framework, design options are uniformly represented as "features", and designs are represented by "conflict"-free sets of features. (Mutually exclusive features conflict. An example would be two primary indexes on the same table.) The uniform representation of design options as features accommodates a greater variety of design options than previous approaches; adding a new design option (e.g. a new index type) merely entails characterizing it as a feature with appropriate parameters.

We propose an approximation algorithm, based on this framework, that finds low-cost physical designs. In an initial phase, the algorithm examines the logical schema, data statistics, and queries, and generates "useful features" (features that might reduce query costs). In a subsequent phase, the algorithm uses the DBMS's cost estimates to find "best features" (features that belong to the lowest-cost designs for each individual query). Finally, the algorithm searches among conflict-free subsets of the best features of all the queries to find organizations with low global cost estimates. We have implemented a prototype physical design assistant for the INGRES relational DBMS, and we evaluate its designs for several benchmarks, including AS3AP.
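The feature framework and the phases described in the abstract can be illustrated with a small Python sketch. Everything in it is an assumption made for illustration: the `Feature` fields, the index kinds, the `toy_cost` function (a stand-in for the DBMS's own cost estimates), and the exhaustive enumeration of conflict-free subsets, which replaces the approximation search of the thesis and is only practical for toy inputs. The "useful features" are simply given here rather than generated from a schema and statistics.

```python
from dataclasses import dataclass
from itertools import combinations


@dataclass(frozen=True)
class Feature:
    """One design option (hypothetical fields, for illustration only)."""
    table: str
    kind: str    # "primary_index" or "secondary_index" in this toy example
    column: str


def conflicts(a, b):
    """Mutually exclusive features conflict: two primary indexes on one table."""
    return a != b and a.table == b.table and a.kind == b.kind == "primary_index"


def designs(features):
    """Yield every conflict-free subset of `features`, smallest first."""
    ordered = sorted(features, key=lambda f: (f.table, f.kind, f.column))
    for size in range(len(ordered) + 1):
        for subset in combinations(ordered, size):
            if not any(conflicts(a, b) for a, b in combinations(subset, 2)):
                yield frozenset(subset)


def best_features(query, useful, cost):
    """Features of the cheapest conflict-free design for a single query."""
    return min(designs(useful), key=lambda d: cost(query, d))


def choose_design(queries, useful, cost):
    """Pool each query's best features, then pick the cheapest global design."""
    candidates = set()
    for q in queries:
        candidates |= best_features(q, useful, cost)
    return min(designs(candidates), key=lambda d: sum(cost(q, d) for q in queries))


def toy_cost(query, design):
    """Assumed cost model: a lookup on (table, column) with a given scan cost."""
    table, column, scan_cost = query
    if Feature(table, "primary_index", column) in design:
        return 1
    if Feature(table, "secondary_index", column) in design:
        return 10
    return scan_cost


# Hypothetical workload: (table, column, cost of a full scan).
queries = [("emp", "dept", 100), ("emp", "name", 50), ("dept", "id", 20)]
useful = {
    Feature("emp", "primary_index", "dept"),
    Feature("emp", "primary_index", "name"),
    Feature("emp", "secondary_index", "name"),
    Feature("dept", "primary_index", "id"),
}
chosen = choose_design(queries, useful, toy_cost)
total = sum(toy_cost(q, chosen) for q in queries)
```

In this toy run the two primary indexes on `emp` conflict, so the global search keeps only the one that saves more. The secondary index on `emp.name` never enters the candidate pool because it is not a best feature for any single query, which shows why searching only among per-query best features yields an approximation rather than a guaranteed optimum.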
Our experiments with the prototype show that it can produce good designs, and that the critical factor limiting their quality is the accuracy of query cost estimates. The prototype implementation isolates dependencies on INGRES, permitting our framework to produce design assistants for a wide range of relational, nested-relational, and object-oriented DBMSs.

To my parents, Barbara L. Rozen and Jerome G. Rozen, Jr., and to my daughter, Kate.
Related resources
A Framework for Automating Physical Database Design
We propose a two-phase algorithm for physical database design. In phase one the algorithm, for each logical query, uses rules to determine characteristics of a physical design (such as indexes) that would be beneficial to the query, and selects a physical design that yields a low cost estimate for that query. In phase two we use a notion of compromise between physical database designs. Start...
Autonomous Selection of Indexes and Materialized Views in Two Commercial Database Management Systems
The selection of the most beneficial indexes and materialized views (MVs) is a well-known issue in physical database design. The chosen set of indexes and MVs is crucial to the overall performance of the DBMS as it relates to specific workloads. Since this is such an important aspect of physical database design, database administrators (DBAs) spend a great amount of time selecting the best inde...
AutoAdmin: Self-Tuning Database Systems Technology
The AutoAdmin research project was launched in the Fall of 1996 in Microsoft Research with the goal of making database systems significantly more self-tuning. Initially, we focused on automating the physical design for relational databases. Our research effort led to successful incorporation of our tuning technology in Microsoft SQL Server and was subsequently also followed by similar functiona...
Datplex: an Extensible Design-aid Tool for Physical Database Design
In this paper, we describe DATPLEX, an extensible design-aid tool for physical database design. DATPLEX adopts a layered approach where each layer improves over its input design; and is "extensible" in the sense that an additional layer can be "plugged" into the tool, or an existing layer can be removed from the tool. The current implementation comprises three layers that improve the logical s...
Publication date: 1993