Knowledge Driven Query Sharding

نویسندگان

  • Adam Krasuski
  • Marcin S. Szczuka
چکیده

We present the idea of an approach to database query sharding that makes use of knowledge about data structure and purpose. It is based on a case study for a database system that contains information about documents. By making use of knowledge about the data structure and the specific top-k queries to be processed we demonstrate a method for avoiding costly and unnecessary steps in query answering. We also demonstrate how the knowledge of data structure may be used to perform sharding and how such sharding may improve performance. We propose generalization of our findings that could lead to self-optimization and self-tuning in RDBMS engines, especially for column-based solutions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding efficient Shard Keys with a learning process on query logs in Database Sharding

Nowadays industry has been experiencing a dramatic data growth. Not only this data has to be processed properly, but also it must be stored somewhere with smart strategy to be able to write and read with highest possible speed. Over the past decades, vendors have been motivated to migrate their brown-field database solutions to a distributed version through Partitioning/Sharding concepts. (In)a...

متن کامل

Skyler: Dynamic, Workload-Aware Data Sharding across Multiple Data Centres

Popular online services such as Facebook and Twitter use multiple data centers (DCs) to service globally distributed user requests with the lowest possible latency. Existing distributed data stores either fully replicate or statically shard data across DCs. Full replication limits scalability and static sharding fixes data to a single sharding policy, unable to adapt with user and popularity sh...

متن کامل

Database Sharding: To Provide Fault Tolerance and Scalability of Big Data on the Cloud

In this paper, the authors present an architecture and implementation of a distributed database system using sharding to provide high availability, fault-tolerance, and scalability of large databases in the cloud. Sharding, or horizontal partitioning, is used to disperse the data among the data nodes located on commodity servers for effective management of big data on the cloud. Database Shardi...

متن کامل

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...

متن کامل

Distributed Database: Concepts and Applications

The author wants to submit the endeavor about the Distributed Database storage concepts and usefulness in large enterprises. One of the most useful storage technique for Distributed Database is sharding and the concept of database sharding has gained popularity over the past several years due to the enormous growth in transaction volume and size of business-application databases and database se...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012