JTop Algorithms for Top-k Join Queries

نویسندگان

Reza Akbarinia

Ihab F. Ilyas

M. Tamer Özsu

Patrick Valduriez

چکیده

Top-k join queries have become very important in many important areas of computing. One of the most efficient algorithms for top-k join queries is the Rank-Join algorithm [17] [18]. However, there are many cases where Rank-Join does much unnecessary access to the input data sources. In this report, we first show that there are many cases where Rank-Join's stopping mechanism is not efficient, and it does much unnecessary accesses to the input data sources. Then, we propose JTop, a family of much more efficient algorithms for top-k queries. We prove that our algorithms always perform less work than Rank-Join, and thus are more efficient. We also show that the performance of our algorithms can be O(n) times better than that of Rank-Join where n is the number of data items in the database. We evaluated the performance of our algorithms through experimentation over databases with different distributions. The results show that over the tested databases our algorithms significantly outperform Rank-Join.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Processing Top-k Join Queries

We consider the problem of efficiently finding the top-k answers for join queries over web-accessible databases. Classical algorithms for finding top-k answers use branch-and-bound techniques to avoid computing scores of all candidates in identifying the top-k answers. To be able to apply such techniques, it is critical to efficiently compute (lower and upper) bounds and expected scores of cand...

متن کامل

CPR : Complex Pattern Ranking for Evaluating Top - k Pattern Queries over Event Streams

Most existing approaches to complex event processing over streaming data rely on the assumption that the matches to the queries are rare and that the goal of the system is to identify these few matches within the incoming deluge of data. In many applications, such as stock market analysis and user credit card purchase pattern monitoring, however the matches to the user queries are in fact plent...

متن کامل

The RankGroup Join Algorithm: Top-k Query Processing in XML Datasets

This project investigates top-k queries in XML datasets. We propose a syntactical addition to XQuery to accommodate top-k XML queries. We then propose a 3-step process to realize these top-k XML queries using a relational database and a new join operator, RankGroup. Our preliminary implementation shows promise in dramatically reducing the running time and number of tuples accessed during such q...

متن کامل

Sum-Max Monotonic Ranked Joins for Evaluating Top-K Twig Queries on Weighted Data Graphs

In many applications, the underlying data (the web, an XML document, or a relational database) can be seen as a graph. These graphs may be enriched with weights, associated with the nodes and edges of the graph, denoting application specific desirability/penalty assessments, such as popularity, trust, or cost. A particular challenge when considering such weights in query processing is that resu...

متن کامل

Optimizing Multiple Top-K Queries over Joins

Advanced Data Mining applications require more and more support from relational database engines. Especially clustering applications in high dimensional features space demand a proper support of multiple Top-k queries in order to perform projected clustering. Although some research tackles to problem of optimizing restricted ranking (top-k) queries, there is no solution considering more than on...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2008

JTop Algorithms for Top-k Join Queries

نویسندگان

چکیده

منابع مشابه

Processing Top-k Join Queries

CPR : Complex Pattern Ranking for Evaluating Top - k Pattern Queries over Event Streams

The RankGroup Join Algorithm: Top-k Query Processing in XML Datasets

Sum-Max Monotonic Ranked Joins for Evaluating Top-K Twig Queries on Weighted Data Graphs

Optimizing Multiple Top-K Queries over Joins

عنوان ژورنال:

اشتراک گذاری