Efficient Mining of Frequent Item Sets in Heterogeneous Data
Abstract
Association rule mining has recently become a popular area of research. The most expensive step in discovering association rules is finding the so-called frequent item sets. This paper focuses on efficient mining of frequent item sets when the input data contains both categorical and quantitative attributes. We propose a new Apriori-like algorithm to solve this problem. The new algorithm, which we call Gradual Apriori, generates about 30% fewer candidates than the traditional algorithm. More importantly, its memory requirements are several times smaller. The main disadvantage of Gradual Apriori is the increased number of iterations that must be performed. However, in many cases the new algorithm should still yield better overall performance.
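For orientation, the traditional Apriori baseline that the abstract compares against can be sketched as follows. This is a minimal illustration of the classic level-wise algorithm, not the Gradual Apriori method itself, whose details are not given here. The function name `apriori` and the absolute-count parameter `min_support` are assumptions made for this sketch. Items are arbitrary hashable values; a categorical or discretized quantitative attribute could be encoded as an (attribute, value) pair, which is also an assumption rather than the paper's stated encoding.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Classic level-wise Apriori (illustrative sketch only).

    transactions: iterable of iterables of hashable items.
    min_support:  minimum absolute number of supporting transactions.
    Returns a dict mapping each frequent item set (frozenset) to its count.
    """
    transactions = [frozenset(t) for t in transactions]

    # Level 1: count every single item.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    frequent = {s: c for s, c in counts.items() if c >= min_support}
    result = dict(frequent)

    k = 2
    while frequent:
        # Join step: unite pairs of frequent (k-1)-sets that form a k-set.
        candidates = set()
        for a, b in combinations(list(frequent), 2):
            union = a | b
            if len(union) != k:
                continue
            # Prune step: every (k-1)-subset of a candidate must be frequent.
            if all(frozenset(sub) in frequent
                   for sub in combinations(union, k - 1)):
                candidates.add(union)

        # Support counting: one pass over the data per level.
        counts = {c: 0 for c in candidates}
        for t in transactions:
            for c in candidates:
                if c <= t:
                    counts[c] += 1

        frequent = {s: c for s, c in counts.items() if c >= min_support}
        result.update(frequent)
        k += 1

    return result
```

Calling `apriori(transactions, min_support=3)` returns every item set contained in at least three transactions. The candidate set built at each level is what dominates memory in this baseline, which is the cost the abstract says Gradual Apriori reduces.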
Similar resources
A Novel Approach for finding Frequent Item Sets with Hybrid Strategies
Frequent item set mining plays an important role in association rule mining. Over the years, a variety of algorithms for finding frequent item sets in very large transaction databases have been developed. Therefore, a number of methods have been proposed recently to discover approximate frequent item sets. This paper proposes an efficient SMine (Sorted Mine) Algorithm for finding frequent ite...
An efficient hash based algorithm for mining closed frequent item sets
Association rule discovery has emerged as an important problem in knowledge discovery and data mining. The association mining task consists of identifying the frequent item sets, and then forming conditional implication rules among them. Efficient algorithms to discover frequent patterns are crucial in data mining research. Finding frequent item sets is computationally the most expensive step i...
Efficient Graph Structure for the Mining of Frequent Itemsets from Data Streams
In this paper, we propose a graph structure which captures important data streams. This graph can be easily maintained and mined for frequent item sets as well as various other patterns, such as constrained item sets. The graph captures the contents of the transactions in a window and arranges nodes according to some canonical order that is unaffected by changes in item frequency. This graph structure ...
An Efficient Algorithm for Mining Fuzzy Temporal Data
Mining patterns from fuzzy temporal data is an important data mining problem. One of these mining tasks is to find locally frequent sets. In most of the earlier works, fuzziness was considered in the time attribute of the datasets. Although a couple of works have dealt with such data, little has been done on the implementation side. In this article, we propose an efficient implemen...
An Efficient Data Mining Technique for Generating Frequent Item sets
Frequent item generation is a key approach in association rule mining. Data mining is the process of generating frequent itemsets that satisfy minimum support. Efficient algorithms to mine frequent patterns are crucial in data mining. Since the Apriori algorithm was proposed to generate frequent item sets, several methods have been proposed to improve its performance. But they do ...
Publication date: 2007