Efficient Mining of Frequent Item Sets in Heterogeneous Data

نویسنده

  • Igor Tatarinov
چکیده

Association rule mining has recently become a popular area of research. The most expensive step of discovering association rules is to find so-called frequent item sets. The focus of this paper is efficient mining of frequent item sets when the input data contains categorical and quantitative attributes. We propose a new Apriori-like algorithm to solve this problem. The new algorithm, that we have called Gradual Apriori, generates about 30% less candidates than the traditional algorithm. More importantly, the new algorithm has much (several times) smaller memory requirements. The main disadvantage of Gradual Apriori is an increased number of iterations that needs to be performed. However, in many cases the new algorithm should still result in better overall performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Approach for finding Frequent Item Sets with Hybrid Strategies

Frequent item sets mining plays an important role in association rules mining. Over the years, a variety of algorithms for finding frequent item sets in very large transaction databases have been developed. Therefore, a number of methods have been proposed recently to discover approximate frequent item sets. This paper proposes an efficient SMine (Sorted Mine) Algorithm for finding frequent ite...

متن کامل

An efficient hash based algorithm for mining closed frequent item sets

Association rule discovery has emerged as an important problem in knowledge discovery and data mining. The association mining task consists of identifying the frequent item sets, and then forming conditional implication rules among them. Efficient algorithms to discover frequent patterns are crucial in data mining research. Finding frequent item sets is computationally the most expensive step i...

متن کامل

Efficient Graph Structure for the Mining of Frequent Itemsets from Data Streams

In this paper, we propose a graph structure which captures important data streams. This graph can be easily maintained and mined for frequent item sets as well as various other patterns like constrained item sets. This graph captures the contents of transaction in a window and arranges nodes according to some canonical order that is unaffected by changes in item frequency. This graph structure ...

متن کامل

An Efficient Algorithm for Mining Fuzzy Temporal Data

Mining patterns from fuzzy temporal data is an important data mining problem. One of these mining task is to find locally frequent sets, In most of the earlier works fuzziness was considered in the time attribute of the datasets .Although a couple of works have been done in dealing with such data, little has been done on the implementation side. In this article, we propose an efficient implemen...

متن کامل

An Efficient Data Mining Technique for Generating Frequent Item sets

Frequent item generation is a key approach in association rule mining. The Data mining is the process of generating frequent itemsets that satisfy minimum support. Efficient algorithms to mine frequent patterns are crucial in data mining. Since the Apriori algorithm was proposed to generate the frequent item sets, there have been several methods proposed to improve its performance. But they do ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007