Case Mining from Large Databases
نویسندگان
چکیده
This paper presents an approach of case mining to automatically discover case bases from large datasets in order to improve both the speed and the quality of case based reasoning. Case mining constructs a case base from a large raw dataset with an objective to improve the case-base reasoning systems’ efficiency and quality. Our approach starts from a raw database of objects with class attributes together with a historical database of past action sequences on these objects. The object databases can be customer records and the historical action logs can be the technical advises given to the customers to solve their problems. Our goal is to discover effective and highly representative problem descriptions associated with solution plans that accomplish their tasks. To maintain efficiency of computation, data mining methods are employed in the process of composing the case base. We motivate the application of the case mining model using a financial application example, and demonstrate the effectiveness of the model using both real and simulated datasets.
منابع مشابه
Numeric Multi-Objective Rule Mining Using Simulated Annealing Algorithm
Abstract as a single objective one. Measures like support, confidence and other interestingness criteria which are used for evaluating a rule, can be thought of as different objectives of association rule mining problem. Support count is the number of records, which satisfies all the conditions that exist in the rule. This objective represents the accuracy of the rules extracted from the da...
متن کاملDecomposition in Data Mining: An Industrial Case Study
Data mining offers tools for discovery of relationships, patterns, and knowledge in large databases. The knowledge extraction process is computationally complex and therefore a subset of all data is normally considered for mining. In this paper, numerous methods for decomposition of data sets are discussed. Decomposition enhances the quality of knowledge extracted from large databases by simpli...
متن کاملMining Multiple Large Data Sources
Effective data analysis using multiple databases requires highly accurate patterns. Local pattern analysis might extract low quality patterns from multiple large databases. Thus, it is necessary to improve mining multiple databases using local pattern analysis. We present existing specialized as well as generalized techniques for mining multiple large databases. We formalize the idea of multi-d...
متن کاملTemporal Databases and Frequent Pattern Mining Techniques
Data mining is the process of exploring and analyzing data from different perspective, using automatic or semiautomatic techniques to extract knowledge or useful information and discover correlations or meaningful patterns and rules from large databases. One of the most vital characteristic missed by the traditional data mining systems is their capability to record and process time-varying aspe...
متن کاملGeometric clustering models for multimedia databases
Recently, in the elds of information retrieval, Data Mining, or Knowledge Discovery in Databases (KDD), is intensively studied to extract implicit useful information from large amount of data. One of the important objectives of KDD is to obtain generalizations by grouping similar objects via clustering. In the case of multi-media databases such as full text database and image database, geometri...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003