Does Relevance Matter to Data Mining Research?
Abstract
Data mining (DM) and knowledge discovery are intelligent tools that help to accumulate, process, and make use of data. We review several existing frameworks for DM research that originate from different paradigms. These frameworks mainly address the various DM algorithms used in the different steps of the DM process. Recent research has shown that many real-world problems require the integration of several DM algorithms from different paradigms to produce a better solution, which raises the importance of practice-oriented aspects in DM research as well. In this paper we strongly emphasize that DM research should take into account the relevance of research, not only its rigor. By the relevance of research we mean, in general, how good the research is in terms of the utility of its results. This chapter motivates the development of a new framework for DM research that explicitly includes the concept of relevance. We introduce the basic idea behind such a framework and propose one sketch of it, based on results achieved in the information systems area, which has some tradition of addressing the relevance aspects of research.
Related resources
Why Data Mining Research Does Not Contribute to Business?
Data mining (DM) and knowledge discovery are intelligent tools that help to accumulate, process, and make use of data. Nowadays many DM algorithms have been developed and implemented and are available for direct use or for integration into a specific solution. There also exist a number of DM systems that provide DM tools for all steps of the DM process. This paper is aimed at provoking the discussio...
A Data Mining Framework for Relevance Feature Discovery
Automatic discovery of relevance features in real-world data for describing user information needs or preferences is a new challenge in the data mining community. For many years, several research efforts in information retrieval (IR) and information filtering (IF) have attempted to address this difficult issue using term-based and phrase-based approaches. However, many experiments do not support th...
Constraint-Based Pattern Discovery
Devising fast and scalable algorithms able to crunch huge amounts of data was for many years one of the main goals of data mining research. But then we realized that this was not enough. No matter how efficient such algorithms can be, the results we obtain are often of limited use in practice. Typically, the knowledge we seek is in a small pool of local patterns hidden within an ocean...
A Review on Feature Selection Methods for High Dimensional Data
Feature selection has become an important task for the effective application of data mining techniques in real-world high dimensional datasets. It is a process that selects a subset of the original features by removing irrelevant and redundant features on the basis of evaluation criteria, without loss of information content. A feature selection method helps to reduce the computational complexity of learn...
Query expansion based on relevance feedback and latent semantic analysis
Web search engines are among the most popular tools on the Internet and are widely used by expert and novice users alike. Constructing an adequate query that best specifies the user's information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...
Publication date: 2008