Data mining and operational research: techniques and applications

نویسندگان

  • Kweku-Muata Osei-Bryson
  • Victor J. Rayward-Smith
چکیده

Data mining (DM) involves the use of a suite of techniques that aim to induce from data, models that meet particular objectives. DM algorithms are built on a range of techniques, including information theory, statistics, linear and non-linear models, AI, meta-heuristics. Within the context of data analysis methods, data mining can be considered to be an exploratory knowledge discovery approach to be contrasted with a confirmatory approach in which a hypothesis is specified and the validity of the hypothesis is tested against the data. DM models can be designed to find new useful patterns within the data that can be exploited. For example, an insurance company will already be aware that young men in fast cars pose a high risk (as do older women). Such a pattern in the data will be of no interest to them. However, they may be interested if they were to find that young men in fast, classic cars are much less of a risk. They could then develop pricing and marketing strategies to exploit this newly discovered pattern. In the context of contemporary organizations, the major motivating factors for the interest in, and application of, data mining techniques can be categorized as: changed business environment (eg pressure on traditional marketing techniques , shorter time to market, shorter product life cycles, and increased competition and business risks), drivers (eg customer, competition, and data asset), and enablers (eg data flood in organizations, new information technology solutions in data mining, data warehousing, and hardware). Many organizations routinely collect large amounts of data on their clients, employees, and suppliers. Factories collect data on their machinery and productivity. Hospitals collect data on patients, doctors, medications, and procedures. These data are often untidy, incomplete, and sometimes erroneous and yet, if used properly, it can be a valuable asset for management and effective organizations can find ways to exploit it. Predicting fraudulent behaviour in a customer database, for example, will generally require highly accurate models even if they can only be applied to a small proportion of the database. On the other hand, if the organization is planning a targeted marketing campaign and wishes to predict which customers might buy a certain product, the accuracy of the model can be much lower. Both operational research (OR) and DM are aimed at the development and application of models that can improve the quality of organizational decision-making process. While OR can be considered …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Credit scoring in banks and financial institutions via data mining techniques: A literature review

This paper presents a comprehensive review of the works done, during the 2000–2012, in the application of data mining techniques in Credit scoring. Yet there isn’t any literature in the field of data mining applications in credit scoring. Using a novel research approach, this paper investigates academic and systematic literature review and includes all of the journals in the Science direct onli...

متن کامل

ارائه مدلی برای استخراج اطلاعات از مستندات متنی، مبتنی بر متن‌کاوی در حوزه یادگیری الکترونیکی

As computer networks become the backbones of science and economy, enormous quantities documents become available. So, for extracting useful information from textual data, text mining techniques have been used. Text Mining has become an important research area that discoveries unknown information, facts or new hypotheses by automatically extracting information from different written documents. T...

متن کامل

A Geometric View of Similarity Measures in Data Mining

The main objective of data mining is to acquire information from a set of data for prospect applications using a measure. The concerning issue is that one often has to deal with large scale data. Several dimensionality reduction techniques like various feature extraction methods have been developed to resolve the issue. However, the geometric view of the applied measure, as an additional consid...

متن کامل

A Systematic Review of Data Mining Applications in Digital Libraries

Purpose: Study aimed to identify the applications of data mining in the provision of services, collection and management of digital libraries. Methodology: This is an applied study in terms of purpose and in terms of method is qualitative research that have been done by systematic review method. For this purpose, articles have been obtained by searching databases of Springer, Emerald, ProQuest,...

متن کامل

Synergies between operations research and data mining: The emerging use of multi-objective approaches

Operations research and data mining already have a long-established common history. Indeed, with the growing size of databases and the amount of data available, data mining has become crucial in modern science and industry. Data mining problems raise interesting challenges for several research domains, and in particular for operations research, as very large search spaces of solutions need to b...

متن کامل

Implementation of Data Mining Techniques for Meteorological Applications

The CrossGrid project is one of the ongoing research projects involving GRID technology. One of the main tasks in the Meteorological applications package is the implementation of data mining systems for the analysis of operational and reanalysis databases of atmospheric circulation patterns. Previous parallel data mining algorithms reported in the literature focus on parallel computers with pre...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JORS

دوره 60  شماره 

صفحات  -

تاریخ انتشار 2009