Describing the Data Mining Process with DMSL

نویسندگان

  • Petr Kotásek
  • Jaroslav Zendulka
چکیده

The state of the art in the domain of knowledge discovery in databases (KDD) and data mining (DM) has reached the point where the existence of various languages is becoming highly desirable. This paper presents an XML-based language called DMSL (Data Mining Specification Language). Its purpose is to provide the framework for platform-independent definition of the whole data mining process, and exchange and sharing of DM projects among different applications, possibly operating in heterogeneous environments. We assume that the reader is familiar with the notions of XML, knowledge discovery in databases, and data mining.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Proposed Data Mining Methodology and its Application to Industrial Procedures

Data mining is the process of discovering correlations, patterns, trends or relationships by searching through a large amount of data stored in repositories, corporate databases, and data warehouses. Industrial procedures with the help of engineers, managers, and other specialists, comprise a broad field and have many tools and techniques in their problem-solving arsenal. The purpose of this st...

متن کامل

بررسی میزان تأثیر داروهای درمان ناباروری در بیماران نابارور با استفاده از الگوریتم خوشه بندی و تکنیک های داده کاوی

Background and purpose: The rate of infertility has increased throughout the world. Data mining is a new method for analyzing information from databases. Few studies are done regarding infertility and using data mining in describing and predicting different treatment methods and factors influencing these methods. This paper proposes a model for evaluating the efficacy of different drugs in trea...

متن کامل

Mapping Dependence

We describe DMSL, a domain specific language for defining schema mappings. Schema mappings are assertions in carefully crafted logics that express constraints between data represented in different formats, including XML and relational schema. DMSL is suitable for representing programs over mappings, which, for instance, occur in dataflow graphs of mappings. DMSL programs of mapping type are sta...

متن کامل

Opinion Mining, Social Networks, Higher Education

Background and Aim: With the advent of technology and the use of social networks such as Instagram, Facebook, blogs, forums, and many other platforms, interactions of learners with one another and their lecturers have become progressively relaxed. This has led to the accumulation of large quantities of data and information about studentschr('39') attitudes, learning experiences, opinions, and f...

متن کامل

Perform Three Data Mining Tasks with Crowdsourcing Process

For data mining studies, because of the complexity of doing feature selection process in tasks by hand, we need to send some of labeling to the workers with crowdsourcing activities. The process of outsourcing data mining tasks to users is often handled by software systems without enough knowledge of the age or geography of the users' residence. Uncertainty about the performance of virtual user...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002