Extending UML for Modeling Data Mining Projects (DM-UML)

نویسندگان

  • Óscar Marbán
  • Javier Segovia
چکیده

Existing Data Mining process models propose one way or another of developing projects in a structured manner, trying to reduce their complexity through effective project management. It is well-known in any engineering environment that one of the management tasks that helps to reduce project problems is systematic project documentation, but few of the existing Data Mining processes propose their documentation. Furthermore, these few remark the need of producing documentation at each phase as an input for the next, but they don’t show how to do it. On the other hand, in the literature there are examples of UML extensions for data mining projects, but they always focus on the model implementation side and fail to take into account the remainder of the process. In this paper, we present an extension of the UML modeling language for data mining projects (DM-UML) covering all the documentation needs for a project conforming to a standard process, namely CRISP-DM, ranging from business understanding to deployment. We also show an example of a real application of the proposed DM-UML modeling. The result of this approach is that, besides the advantages of having an standardized way of producing the documentation, it clearly constitutes a very useful and transparent tool for modeling and connecting the business understanding or modeling phase with the remainder of the project right through to deployment, as well as a way of facilitating the communication with the nontechnical stakeholders involved in the project, problems which have always been an open question in data mining. *Corresponding author: Javier Segovia, Informa’tica faculty, Polytechnic University of Madrid, Montegancedo Campus s / n. 28660 Boadilla del Monte (Madrid) Spain, E-mail: [email protected] Received July 03, 2013; Accepted September 16, 2013; Published September 30, 2013 Citation: Marbán Ó, Segovia J (2013) Extending UML for Modeling Data Mining Projects (DM-UML). J Inform Tech Softw Eng 3: 121. doi:10.4172/21657866.1000121 Copyright: © 2013 Marbán Ó, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Formal Description of the UML Architecture and Extensibility

Since its emergence in 1995, the Unified Modeling Language (UML) has become part of the mainstream of object-oriented software development in a wide range of applications. This paper presents a formal description of UML technologies for visualized specification and modeling of software systems, and analyzes the usability of UML views and diagrams. Requirements and extension of UML capability to...

متن کامل

UML Usage in Open Source Software Development : A Field Study

UML is the de facto standard for modeling software designs and is commonly used in commercial software development. However, little is known about the use of UML in Open-source Software Development. This paper evaluates the usage of UML modeling in ten opensource projects selected from common open-source repositories. We evaluated the usage of UML diagrams based on the information available in ...

متن کامل

Extending UML 2 Activity Diagrams with Business Intelligence Objects

Data Warehouse (DWH) information is accessed by business processes. Today, no conceptual models exist that make the relationship between the DWH and the business processes transparent. In this paper, we extend a business process modeling diagram, namely the UML 2 activity diagram with a UML profile, which allows to make this relationship explicit. The model is tested with example business proce...

متن کامل

Gaussian Process Models in Spatial Data Mining

ion of GeoDatabases Geographic Database Conceptual Modeling Modeling with a UML Profile Geographic Databases Spatio-temporal Database Modeling with an Extended Entity-Relationship Model Geographic Dynamics, Visualization and Modeling MAY YUAN Department of Geography and Center for Spatial Analysis, University of Oklahoma, Norman, OK, USA

متن کامل

Business Process Modeling with EPC and UML: Transformation or Integration?

Process and object-orientation are basic concepts of modeling, implementing and customizing information systems. In this paper we present two approaches of combining those concepts into a coherent way. In the first approach we discuss how to transform business process models (Event-driven Process Chain (EPC) diagrams) into object-oriented models (Unified Modeling Language (UML) diagrams). The m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013