Schema Mapping Using Hybrid Ripple-Down Rules

Authors

  • Sarawat Anam
  • Yang Sok Kim
  • Byeong Ho Kang
  • Qing Liu
Abstract

Schema mapping is essential for managing schema heterogeneity among different sources. It can be conducted with machine learning algorithms or with knowledge engineering approaches, and each has advantages and disadvantages. Machine learning approaches learn their models from data, but the resulting models are static and cannot be modified to reflect changes in the domain data. Conversely, knowledge engineering approaches require domain experts, but their knowledge bases can be modified to reflect such changes. To exploit the advantages of both approaches and reduce their limitations, we propose a hybrid approach, called Hybrid-RDR, which combines a machine learning algorithm with ripple-down rules (RDR), an incremental knowledge engineering approach. A model is first constructed by a decision tree algorithm and is then extended by adding rules incrementally. The approach achieves higher precision, recall and F-measure than the machine learning algorithm alone. It also significantly reduces the effort of manually creating rules to classify related schemas one by one, and when the decision tree gives wrong classifications because the schema data has changed over time, the knowledge base can be modified by adding rules without building the model again.
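
The combination described in the abstract, a learned model whose wrong classifications are corrected by incrementally added rules, can be illustrated with a minimal sketch. This is not the authors' implementation: the feature names (`name_sim`, `type_sim`), the thresholds, and the `base_model` function standing in for a trained decision tree are all assumptions made for illustration, and the rule structure is a simplified form of ripple-down rules in which the deepest firing rule decides the label.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

Case = Dict[str, float]  # hypothetical features of a pair of schema elements


@dataclass
class Rule:
    """One ripple-down rule: a condition, a conclusion, and nested exceptions."""
    condition: Callable[[Case], bool]
    conclusion: str
    exceptions: List["Rule"] = field(default_factory=list)


def base_model(case: Case) -> str:
    # Stand-in for a trained decision tree (assumption: a single threshold
    # on a hypothetical name-similarity feature).
    return "match" if case["name_sim"] >= 0.8 else "non-match"


@dataclass
class HybridClassifier:
    base: Callable[[Case], str]
    rules: List[Rule] = field(default_factory=list)

    def classify(self, case: Case) -> str:
        # Start from the base model's label, then let rules override it.
        label = self.base(case)
        candidates = self.rules
        # At each level follow the first rule that fires; the deepest
        # firing rule supplies the final label.
        while True:
            fired = next((r for r in candidates if r.condition(case)), None)
            if fired is None:
                return label
            label, candidates = fired.conclusion, fired.exceptions

    def add_rule(self, condition: Callable[[Case], bool], conclusion: str) -> Rule:
        # Adding a rule never touches the base model, so nothing is retrained.
        rule = Rule(condition, conclusion)
        self.rules.append(rule)
        return rule


clf = HybridClassifier(base_model)
case = {"name_sim": 0.85, "type_sim": 0.0}
before = clf.classify(case)  # label from the base model alone

# A (hypothetical) expert decides this case is misclassified and adds a rule.
clf.add_rule(lambda c: c["name_sim"] >= 0.8 and c["type_sim"] < 0.5, "non-match")
after = clf.classify(case)
unaffected = clf.classify({"name_sim": 0.9, "type_sim": 1.0})
```

In this sketch the base model is left untouched when a correction is needed; only the rule list grows, which mirrors the abstract's point that the knowledge base can be modified without building the model again.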

Similar Resources

Ripple Down Rules, a practical method of learning from code rewrites

Software developers sometimes rewrite large sections of program code. Reducing the number of rewrites would save valuable development time. Ripple Down Rules (RDR) has a proven knowledge acquisition track record. RDR appears to offer a simple-to-maintain method for capturing knowledge gained through experience. RDR allows recommendations identified when a failure occurs to be captured and reused. Th...

HepToX: Heterogeneous Peer to Peer XML Databases

We study a collection of heterogeneous XML databases maintaining similar and related information, exchanging data via a peer to peer overlay network. In this setting, a mediated global schema is unrealistic. Yet, users/applications wish to query the databases via one peer using its schema. We have recently developed HepToX, a P2P Heterogeneous XML database system. A key idea is that whenever a ...

NRDR for the Acquisition of Search Knowledge

The contribution of this paper is threefold: It substantially extends Ripple Down Rules, a proven effective method for building large knowledge bases without a knowledge engineer. Furthermore, we propose to develop highly effective heuristics searchers for combinatorial problems by a knowledge acquisition approach to acquire human search knowledge. Finally, our initial experimental results sugges...

Designing a Knowledge-based Schema Matching System for Schema Mapping

Schema mapping that provides a unified view to the users is necessary to manage schema heterogeneity among different data sources. Schema matching is a required task for schema mapping that finds semantic correspondences between entity pairs of schemas. Semi-automatic schema matching systems were developed to overcome manual works for schema mapping. However, such approaches require a high manu...

Maintaining Procedural Knowledge: Ripple-down-functions

Research into the maintenance of procedural knowledge has focused on the detection of declarative meta-structures within the procedures. An alternative approach is described here based on the ripple-down-rules (RDR) formalism. To the standard RDR rule tree, we add a functions environment hierarchy that stores the implementation of the procedures used in the rules. This functions environment str...

Publication date: 2015