Clock: Synchronizing Internal Relational Storage with External XML Documents
نویسندگان
چکیده
In many business settings, a relational database system (RDBMS) will serve as the storage manager for data from XML documents. In such a system, once the XML data is dissembled and loaded into the storage system, XML queries posed against the (virtual) XML documents are processed by translating them into SQL queries against the relational storage. However, for applications which frequently update their XML documents, we cannot afford to reload a complete, possibly large, document for each update, instead we must be able to incrementally propagate document updates to the stored XML data. In this paper, we address the issue of correctly reflecting updates of external XML documents into the loaded XML data in a relational database system. We describe Clock, a framework for synchronizing the relational storage with updated XML documents by exploiting a metadata-driven technology. First, we propose a set of (DTD preserving) update primitives for XML documents. Second, based on the mapping between XML and the relational model, we describe the propagation of those update primitives. Validation of the updates ensures they will not violate the constraints specified by the DTD. We have implemented a working prototype of the Clock system using the IBM’s XML4J parser, JDBC 2 and Oracle 8i. We report on preliminary experiments conducted using this prototype to analyze our algorithms in a document update setting. This work was supported in part by several grants from NSF, namely, the NSF NYI grant #IRI 97–96264, the NSF CISE Instrumentation grant #IRIS 97–29878, and the NSF grant #IIS 97–32897. Dr. Rundensteiner would like to also thank IBM for the IBM partnership award and Verizon Laboratories Incorporated for partial support of Xin Zhang.
منابع مشابه
An approach to the model-based fragmentation and relational storage of XML-documents
A flexible method to store XML documents in relational or object-relational databases is presented that is based on an adaptable fragmentation. Whereas most known approaches decompose XML documents into minimal units we propose to store fragments of variable granularity ranging from single elements to whole documents. Different fragmentation strategies depending on the specific access and query...
متن کاملXML Data Stores: Emerging Practices
X ML's wide adoption in both academia and industry has revealed a controversial fact. From one perspective, XML is characterized by interoperability, ease of use, and extensibility; from another, it creates difficulties when integrated into existing systems (such as typical database applications and database management systems). One result of this conflict is the arrival of native XML data stor...
متن کاملA DOM-Based Approach of Storage and Retrieval of XML Documents Using Relational Databases
This paper describes a novel approach of storage and retrieval of XML documents using relational databases. In this approach, an XML document is decomposed into nodes based on its tree structure, and stored into relational tables according to the nodes types. Our approach enables us to store XML documents using a fixed relational schema without any information about XML schema, and DTD. For the...
متن کاملAn Efficient Data Extraction and Storage Utility For XML Documents
In this paper, a mechanism to provide selective extraction of data objects from XML documents, the storage of these documents in an object-relational database, and retrieval and reconstruction of XML documents from extracted data objects is discussed. The motivation is provided by a need for a Workflow Process Repository in a Workflow Management System (WFMS) [6], namely METEOR WFMS, to store m...
متن کاملHow to Store and Query XML Data
Due to the fact XML is a universal data-exchange format, all the questions concerning storage and querying XML documents have been lately of great popularity. Query languages, called XPath and XQuery have been developed and standardized in order to query XML data not only in XML files, but anything existing as XML, including databases. XML databases running XPath and XQuery are generally classi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001