A bi-labeling based XPath processing system
نویسندگان
چکیده
We present BLAS, a Bi-LAbeling based XPath processing System. BLAS uses two labeling schemes to speed up query processing: P-labeling for processing consecutive child (or parent) axis traversals, and D-labeling for processing descendant (or ancestor) axis traversals. XML data are stored in labeled form and indexed. Algorithms are presented for translating XPath queries to SQL expressions. BLAS reduces the number of joins in the SQL query translated from a given XPath query and reduces the number of disk accesses required to execute the SQL query compared with the traditional XPath processing using D-labeling alone. We also propose an approximate P-labeling scheme and the corresponding query translation algorithm to handle XML data trees that contain a large number of distinct tag names, and/or are very deep. This extension captures a spectrum of XPath-to-SQL query translation schemes, ranging from existing schemes that do not use P-labels to the one that uses exact P-labels. Experimental results demonstrate the efficiency of the BLAS system. Disciplines Computer Sciences Comments Chen, Y., Davidson, S., & Zheng, Y., A Bi-Labeling Based XPath Processing System, Information Systems 35, 2010, doi: 10.1016/j.is.2009.05.005 This conference paper is available at ScholarlyCommons: http://repository.upenn.edu/cis_papers/600 A bi-labeling based XPath processing system Yi Chen a, , Susan B. Davidson , Yifeng Zheng c a Arizona State University, United States b University of Pennsylvania, United States c Amazon.com, United States a r t i c l e i n f o
منابع مشابه
Terminology of Combining the Sentences of Farsi Language with the Viterbi Algorithm and BI-GRAM Labeling
This paper, based on the Viterbi algorithm, selects the most likely combination of different wording from a variety of scenarios. In this regard, the Bi-gram and Unigram tags of each word, based on the letters forming the words, as well as the bigram and unigram labels After the breakdown into the composition or moment of transition from the decomposition to the combination obtained from th...
متن کاملAn Efficient Encoding and Labeling for Dynamic XML Data
In order to efficiently determine structural relationships among XML elements and to avoid re-labeling for updates, much research about labeling schemes has been conducted, recently. However, a harmonic support of efficient query processing and updating has not been achieved. In this paper, we propose an efficient XML encoding and labeling scheme, called EXEL, which is a variant of the region n...
متن کاملXPath-Logic and XPathLog: a logic-programming-style XML data manipulation language
We define XPathLog as a Datalog-style extension of XPath. XPathLog provides a clear, declarative language for querying and manipulating XML whose perspectives are especially in XML data integration. In our characterization, the formal semantics is defined wrt. an edge-labeled graph-based model which covers the XML data model. We give a complete, logic-based characterization of XML data and the ...
متن کاملAn XPath Algebraic Characterization of A ( k ) and P ( k ) Indices with Applications to Query Processing
Recent studies have proposed structural summary techniques for path query evaluation on semi-structured data sources. One major line of this research has been the introduction of the DataGuide, 1-index, 2-index, and A(k) indices, and subsequent investigations and generalizations. Another recent study has considered structural characterizations of fragments of XPath, the standard path navigation...
متن کاملAn Encoding and Labeling Scheme Based on Continued Fraction for Dynamic XML
Much research about labeling schemes has been conducted to efficiently determine the ancestor-descendant relationships and the document-order between any two random XML nodes without re-labeling for updates. In this paper, we present an efficient XML encoding and labeling scheme for dynamic XML document, named Continued Fraction-based Encoding (CFE). The proposed CFE scheme labels nodes with co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Inf. Syst.
دوره 35 شماره
صفحات -
تاریخ انتشار 2010