XML Document Indexes: A Classification
نویسندگان
چکیده
Because choosing the most efficient query execution plan relies on indexing techniques, such techniques play an important role in developing query processors. In the Web context, they’re even more crucial, as XML documents are massively used and frequently queried. Given that XML documents are semistructured, however, general query processing techniques — such as those for relational or objectoriented data — won’t work. Researchers have proposed several XMLspecific indexing approaches, but to our knowledge, a unified overview of XML indexing has yet to appear. Such an overview and classification would be useful both theoretically, during design and development, and practically, in helping XML application developers choose the appropriate product. Here, we present a classification of XML indexing techniques based on two key factors. First, we identify the query type that can be optimized. Second, we identify the query processing strategy that benefits from a specific indexing technique. Our classification highlights each technique’s main characteristics, advantages, and drawbacks. In addition to helping developers choose the best XML query management strategies, the classification can help users of those solutions understand why they get specific performance results.
منابع مشابه
Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML
As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...
متن کاملApply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML
As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...
متن کاملFliX: A Flexible Framework for Indexing Complex XML Document Collections
While there are many proposals for path indexes on XML documents, none of them is perfectly suited for indexing large-scale collections of interlinked XML documents. Existing strategies lack support for intraor inter-document links, require large amounts of time to build or space to store the index, or cannot efficiently answer connection queries. This paper presents the FliX framework for conn...
متن کاملIndexing collections of XML documents with arbitrary links
______________________________________________________________________ In recent years, the popularity of XML has increased significantly. XML is the extensible markup language of the World Wide Web Consortium (W3C). XML is used to represent data in many areas, such as traditional database management systems, e-business environments, and the World Wide Web. XML data, unlike relational and objec...
متن کاملAn Efficient Index Lattice for XML Query Optimization
Structural indexes of XML data can effectively reduce the search space for the evaluation of path queries over the data. The indexes partition the structural graph of an XML document into equivalent classes of nodes that are then condensed into index nodes. However, structural indexes are inadequate to handle queries with valuebased conditions, since equivalent nodes in the same partition becom...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Internet Computing
دوره 9 شماره
صفحات -
تاریخ انتشار 2005