XML Document Indexes: A Classification

نویسندگان

  • Barbara Catania
  • Anna Maddalena
  • Athena Vakali
چکیده

Because choosing the most efficient query execution plan relies on indexing techniques, such techniques play an important role in developing query processors. In the Web context, they’re even more crucial, as XML documents are massively used and frequently queried. Given that XML documents are semistructured, however, general query processing techniques — such as those for relational or objectoriented data — won’t work. Researchers have proposed several XMLspecific indexing approaches, but to our knowledge, a unified overview of XML indexing has yet to appear. Such an overview and classification would be useful both theoretically, during design and development, and practically, in helping XML application developers choose the appropriate product. Here, we present a classification of XML indexing techniques based on two key factors. First, we identify the query type that can be optimized. Second, we identify the query processing strategy that benefits from a specific indexing technique. Our classification highlights each technique’s main characteristics, advantages, and drawbacks. In addition to helping developers choose the best XML query management strategies, the classification can help users of those solutions understand why they get specific performance results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML

As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...

متن کامل

Apply Uncertainty in Document-Oriented Database (MongoDB) Using F-XML

As moving to big data world where data is increasing in unstructured way with high velocity, there is a need of data-store to store this bundle amount of data. Traditionally, relational databases are used which are now not compatible to handle this large amount of data, so it is needed to move on to non-relational data-stores. In the current study, we have proposed an extension of the Mongo...

متن کامل

FliX: A Flexible Framework for Indexing Complex XML Document Collections

While there are many proposals for path indexes on XML documents, none of them is perfectly suited for indexing large-scale collections of interlinked XML documents. Existing strategies lack support for intraor inter-document links, require large amounts of time to build or space to store the index, or cannot efficiently answer connection queries. This paper presents the FliX framework for conn...

متن کامل

Indexing collections of XML documents with arbitrary links

______________________________________________________________________ In recent years, the popularity of XML has increased significantly. XML is the extensible markup language of the World Wide Web Consortium (W3C). XML is used to represent data in many areas, such as traditional database management systems, e-business environments, and the World Wide Web. XML data, unlike relational and objec...

متن کامل

An Efficient Index Lattice for XML Query Optimization

Structural indexes of XML data can effectively reduce the search space for the evaluation of path queries over the data. The indexes partition the structural graph of an XML document into equivalent classes of nodes that are then condensed into index nodes. However, structural indexes are inadequate to handle queries with valuebased conditions, since equivalent nodes in the same partition becom...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Internet Computing

دوره 9  شماره 

صفحات  -

تاریخ انتشار 2005