Dynamic Clustering-Based Query Answering in Peer-to-Peer Systems

Authors

  • Weining Qian
  • Shuigeng Zhou
  • Yi Ren
  • Aoying Zhou
  • Beng Chin Ooi
  • Kian-Lee Tan
Abstract

P2P computing has been employed in more and more application domains as the technology has matured. One popular and successful application area is file sharing. However, current file sharing systems support only, or mainly, key-based exact matching (e.g., Chord [27], CAN [25]) and keyword-based searching (e.g., Napster, Gnutella) for file discovery and location, which is not enough to meet the requirements of more advanced applications such as information retrieval and data management. In this paper, we propose a new query answering model for P2P applications, termed clustering-based query answering (CBQA). In our definition, CBQA retrieves the data objects that are in the same cluster as the query from the global dataset distributed over the peers of a P2P system. Generally, CBQA may obtain more correct answers than a similarity-based query can, which means higher recall may be achieved. To implement the new query model, we first present a framework that supports clustering-based query answering, including general algorithms, lemmas, and system architecture. Then we give three concrete algorithms for different clustering criteria, namely k-nearest-neighbor, distance-based, and density-based clustering, along with detailed analyses and discussions. Finally, implementation issues are addressed, especially the dynamic neighbor selection and caching techniques that enable our method to scale. Theoretical analysis and preliminary experiments show that our method is guaranteed to find the desired objects in the cluster of interest with modest bandwidth overhead.
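
The difference between a similarity query and a clustering-based query can be illustrated with a small sketch. The code below is not the paper's algorithm: it assumes a distance-based criterion in which an object belongs to the query's cluster if it can be reached from the query through a chain of hops, each no longer than a threshold `eps`. The names `peers`, `range_query`, `cluster_query`, and `eps` are hypothetical, and the peers are simulated as an in-memory dictionary rather than a real network.

```python
from collections import deque
from math import dist  # Euclidean distance, Python 3.8+


def range_query(peers, q, eps):
    """Plain similarity query: every object within eps of q, over all peers."""
    return {p for objs in peers.values() for p in objs if dist(p, q) <= eps}


def cluster_query(peers, q, eps):
    """Assumed distance-based cluster of q: objects reachable from q
    through a chain of hops, each of length at most eps."""
    found = set()
    frontier = deque([q])
    while frontier:
        centre = frontier.popleft()
        # Ask all (simulated) peers for objects near the current centre,
        # and expand only from objects not seen before.
        for hit in range_query(peers, centre, eps) - found:
            found.add(hit)
            frontier.append(hit)
    return found


# Hypothetical dataset spread over three peers.
peers = {
    "peer_a": [(0.0, 0.0), (1.0, 0.0)],
    "peer_b": [(2.0, 0.0), (3.0, 0.0)],
    "peer_c": [(10.0, 10.0)],
}
query = (0.0, 0.0)

near = range_query(peers, query, 1.5)
cluster = cluster_query(peers, query, 1.5)
print(sorted(near))
print(sorted(cluster))
```

In this toy dataset the similarity query returns only the two objects near the query, while the cluster query also returns the two objects held by `peer_b`, which is the sense in which a clustering-based answer may reach higher recall. The sketch contacts every peer on every expansion step; how to limit that traffic is exactly what neighbor selection and caching are meant to address.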

Similar resources

Information Integration in Schema-Based Peer-To-Peer Networks

Peer-to-peer (P2P) networks have become an important infrastructure in recent years. Using P2P networks for distributed information systems allows us to shift the focus from centrally organized to distributed information systems where all peers can provide and have access to information. In previous papers, we have described an RDF-based P2P infrastructure called Edutella which is a speci...

On the Discovery of the Semantic Context of Queries by Game-Playing

In: Proceedings of the Sixth International Conference on Flexible Query Answering Systems (FQAS-04), Springer-Verlag LNAI 3055, pp. 203-216. © 2004 by Springer-Verlag. To model query answering, the question arises of how the meaning of a user’s query contributes to obtaining a valuable answer. In this paper, (1) we investigate the question within an existing peer-to-peer architecture for knowl...

Distributed Query Answering in Peer-to-peer Reasoning Systems Technical Report

Interest in distributed reasoning is gaining momentum due to the emergence of the Semantic Web. In this work, we address the problem of meaningfully answering queries to distributed propositional reasoners which may be mutually inconsistent. We define a peer-to-peer query answering framework in which peers may be prioritized. Our formal framework includes distributed entailment relations for bo...

Efficient and Effective Query Answering in a PDMS with SUNRISE

Peer Data Management Systems (PDMSs) have been recently proposed as an evolution of Peer-To-Peer (P2P) systems toward a more semantics-based description of peers’ contents and relationships. In a PDMS scenario a key challenge is query routing, i.e. the capability of selecting small subsets of semantically relevant peers to forward a query to. In this paper we demonstrate SUNRISE (System for Uni...

Exploiting Preferences over Information Sources to Efficiently Resolve Inconsistencies in Peer-to-peer Query Answering

Decentralized reasoning is receiving increasing attention due to the distributed nature of knowledge on the Web. We address the problem of answering queries to distributed propositional reasoners which may be mutually inconsistent. This paper provides a formal characterization of a prioritized peer-to-peer query answering framework that exploits a preference ordering over the peers as well as a ...

Publication date: 2003