arXiv:1711.01616v1 [cs.DS] 5 Nov 2017 Bloom Filters, Adaptivity, and the Dictionary Problem

Authors

  • Michael A. Bender
  • Martin Farach-Colton
  • Mayank Goswami
  • Rob Johnson
  • Samuel McCauley
  • Shikha Singh
Abstract

The Bloom filter, or more generally an approximate membership query data structure (AMQ), maintains a compact, probabilistic representation of a set S of keys from a universe U. An AMQ supports lookup, and possibly insert and delete operations. If x ∈ S, then lookup(x) returns “present.” If x ∉ S, then lookup(x) may return “present” with probability at most ε, where ε is a tunable false-positive probability, and such an x is called a false positive of the AMQ. Otherwise lookup(x) returns “absent.” AMQs have become widely used to accelerate dictionaries that are stored remotely (e.g., on disk or across a network). By using an AMQ, the dictionary needs to access the remote representation of S only when the AMQ indicates that the queried item might be present in S. Thus, the primary goal of an AMQ is to minimize its false-positive rate, so that the number of unnecessary accesses to the remote representation of S can be minimized. However, the false-positive guarantees for AMQs are rather weak. The false-positive probability of ε holds only for distinct or randomly chosen queries, but does not hold for arbitrary sequences of queries. For example, an adversary that chooses its queries based on the outcomes of previous queries can easily create a sequence of queries consisting almost entirely of false positives. Even simply repeating a randomly chosen query has an ε chance of producing a sequence entirely of false positives. In this paper, we give adaptive AMQs that do have strong false-positive guarantees. In particular, for any fixed ε, our AMQs guarantee a false-positive rate of ε for every query and for every sequence of previously made queries. Furthermore, our adaptive AMQ is optimal in terms of space (up to lower order terms) and complexity (all operations are constant time).
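The weakness described in the abstract can be illustrated with a minimal sketch of a plain, non-adaptive Bloom filter. This is not the paper's adaptive construction. The class `BloomFilter`, the helper `find_false_positive`, the salted SHA-256 hashing scheme, and the parameters (64 bits, 2 hash functions, 20 keys) are all assumptions chosen for illustration. The sketch finds one false positive and then repeats that query, showing that a non-adaptive filter gives the same wrong answer every time.

```python
import hashlib


class BloomFilter:
    """A plain (non-adaptive) Bloom filter over string keys (illustrative only)."""

    def __init__(self, num_bits: int, num_hashes: int) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = [False] * num_bits

    def _positions(self, key: str):
        # Assumed hashing scheme: salt SHA-256 with the hash index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def insert(self, key: str) -> None:
        for pos in self._positions(key):
            self.bits[pos] = True

    def lookup(self, key: str) -> bool:
        # True means "present" (possibly a false positive); False means "absent".
        return all(self.bits[pos] for pos in self._positions(key))


def find_false_positive(amq, members, max_tries=100_000):
    """Return the first non-member key the filter reports as present, or None."""
    for i in range(max_tries):
        candidate = f"probe-{i}"
        if candidate not in members and amq.lookup(candidate):
            return candidate
    return None


# A deliberately small filter so that false positives are easy to find.
members = {f"key-{i}" for i in range(20)}
amq = BloomFilter(num_bits=64, num_hashes=2)
for key in members:
    amq.insert(key)

fp = find_false_positive(amq, members)

# Repeating the same query: the filter never changes its answer,
# so every repetition is again a false positive.
repeated = [amq.lookup(fp) for _ in range(1000)] if fp is not None else []
```

Because the filter's state does not change in response to a discovered false positive, an adversary only needs to find one such key once. An adaptive AMQ, as described in the abstract, is meant to keep the ε guarantee even for such repeated queries.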
This research was supported in part by NSF grants CCF 1114809, CCF 1217708, CCF 1218188, CCF 1314633, CCF 1637458, IIS 1247726, IIS 1251137, CNS 1408695, CNS 1408782, CCF 1439084, CCF-BSF 1716252, CCF 1617618, IIS 1541613, and CAREER Award CCF 1553385, as well as NIH grant 1U01CA198952-01, by the European Research Council under the European Union’s 7th Framework Programme (FP7/2007-2013) / ERC grant agreement no. 614331, and by Sandia National Laboratories, EMC, Inc., and NetApp, Inc.

∗ Stony Brook University, Stony Brook, NY 11794-4400, USA. Email: {bender,shiksingh}@cs.stonybrook.edu.
† Rutgers University, Piscataway, NJ 08855, USA. Email: [email protected].
‡ Queens College, CUNY, New York, USA. Email: [email protected].
§ VMware Research, Creekside F, 3425 Hillview Ave, Palo Alto, CA 94304. Email: [email protected].
¶ IT University of Copenhagen, Copenhagen, Denmark. Email: [email protected].

Similar resources

arXiv:1711.01619v1 [math.OC] 5 Nov 2017 Enlarged Controllability of Riemann–Liouville Fractional Differential Equations

We investigate exact enlarged controllability for time fractional diffusion systems of Riemann–Liouville type. The Hilbert uniqueness method is used to prove exact enlarged controllability for both cases of zone and pointwise actuators. A penalization method is given and the minimum energy control is characterized.

arXiv:nucl-ex/9611001v1 5 Nov 1996 π+ + d → p + p reaction between 18 and 44 MeV

A study of the reaction π+ + d → p + p has been performed in the energy range of 18–44 MeV. Total cross sections and differential cross sections at six angles have been measured at 15 energies with an energy increment of 1–2 MeV. This is the most systematic data set in this energy range. No structure in the energy dependence of the cross section has been observed within the accuracy of this ...

arXiv:hep-ex/9711016v1 21 Nov 1997 Results on Λ0 Production at HERMES

The production of Λ0's at the HERMES experiment is presented. Prospects for the future of Λ measurements at HERMES are discussed.

arXiv:1211.2405v1 [cs.GT] 11 Nov 2012 Rank-1 Games With Exponentially Many Nash Equilibria

The rank of a bimatrix game (A,B) is the rank of the matrix A + B. We give a construction of rank-1 games with exponentially many equilibria, which answers an open problem by Kannan and Theobald (2010).

Publication date: 2017