
Question

4. Bloom filters. Suppose you are given a Bloom filter of size m with t hash functions (each of which can be assumed to be computed in O(1) time) and n inserted elements. Let S be the set representing the keys being stored.

(a) (1 point) Assuming simple uniform hashing, what is the expected time (in terms of t) to search for a key k that may or may not be in S?

(b) (3 points) Between Bloom filters and hashing with chaining, how would you determine which has the better search time if α = n/m is fixed and t is the only variable?

(c) Recall that the probability of a false positive is upper bounded by the following value:

    p = (1 - e^{-tn/m})^t

(i) (5 points) What is the number of hash functions required to get a false positive rate of at most 0.02 if m = 100 and n = 10?

(ii) (7 points) For any given fixed load factor n/m, find the number of hash functions that will minimize the false positive rate. (Hint: differentiate p.)

(d) (4 points) The standard Bloom filters discussed in class do not support deletion operations. Without changing the number m of array slots, describe a way to modify the data structure to allow for deletions, and explain how this modification enables deletion to work.
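The bound in part (c) can be explored numerically. The sketch below is only an illustration, not the posted solution: it assumes the garbled formula reads p = (1 - e^{-tn/m})^t, and the function names (false_positive_bound, smallest_t, best_integer_t) and the search limit t_max are hypothetical choices made for this example. It scans integer values of t and compares the result with the commonly cited continuous minimizer t = (m/n) ln 2, which is what differentiating p should lead to.

```python
import math


def false_positive_bound(m: int, n: int, t: int) -> float:
    """Evaluate the assumed bound p = (1 - e^{-t*n/m})^t."""
    return (1.0 - math.exp(-t * n / m)) ** t


def smallest_t(m: int, n: int, target: float, t_max: int = 64):
    """Smallest t in 1..t_max whose bound is <= target, or None if none is."""
    for t in range(1, t_max + 1):
        if false_positive_bound(m, n, t) <= target:
            return t
    return None


def best_integer_t(m: int, n: int, t_max: int = 64) -> int:
    """Integer t in 1..t_max that minimizes the bound (brute-force scan)."""
    return min(range(1, t_max + 1),
               key=lambda t: false_positive_bound(m, n, t))


def continuous_optimum(m: int, n: int) -> float:
    """Real-valued minimizer of the bound, t = (m/n) * ln 2."""
    return (m / n) * math.log(2)


if __name__ == "__main__":
    m, n = 100, 10  # values from part (c)(i)
    for t in range(1, 9):
        print(t, round(false_positive_bound(m, n, t), 5))
    print("smallest t with p <= 0.02:", smallest_t(m, n, 0.02))
    print("best integer t:", best_integer_t(m, n))
    print("continuous optimum:", continuous_optimum(m, n))
```

Because t must be an integer in practice, the brute-force scan is a useful cross-check on whatever value the calculus in part (c)(ii) produces.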
