Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 09, 2024

( 5 pts ) Implement in Python a class to model hash functions from a certain family H of hash functions. ( Call this class

$(5$ pts $)$ Implement in Python a class to model hash functions from a certain family H of hash functions. $($ Call this class HashFamily. $)$ The constructor for H should take n and p as arguments. The constructor should generate a random set of p distinct indices from ${0, 1, . . .,$ n $- 1}$ and remember this set, call it S $.$ This class should have a method hash $($ x $)$ which computes the hash value of any n $-$ bit number x as follows. The method first extracts ${$ x $[$ i $]$ : i in S $} .$ It then formulates the p $-$ bit number obtained from this set in order of increasing index and returns this p $-$ bit number in decimal. $($ Added $12 - 17$ : See Q&A On this question at the end of this document. $)$

$(5$ pts $)$ Implement in Python a Bloom Filter $($ BF $)$ as a class.

The constructor should take m $,$ k $,$ and n as arguments, where m and k are as done in class and n is the number of bits in an input. $($ Its okay to limit m to be a power of $2 .)$ The constructor should then create k random hash functions as HashFamily objects with arguments n $=$ n and p $=$ log $_2 ($ m $) .$ It maintains these hash functions in its state.

Support insert and lookup as done in class.

Support a bash insert method $-$ a wrapper around insert $-$ as a convenience method $($ see next question $) .$

$(7$ pts $)$ In this question, the aim is to empirically study how various parameter settings impact the false positive rate on simulated data.

Generate $10000 64 -$ bit numbers at random and store this list somewhere. Call it S $.$

for m in $[64, 128, 256, 512, 1024, 2048, . . ., 65536]$ :

for k in $[1, 2, 4, 8, . . .,$ m $/ 2]$ :

Create a BF with params m $=$ m $,$ k $=$ k $,$ n $= 64 .$

for q in $[$ m $, 2$ m $, 4$ m $, 8$ m $, 16$ m $, 32$ m $, 64$ m $]$

Generate q n $-$ bit numbers and batch insert them all into the BF $. ($ Note that these are inserted into an existing BF $.)$

Generate $1000$ random n $-$ bit numbers from scratch and lookup each of them in the BF $.$ Calculate the proportion of these $1$ K numbers on which the BF answers YES. The higher this proportion the higher the false positive rate.

Report your results in a table whose rows correspond to $($ m $,$ k $,$ q $)$ one row per triplet.

Analyze your results to draw meaningful conclusions about the impact of the various parameter settings on the false positive rate.

Q: Having difficulty understanding the requirements for problem $1$ of HW $6 .$ Ask: explain a bit more with an example or what exactly the class functionality should be $?$

A: h $=$ HashFamily $($ n $= 6,$ p $= 2)$ #

After the constructor has executed, h $.$ S is a random $2 -$ element subset of ${1, 2, 3, 4, 5},$ e $.$ g $.$ h $.$ S $= {1, 3} .$

Next, consider h $.$ hash $($ x $)$ where x is an n $-$ bit binary number, represented however you wish $($ e $.$ g $.$ as bit array $) .$ Say x $= 011011 .$ For h $.$ S $= {1, 3}$ the bits in x that h $.$ S touches are in bold next. $011011$ Projecting out these bits forms the binary number $10$ which in decimal is $2 .$ So hash. $($ x $)$ returns $2 .$

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access with AI-Powered Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Students also viewed these Databases questions

Question

=+It follows further that P(B-A) = P(B) - P(A), and as a special case, (2.6) P(Ac) =1- P(A).

Answered: 1 week ago

Question

★★★★★

Ken Johnston, the data processing manager for Stanley Morgan Bank, is planning a project to install a new management information system. He now is ready to start the project, and wishes to finish in...

Answered: 1 week ago

Question

★★★★★

PA4. LO 7.2 Prepare a budgeted income statement using the information shown. 15,000 Sales (units) Sales price per unit Uncollectible expense Direct material per unit Direct labor per unit (hours)...

Answered: 1 week ago

Question

★★★★★

me with the below for logistic management Business organisations work effectively and efficiently when their supply chains are integrated, from the primary source upstream, into the organ

Answered: 1 week ago

Question

★★★★★

Two precepts of Operations Management are efficiency and effectiveness. Efficiency is important in many areas of Operations as it can affect cost and schedule. Effectiveness however takes on another...

Answered: 1 week ago

Question

★★★★★

A hospital (or other provider or supplier) hosts an informational program to help patients with diabetes learn more about controlling their disease. To encourage participation, dinner will be...

Answered: 1 week ago

Question

★★★★★

According to the Holistic University Website Management reading, what team the backbone of the web team structure since it is responsible for the technical, quality, and operational aspects of the...

Answered: 1 week ago

Question

★★★★★

11. [p. 15] What is the equation preload and bolt size? used to calculate required Calculate fastener the tightening torques for the 20 fastener, a grade 8, -20 fastener and a grade 8, 14-28 faster...

Answered: 1 week ago

Question

★★★★★

Comfort with public disagreement is a big source of conflict on cross-cultural teams. Members from cultures that place a high value on "face" and group harmony may be averse to confrontation because...

Answered: 1 week ago

Previous Question Next Question