Question: Suppose relation r is stored partitioned and indexed on A, and s is stored partitioned and indexed on B. Consider the join r r.B=s.B s.
Suppose relation r is stored partitioned and indexed on A, and s is stored partitioned and indexed on B. Consider the join r ⋈r.B=s.B s. Suppose s is relatively small, but not small enough to make asymmetric fragment-and-replicate join the best choice, and r is large, with most r tuples not matching any s tuple. A hash-join can be performed but with a semi join filter used to reduce the data transfer. Explain how semijoin filtering using Bloom filters would work in this parallel join setting.
Step by Step Solution
3.46 Rating (156 Votes )
There are 3 Steps involved in it
A Bloom filter is a spaceefficient data structure that can be used to test whether an element is a m... View full answer
Get step-by-step solutions from verified subject matter experts
