Question
Consider the following tables, which were generated by a ContactTracingApp for finding the first-level contacts made by a patient:

PatientInfo(PatientID, Phone#, Date)
FirstLevelContact(PatientID, Phone#, Date)

PatientInfo holds the ID of the tested patient, their phone number, and the date on which they tested positive for a disease. FirstLevelContact holds the PatientID of the tested patient, the phone number of the primary contact, and the date on which the contact was made. The data is stored as CSV files on HDFS and runs into a few GB each.

a) Write MR pseudocode to identify the number of superspreaders, defined as the number of patients who have more than 20 first-level contacts. Show the intermediate key-value pairs.
b) How many map-reduce steps do you require to generate the output?
d) If FirstLevelContact were stored on HBase using PatientID as the key, with range partitioning for 4000 keys and 500 keys per region, show how the data will be spread across the different region servers.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
It seems you have an image with a question related to MapReduce operations and data distribution across HBase. I'll address each point separately, starti...
Step: 2
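For parts (a) and (b), one possible reading is a two-job pipeline: the first job counts contacts per patient and keeps only patients above the threshold, and the second job counts the survivors. The sketch below simulates that flow in plain Python rather than on Hadoop. The helper `run_job`, the function names, and the CSV column order (PatientID, Phone#, Date, no header row) are assumptions made for illustration, as is the choice to count every FirstLevelContact row as one contact without de-duplicating phone numbers.

```python
from collections import defaultdict
from itertools import chain

THRESHOLD = 20  # "more than 20 first level contacts", from the question


def run_job(records, mapper, reducer):
    """Simulate one MapReduce job in memory: map, shuffle by key, reduce."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)          # shuffle: group values by key
    return list(chain.from_iterable(
        reducer(key, values) for key, values in sorted(groups.items())
    ))


# Job 1: count contacts per patient, keep only those above the threshold.
def map_contacts(line):
    # Assumed CSV column order: PatientID,Phone#,Date (hypothetical layout).
    patient_id, _phone, _date = line.strip().split(",")
    yield patient_id, 1                        # intermediate pair: (PatientID, 1)


def reduce_contacts(patient_id, ones):
    total = sum(ones)
    if total > THRESHOLD:
        yield patient_id, total                # output pair: (PatientID, count)


# Job 2: send every surviving patient to one key and count them.
def map_superspreader(pair):
    yield "superspreaders", 1                  # intermediate pair: ("superspreaders", 1)


def reduce_superspreader(key, ones):
    yield key, sum(ones)                       # output pair: ("superspreaders", N)


def count_superspreaders(lines):
    """Chain the two jobs and return the number of superspreaders."""
    per_patient = run_job(lines, map_contacts, reduce_contacts)
    result = run_job(per_patient, map_superspreader, reduce_superspreader)
    return result[0][1] if result else 0
```

Under this reading the answer to (b) is two MapReduce steps. A single job could also suffice if the final tally were kept in a Hadoop job counter incremented in the reducer, at the cost of the result living in job metadata rather than in an output file. PatientInfo is not needed for the count, since FirstLevelContact already carries the PatientID.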
Step: 3
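For part (d), the arithmetic is 4000 keys / 500 keys per region = 8 regions. The sketch below lays out those ranges, assuming the PatientIDs are the contiguous integers 1 to 4000, which the question does not state. The question also does not give the number of region servers, so `assign_to_servers` takes it as a parameter and spreads regions round-robin purely for illustration; in a real cluster the HBase master's balancer decides placement.

```python
TOTAL_KEYS = 4000       # from the question
KEYS_PER_REGION = 500   # from the question


def region_ranges(total_keys=TOTAL_KEYS, keys_per_region=KEYS_PER_REGION):
    """Return (region_index, first_key, last_key) tuples with inclusive bounds.

    Assumes PatientIDs are the contiguous integers 1..total_keys.
    """
    ranges = []
    for index, first in enumerate(range(1, total_keys + 1, keys_per_region)):
        last = min(first + keys_per_region - 1, total_keys)
        ranges.append((index, first, last))
    return ranges


def region_for(patient_id, keys_per_region=KEYS_PER_REGION):
    """Index of the region whose key range contains patient_id."""
    return (patient_id - 1) // keys_per_region


def assign_to_servers(ranges, num_servers):
    """Round-robin regions over num_servers hypothetical region servers."""
    return {index: index % num_servers for index, _first, _last in ranges}


if __name__ == "__main__":
    servers = assign_to_servers(region_ranges(), num_servers=4)  # 4 is hypothetical
    for index, first, last in region_ranges():
        print(f"region {index}: keys {first}-{last} -> server {servers[index]}")
```

Because HBase sorts row keys as byte arrays, numeric PatientIDs would need a fixed-width encoding (for example zero-padded strings) for these numeric ranges to match the stored order. All rows for one patient land in the same region, so a patient with many contacts concentrates load on a single region server.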