Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

There are seven stages to a MapReduce job: - Job setup - Load Split - Map - Copy - Merge - Reduce - Write part

There are seven stages to a MapReduce job:-
Job setup - Load Split - Map - Copy - Merge - Reduce - Write part
Recall Admhal's Law for evaluating the speed-up of parallel tasks and your knowledge of which stages can be parallelized among the 7 stages for the MapReduce Job. Assume job setup, and copy+merge take 50 and 150 seconds, respectively. Assume with no parallelization, the load split+mapper tasks would take 2500 seconds and the reducers+write part would take 1500 seconds. Given that you have 128 CPU cores, then based on Amdahl's Law, what would be the achievable speed-up in this case?
Round up the result to the nearest integer (e.g.,4.6****-->5,4.3****-->4, and 4.5****-->5).
Hint: you need to calculate how much of the total time percentage, the non-parallelizable (sequential) stages (i.e.,*f*) take and then apply Amdahl's law.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Machine Learning And Knowledge Discovery In Databases European Conference Ecml Pkdd 2010 Barcelona Spain September 2010 Proceedings Part 1 Lnai 6321

Authors: Jose L. Balcazar ,Francesco Bonchi ,Aristides Gionis ,Michele Sebag

2010th Edition

364215879X, 978-3642158797

More Books

Students also viewed these Databases questions

Question

a neglect of quality in relationship to international competitors;

Answered: 1 week ago