Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 25, 2024

A large meteorological data organisation regularly collects various meteorological data from various locations spread throughout the nation, applies some analytics and provides district-wise weather outlook

A large meteorological data organisation regularly collects various meteorological data from various locations spread throughout the nation, applies some analytics and provides district-wise weather outlook for the next 6 hours, 12 hours, 24 hours and 48 hours. It is known that historical data for each district, past prediction details and new district data collected are together used to generate the forecasts. The data is fetched hourly, and details are updated every hour. For each district, the whole fetching process takes about 15 minutes and the core analytics work takes about 30 minutes. Since they still use a legacy network, any parallelization will result in a 5 min communication overhead. Assume there are 650 districts, and a cluster of 65,000 nodes. 80% of code can be parallelised. i.How much speedup is theoretically achievable given the high communication overhead? ii. The organisation was using reduced data size so that they could complete work in time at the cost of reduced accuracy for larger time windows. If it uses full data, then time will increase by a factor of 4. Will the company be able to execute for full data if communication overhead was reduced to zero? Justify with relevant computation. A large meteorological data organisation regularly collects various meteorological data from various locations spread throughout the nation, applies some analytics and provides district-wise weather outlook for the next 6 hours, 12 hours, 24 hours and 48 hours. It is known that historical data for each district, past prediction details and new district data collected are together used to generate the forecasts. The data is fetched hourly, and details are updated every hour. For each district, the whole fetching process takes about 15 minutes and the core analytics work takes about 30 minutes. Since they still use a legacy network, any parallelization will result in a 5 min communication overhead. Assume there are 650 districts, and a cluster of 65,000 nodes. 80% of code can be parallelised. i.How much speedup is theoretically achievable given the high communication overhead? ii. The organisation was using reduced data size so that they could complete work in time at the cost of reduced accuracy for larger time windows. If it uses full data, then time will increase by a factor of 4. Will the company be able to execute for full data if communication overhead was reduced to zero? Justify with relevant computation

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Databases In Networked Information Systems 6th International Workshop Dnis 2010 Aizu Wakamatsu Japan March 2010 Proceedings Lncs 5999

Authors: Shinji Kikuchi ,Shelly Sachdeva ,Subhash Bhalla

2010th Edition

3642120377, 978-3642120374

More Books

Students also viewed these Databases questions

Question

List the six steps in the model proposed by Bell for the implementation of analytics.

Answered: 1 week ago

Question

★★★★★

On January 1, 2014, Everett Corporation had these stockholders equity accounts. Common Stock ($10 par value, 70,000 shares issued and outstanding).... $700,000 Paid-in Capital in Excess of Par...

Answered: 1 week ago

Question

★★★★★

A large meteorological data organisation regularly collects various meteorological data from various locations spread throughout the nation, applies some analytics and provides district-wise weather...

Answered: 1 week ago

Question

★★★★★

The aggregate supply curve (AS) is positive relationship with price (P) in the short run; therefore the curve is upward sloping from left to right. Whereas, in the long run aggregate supply (LRAS)...

Answered: 1 week ago

Question

★★★★★

SOMER ZRELINE 1298 097 Ingredients: Pret lostery Acetate Directions Apply disbr Paint the s your on Warnings for estonal of t A Unilever Brand 80 anyode NAWINIO DAIMORE SHOLA Question 2.5. The...

Answered: 1 week ago

Question

★★★★★

The bicycle and rider shown in (Figure 1) have a mass of 70 kg with center of mass located at G. The coefficient of kinetic friction at the rear tire is B = 0.6. Figure Part A B 0.55 m -0.4 m 1.2 m 1...

Answered: 1 week ago

Question

★★★★★

Oakmont Company has an opportunity to manufacture and sell a new product for a four-year period. The company's discount rate is 16% and It estimated the following costs and revenues for the new...

Answered: 1 week ago

Question

★★★★★

A coin collector sells III-Vth century Roman sesterces (a silver coin of ancient Rome) via an internet link. Her last week's sales are shown in the spreadsheet table below. (Hint: she sold each...

Answered: 1 week ago

Question

★★★★★

After walking through the fifth gate, your computer informs you that more than half of the gates have been crossed. "Only 4 left", you conclude. "Computer, what is this gate's riddle?" Your computer...

Answered: 1 week ago

Question

★★★★★

4. How can the characteristics of the trainee affect self-directed learning?

Answered: 1 week ago

Question

★★★★★

3. Discuss the process of behavior modeling training.

Answered: 1 week ago

Question

★★★★★

1. What are the strengths and weaknesses of the lecture, the case study, and behavior modeling?

Answered: 1 week ago

Previous Question Next Question