Answered step by step
Verified Expert Solution
Question
1 Approved Answer
4. (40 points) Assume for arithmetic, load/store, and branch instructions, a processor has CPIs of 1, 12, and 5, respectively. Also assume that on a
4. (40 points) Assume for arithmetic, load/store, and branch instructions, a processor has CPIs of 1, 12, and 5, respectively. Also assume that on a single processor a program requires the execution of 2.56E9 arithmetic instructions, 1.28E9 load/store instructions, and 256 million branch instructions. Assume each processor has a 2 GHz clock frequency. Assume that, as the program is parallelized to run over multiple cores, the number of arithmetic and load/store instructions per processor is divided by 0.7 xp (where p is the number of processors) but the number of branch instructions per processor remains the same. a. Find the total execution time for this program on 1, 4, and 8 processors, and show the relative speedup of the 4, and 8 processor results relative to the single processor result. b. If the CPI of the arithmetic instructions was doubled, what would the impact be on the execution time of the program on 1, 2, 4, and 8 processors? C. To what should the CPI of load/store instructions be reduced in order for a single processor to match the performance of four processors using the original CPI values? 4. (40 points) Assume for arithmetic, load/store, and branch instructions, a processor has CPIs of 1, 12, and 5, respectively. Also assume that on a single processor a program requires the execution of 2.56E9 arithmetic instructions, 1.28E9 load/store instructions, and 256 million branch instructions. Assume each processor has a 2 GHz clock frequency. Assume that, as the program is parallelized to run over multiple cores, the number of arithmetic and load/store instructions per processor is divided by 0.7 xp (where p is the number of processors) but the number of branch instructions per processor remains the same. a. Find the total execution time for this program on 1, 4, and 8 processors, and show the relative speedup of the 4, and 8 processor results relative to the single processor result. b. If the CPI of the arithmetic instructions was doubled, what would the impact be on the execution time of the program on 1, 2, 4, and 8 processors? C. To what should the CPI of load/store instructions be reduced in order for a single processor to match the performance of four processors using the original CPI values
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started