Exercise 4 19 This exercise is intended to help you understand the cost complexity performance tradeoffs of forwarding in a pipelined processor Problems in this exercise refer to pipelined datapaths from Figure 4 45 These problems assume that, of all instructions executed in a processor, the following fraction of these instructions has a particular type of RAW data dependence The type of RAW data dependence is identifi ed by the stage that produces the result (EX or MEM) and the instruction that consumes the result (1st instruction that follows the one that produces the result, 2nd instruction that follows, or both) We assume that the register write is done in the fi rst half of the clock cycle and that register reads are done in the second half of the cycle, so EX to 3rd and MEM to 2nd dependences are not counted because they can not result in data hazards Also, assume that the CPI of the processor is 1 if there are no data hazards EX to 1st only EX to 1st and 2nd EX to 2nd only MEM to 1st a 10 10 5 25 b 15 5 10 20 4 19 1 10 4 7 If we use no forwarding, what fraction of cycles are we stalling due to data hazards 4 19 2 5 4 7 If we use full forwarding (forward all results that can be forwarded), what fraction of cycles are we stalling due to data hazards 4 19 3 10 4 7 Let us assume that we can not afford to have three input Muxes that are needed for full forwarding We have to decide if it is better to forward only from the EX MEM pipeline register (next cycle forwarding) or only from the MEM WB pipeline register (two cycle forwarding) Which of the two options results in fewer data stall cycles The remaining three problems in this exercise refer to the following latencies for individual pipeline stages For the EX stage, latencies are given separately for a processor without forwarding and for a processor with different kinds of forwarding IF ID EX (no FW) EX (full FW) EX (FW from EX MEM only) EX (FW from MEM WB only) MEM WB a 100ps 50ps 75ps 110ps 100ps 100ps 100ps 60ps b 250ps 300ps 200ps 350ps 320ps 310ps 300ps 200ps 4 19 4 10 4 7 For the given hazard probabilities and pipeline stage latencies, what is the speed up achieved by adding full forwarding to a pipeline that had no forwarding 4 19 5 10 4 7 What would be the additional speed up (relative to a processor with forwarding) if we added time travel forwarding that eliminates all data hazards Assume that the yet to be invented time travel circuitry adds 100ps to the latency of the full forwarding EX stage 4 19 6 20 4 7 Repeat Exercise 4 19 3 but this time determine which of the two options results in shorter time per instruction

Question

Exercise 4 19 This exercise is intended to help you understand the cost complexity performance tradeoffs of forwarding in a pipelined processor  Problems in this exercise refer to pipelined datapaths from Figure 4 45  These problems assume that, of all instructions executed in a processor, the following fraction of these instructions has a particular type of RAW data dependence  The type of RAW data dependence is identifi ed by the stage that produces the result (EX or MEM) and the instruction that consumes the result (1st instruction that follows the one that produces the result, 2nd instruction that follows, or both)  We assume that the register write is done in the fi rst half of the clock cycle and that register reads are done in the second half of the cycle, so EX to 3rd and MEM to 2nd dependences are not counted because they can not result in data hazards  Also, assume that the CPI of the processor is 1 if there are no data hazards  EX to 1st only EX to 1st and 2nd EX to 2nd only MEM to 1st a  10  10  5  25  b  15  5  10  20  4 19 1  10   4 7  If we use no forwarding, what fraction of cycles are we stalling due to data hazards  4 19 2  5   4 7  If we use full forwarding (forward all results that can be forwarded), what fraction of cycles are we stalling due to data hazards  4 19 3  10   4 7  Let us assume that we can not afford to have three input Muxes that are needed for full forwarding  We have to decide if it is better to forward only from the EX MEM pipeline register (next cycle forwarding) or only from the MEM WB pipeline register (two cycle forwarding)  Which of the two options results in fewer data stall cycles  The remaining three problems in this exercise refer to the following latencies for individual pipeline stages  For the EX stage, latencies are given separately for a processor without forwarding and for a processor with different kinds of forwarding  IF ID EX (no FW) EX (full FW) EX (FW from EX MEM only) EX (FW from MEM WB only) MEM WB a  100ps 50ps 75ps 110ps 100ps 100ps 100ps 60ps b  250ps 300ps 200ps 350ps 320ps 310ps 300ps 200ps 4 19 4  10   4 7  For the given hazard probabilities and pipeline stage latencies, what is the speed up achieved by adding full forwarding to a pipeline that had no forwarding  4 19 5  10   4 7  What would be the additional speed up (relative to a processor with forwarding) if we added time travel forwarding that eliminates all data hazards  Assume that the yet to be invented time travel circuitry adds 100ps to the latency of the full forwarding EX stage  4 19 6  20   4 7  Repeat Exercise 4 19 3 but this time determine which of the two options results in shorter time per instruction

SolutionInn · Accepted Answer

The Answer is in the image, click to view ...

Question: Exercise 4.19 This exercise is intended to help you understand the cost/complexity/performance tradeoffs of forwarding in a pipelined processor. Problems in this exercise refer to

Step by Step Solution