Question
The following problem concerns optimizing a procedure for maximum performance on an Intel Pentium IV with the following characteristics of the functional units:

Operation                    Latency   Issue Time/Rate
Integer Add                  1         1
Integer Multiply             3         1
Floating Point Add           2         1
Floating Point Multiply      4         2
Load or Store (Cache Hit)    1         1

Assume there is one of each functional unit, and that array1 and array2 have the correct types, e.g. int or floating point. Assume input1, input2, input3, out1, and out2 can be stored in registers.

(a)
    float out1, out2, input1, input2;
    for (i = 0; i < length; i++) {
        out1 = input1 <op> array1[i];
        out2 = out<?> <op> array2[i];
    }

What is the CPE of this loop?

(b)
    int input1, input2, input3;
    for (i = 0; i < length; i++) {
        input1 = input1 <op> array1[i];
        input2 = input<?> <op> array2[i];
        input3 = input2 <op> input<?>;
    }

What is the CPE of this loop? (The answer is an integer.)

In both loops, <op> marks an operator and <?> a variable index that is not legible in the source.
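One common way to reason about a question like this is to take the larger of two lower bounds on cycles per element (CPE): a throughput bound from the issue times and a latency bound from the loop-carried dependency chain. The C sketch below encodes the table from the question and computes both bounds. It is a rough model, not a definitive answer to either part: the helper names (`throughput_bound`, `latency_bound`, `cpe_estimate`) are made up for illustration, and it assumes one element per iteration and that the caller supplies the operation counts and the dependency chain by hand.

```c
#include <assert.h>

/* Functional units, in the order of the table in the question. */
enum { INT_ADD, INT_MUL, FP_ADD, FP_MUL, LOAD_STORE, NUM_UNITS };

/* Latency and issue time in cycles, copied from the table. */
typedef struct { const char *name; int latency; int issue; } Unit;

static const Unit UNITS[NUM_UNITS] = {
    {"Integer Add",               1, 1},
    {"Integer Multiply",          3, 1},
    {"Floating Point Add",        2, 1},
    {"Floating Point Multiply",   4, 2},
    {"Load or Store (Cache Hit)", 1, 1},
};

/* Throughput bound: with one unit of each kind, an iteration that issues
 * count[u] operations to unit u keeps that unit busy for
 * count[u] * issue time cycles. The busiest unit sets the bound. */
static int throughput_bound(const int count[NUM_UNITS]) {
    int bound = 0;
    for (int u = 0; u < NUM_UNITS; u++) {
        int cycles = count[u] * UNITS[u].issue;
        if (cycles > bound) bound = cycles;
    }
    return bound;
}

/* Latency bound: operations on a loop-carried dependency chain must run
 * one after another, so their latencies add up. An empty chain (n == 0)
 * gives a bound of 0. */
static int latency_bound(const int *chain, int n) {
    assert(n >= 0);
    int sum = 0;
    for (int k = 0; k < n; k++) sum += UNITS[chain[k]].latency;
    return sum;
}

/* CPE estimate: the larger of the two lower bounds. */
static int cpe_estimate(const int count[NUM_UNITS], const int *chain, int n) {
    int t = throughput_bound(count);
    int l = latency_bound(chain, n);
    return t > l ? t : l;
}
```

As a usage example, take a hypothetical float loop `acc = acc * a[i]` (not one of the loops in the question). Each iteration issues one load, one floating-point multiply, and one integer add for the index, and the multiply sits on the loop-carried chain through `acc`. The throughput bound is 2 (the multiplier's issue time) and the latency bound is 4, so the estimate is 4. If the result did not feed the next iteration, the chain would be empty and the estimate would drop to 2.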