Answered step by step
Verified Expert Solution
Question
1 Approved Answer
COMPUTER architecture 2.24 130] Since instruction-level parallelism can also be effectively exploited on in-order superscalar processors and very long instruction word (VLIW) processors with speculation,
COMPUTER architecture
2.24 130] Since instruction-level parallelism can also be effectively exploited on in-order superscalar processors and very long instruction word (VLIW) processors with speculation, one important reason for building an out-of- ordef (O0O) superscalar processor is the ability to tolerate unpredictable memory latency caused by cache misses. Hence, you can think about hardware supporting O00 issue as being part of the memory system! Look at the floorplan of the Alpha 21264 in Figure 2.33 to find the relative area of the integer and floating-point issue queues and mappers versus the caches. The queues schedule instructions for issue, and the mappers rename register specifiers. Hence, these are necessary additions to support O00 issue. The 21264 only has Ll data and instruction caches on chip, and they are both 64 KB two-way set associative. Use an 000 superscalar simulator such as SimpleScalar (www.cs.wisc.edul-mscalar simplescalar.html) on memory-intensiv benchmarks to find out how much performance is lost if the area of the issue queues and mappers is used for adi- tional Ll data cache area in an in-order superscalar processor, instead of O00 issue in a model of the 21264. Make sure the other aspects of the machine are as similar as possible to make the comparison fair. Ignore any increase in access or cycle time from larger caches and effects of the larger data cache on the floorplan of the chip. (Note that this comparison will not be totally fair, as the code will not have been scheduled for the in-order processor by the compiler.) 2.24 130] Since instruction-level parallelism can also be effectively exploited on in-order superscalar processors and very long instruction word (VLIW) processors with speculation, one important reason for building an out-of- ordef (O0O) superscalar processor is the ability to tolerate unpredictable memory latency caused by cache misses. Hence, you can think about hardware supporting O00 issue as being part of the memory system! Look at the floorplan of the Alpha 21264 in Figure 2.33 to find the relative area of the integer and floating-point issue queues and mappers versus the caches. The queues schedule instructions for issue, and the mappers rename register specifiers. Hence, these are necessary additions to support O00 issue. The 21264 only has Ll data and instruction caches on chip, and they are both 64 KB two-way set associative. Use an 000 superscalar simulator such as SimpleScalar (www.cs.wisc.edul-mscalar simplescalar.html) on memory-intensiv benchmarks to find out how much performance is lost if the area of the issue queues and mappers is used for adi- tional Ll data cache area in an in-order superscalar processor, instead of O00 issue in a model of the 21264. Make sure the other aspects of the machine are as similar as possible to make the comparison fair. Ignore any increase in access or cycle time from larger caches and effects of the larger data cache on the floorplan of the chip. (Note that this comparison will not be totally fair, as the code will not have been scheduled for the in-order processor by the compiler.)Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started