Question: Now assume that we can use scatter-gather loads and stores (LVI and SVI). Assume that tiPL, tiPR, clL, clR, and clP are arranged consecutively in

Now assume that we can use scatter-gather loads and stores (LVI and SVI). Assume that tiPL, tiPR, clL, clR, and clP are arranged consecutively in memory. For example, if seq_length==500, the tiPR array would begin 500 * 4 bytes after the tiPL array. How does this affect the way you can write the VMIPS code for this kernel? Assume that you can initialize vector registers with integers using the following technique which would, for example, initialize vector register V1 with values (0,0,2000,2000):

LI R2,0 SW R2,vec SW R2, vec+4 LI R2,2000 SW R2, vec+8 SW R2, vec+12 LV V1,vec

Assume the maximum vector length is 64. Is there any way performance can be improved using gather-scatter loads? If so, by how much?

LI R2,0 SW R2,vec SW R2, vec+4 LI R2,2000 SW R2, vec+8 SW R2, vec+12 LV V1,vec

Step by Step Solution

★★★★★

3.40 Rating (169 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer

Step: 1 Unlock

In this case the 16 values could be loaded into each vector register pe... View full answer

Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock

Step: 3 Unlock

Students Have Also Explored These Related Computer Architecture Questions!

Suppose that we wish to add the operation PRINT-SET(x), which is given a node x and prints all the members of xs set, in any order. Show how we can add just a single attribute to each node in a...

Draw an ERD for each of the following situations. (If you believe that you need to make additional assumptions, clearly state them for each situation.) Draw the same situation using the tool you have...

Assume that we know the bottleneck link along the path from the server to the client is the first link with rate R, bits/sec. Suppose we send a pair of packets back to back from the server to the...

CHAPTER 12 Valuation Cash-Flow-s peeches me i n Exhibits 12.17 12 10 12 14 inchide the actual amounts for fiscal 2015 and the projected amounts for Year 1 to Years for the income statements, balance...

ch3 ch3 Required Information [The following infomation apples to the questions disployed below] Sweeten Company had no jobs in progress at the beginning of the year and no beginving inventories, it...

Required: a. If the Jacobys decide to rent the home, what is their after-tax cost of the rental for the first year? (include income from the annuity account in your analysis.) Note: Round your...

8;4 1. [-/0.45 Points]DETAILSBBUNDERSTAT12 8.4.003. MY NOTES ASK YOUR TEACHER When testing the difference of means for paired data, what is the null hypothesis? H o : d 0 H o : d = 0 H...

Decision Analysis - Luxury Car vs. Sportscar The next 15 questions refer to the following scenario: A car manufacturer can launch either a new luxury sedan (L) or a new high-performance sports car...

I will gladly give a thumbs up to whomever can answer my questions thank you. TurStuff, Inc., sells a wide range of drums, bins, boxes, and other containers that are used in the chemical Industry....

NEED ANSWERED ASAP! EVERY TH ING IN YELLOW AND IN THE SAME CHART FORMAT Thank You! last picture is simply a zoomed out ceiw of the whole problem thank you again! THESE ARE CLEARER PICTURES USE THESE...

With respect to the interest rate, (a) What is the liquidity effect? (b) What is the price-level effect? (c) What is the expectations effect?

Plaintiff-appellants Bates and O'Steen, licensed to practice law in the state of Arizona, opened a "legal clinic" in 1974. The clinic provided legal services to people with modest incomes for...

Which of the following is not a benefit of statistical sampling? Seleccione una: a . It allows inference about a population. b . It allows quantification of risk. c . It allows auditors to be certain...

The risk owner, or person who will own or take responsibility for the risk event: One person should be responsible for monitoring each risk event.

Sequential consistency (SC) requires that all reads and writes appear to have executed in some total order. This may require the processor to stall in certain cases before committing a read or write...

The switched snooping protocol above supports sequential consistency in part by making sure that reads are not performed while another node has a writeable block and writes are not performed while...

For each part of this exercise, assume the initial cache and memory state in Figure 4.42. Each part of this exercise specifies a sequence of one or more CPU operations of the form: P#: [

A pump is needed for 10 years at a remote location. The pump can be driven by an electric motor if a power line is extended to the site. Otherwise, a gasoline engine will be used. Use an annual cash...

The owners' equity accounts for Hexagon International are shown here Common stock ($.50 par value) Capital surplus Retained earnings 35,000 320,000 708,120 Total owners' equity $1,063,120 a-1.If the...

Suppose Capital One is advertising a 6 0 - month, 5 . 7 6 % APR motorcycle loan. If you need to borrow $ 7 , 2 0 0 to purchase your dream Harley - Davidson, what will be your monthly payment? ( Note:...