Question

Posted on Feb 08, 2024

Complete the function compute_using_pthreads_v1() to parallelize the SAXPY loop using the "chunking" method. That is, given vectors x and y of some arbitrary length n, and when k threads are used, individual threads calculate SAXPY in parallel on smaller chunks of these vectors.

This assignment, worth twenty points, is due February 1, 2024, by 11:59 pm. You may work on it in a group of up to two people. Please submit original code.

You are asked to compare two different ways of parallelizing the SAXPY loop using the pthread library. SAXPY is a function within the standard Basic Linear Algebra Subprograms (BLAS) library and stands for "Single-Precision A·X Plus Y." The implementation is very simple, involving a combination of scalar multiplication and vector addition. The routine takes as inputs two vectors of 32-bit floating-point values, x and y, with n elements each, and a scalar value a. It multiplies each element x[i] by a and adds the result to y[i]. The serial implementation looks like this:

    void saxpy(float *x, float *y, float a, int n)
    {
        int i;
        for (i = 0; i < n; i++)
            y[i] = a * x[i] + y[i];
    }

Using the provided sequential program saxpy.c as a starting point, develop two versions that parallelize SAXPY.

Complete the function compute_using_pthreads_v1() to parallelize the SAXPY loop using the "chunking" method. That is, given vectors x and y of some arbitrary length n, and when k threads are used, individual threads calculate SAXPY in parallel on smaller chunks of these vectors.

Complete the function compute_using_pthreads_v2() to parallelize the SAXPY loop using the "striding" method. That is, each thread strides over elements of the vectors with some stride length, calculating SAXPY along the way. For example, given k = 4 threads, thread 0 calculates SAXPY for elements y[0], y[4], y[8], ..., thread 1 calculates SAXPY for elements y[1], y[5], y[9], ..., and so on. The pseudo-code for this method looks like this for each thread:

    /* tid is the thread ID and k is the number of threads created */
    int stride = k;
    while (tid < n) {
        y[tid] = a * x[tid] + y[tid];
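The two partitioning schemes described in the question can be sketched as follows. This is only an illustration, not the contents of the provided saxpy.c: the struct thread_args_t, the helper run_saxpy, and the function names saxpy_chunked and saxpy_strided (along with their signatures and return codes) are assumptions made for this example. The chunk size is taken as ceil(n / k), so the last thread may receive a shorter chunk, or none at all when k exceeds n.

```c
#include <assert.h>
#include <pthread.h>
#include <stdlib.h>

/* Hypothetical per-thread argument block (not from the provided saxpy.c). */
typedef struct {
    int tid;     /* thread ID, 0 .. k-1 */
    int k;       /* number of threads */
    int n;       /* vector length */
    float a;     /* scalar multiplier */
    float *x;    /* input vector */
    float *y;    /* input/output vector */
} thread_args_t;

/* Chunking: thread tid updates the contiguous range [start, end). */
static void *chunk_worker(void *arg)
{
    thread_args_t *t = (thread_args_t *)arg;
    int chunk = (t->n + t->k - 1) / t->k;   /* ceil(n / k) */
    int start = t->tid * chunk;
    int end = start + chunk;
    if (end > t->n)
        end = t->n;                         /* clamp the final chunk */
    for (int i = start; i < end; i++)
        t->y[i] = t->a * t->x[i] + t->y[i];
    return NULL;
}

/* Striding: thread tid updates elements tid, tid + k, tid + 2k, ... */
static void *stride_worker(void *arg)
{
    thread_args_t *t = (thread_args_t *)arg;
    for (int i = t->tid; i < t->n; i += t->k)
        t->y[i] = t->a * t->x[i] + t->y[i];
    return NULL;
}

/* Create k threads running `worker`, join them, return 0 on success. */
static int run_saxpy(void *(*worker)(void *), float *x, float *y,
                     float a, int n, int k)
{
    assert(k > 0 && n >= 0);
    pthread_t *threads = malloc((size_t)k * sizeof *threads);
    thread_args_t *args = malloc((size_t)k * sizeof *args);
    if (threads == NULL || args == NULL) {
        free(threads);
        free(args);
        return -1;
    }

    int created = 0;
    for (int i = 0; i < k; i++) {
        args[i] = (thread_args_t){ i, k, n, a, x, y };
        if (pthread_create(&threads[i], NULL, worker, &args[i]) != 0)
            break;                          /* stop launching on failure */
        created++;
    }
    for (int i = 0; i < created; i++)
        pthread_join(threads[i], NULL);

    free(threads);
    free(args);
    return created == k ? 0 : -1;           /* -1: y may be partly updated */
}

/* Assumed stand-ins for the bodies of the v1 and v2 functions. */
int saxpy_chunked(float *x, float *y, float a, int n, int k)
{
    return run_saxpy(chunk_worker, x, y, a, n, k);
}

int saxpy_strided(float *x, float *y, float a, int n, int k)
{
    return run_saxpy(stride_worker, x, y, a, n, k);
}
```

Compile with the -pthread flag (for example, gcc -O2 -pthread). Because every index is written by exactly one thread in both schemes, no locking is needed. The two versions touch the same elements overall and differ only in memory access pattern: with chunking each thread walks a contiguous block, while with striding neighboring threads write adjacent elements of y, which can cause cache-line sharing between cores. That difference is presumably what the timing comparison in the assignment is meant to expose.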
