Answered step by step
Verified Expert Solution
Link Copied!
Question
1 Approved Answer

4.3 (Matrix calculus) The optimization problem we posed for A R in 4.1.4 is an example of a problem where the unknown is a matrix

4.3 ("Matrix calculus") The optimization problem we posed for A R in 4.1.4 is an example of a problem where the unknown is a matrix rather than a vector. These prob- lems appear frequently in machine learning and have inspired an alternative notation for differential calculus better suited to calculations of this sort. (a) Suppose f: Rnxm R is a smooth function. Justify why the gradient of f can be thought of as an n x m matrix. We will use the notation to notate the gradient of f(A) with respect to A. af JA (b) Take the gradient /A of the following functions, assuming and y are constant vectors: (i) Ay (ii) x A Ax T (iii) (x - Ay) W(- Ay) for a constant, symmetric matrix W (i) (ii) (c) Now, suppose X e Rmxn is a smooth function of a scalar variable X(t) : R Rmxn. We can notate the differential X = X'(t). For matrix functions X(t) and Y(t), justify the following identities: (X+Y) = ax + ay (XT) = (2x)T (iii) a(XY)= (X)Y + X(OY) (iv) (X) = X(@X)X- (see Exercise 1.13) After establishing a dictionary of identities like the ones above, taking the derivatives of functions involving matrices becomes a far less cumbersome task. See [99] for a comprehensive reference of identities and formulas in matrix calculus.
image text in transcribed
3 ("Matrix calculus") The optimization problem we posed for AR22 in 4.1.4 is an example of a problem where the unknown is a matrix rather than a vector. These problems appear frequently in machine learning and have inspired an alternative notation for differential calculus better suited to calculations of this sort. (a) Suppose f:RnmR is a smooth function. Justify why the gradient of f can be thought of as an nm matrix. We will use the notation Af to notate the gradient of f(A) with respect to A. (b) Take the gradient /A of the following functions, assuming x and y are constant vectors: (i) xAy (ii) xAAx (iii) (xAy)W(xAy) for a constant, symmetric matrix W (c) Now, suppose XRmn is a smooth function of a scalar variable X(t):R Rmn. We can notate the differential XX(t). For matrix functions X(t) and Y(t), justify the following identities: (i) (X+Y)=X+Y (ii) (X)=(X) (iii) (XY)=(X)Y+X(Y) (iv) (X1)=X1(X)X1 (see Exercise 1.13) After establishing a dictionary of identities like the ones above, taking the derivatives of functions involving matrices becomes a far less cumbersome task. See [99] for a comprehensive reference of identities and formulas in matrix calculus

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image
Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Modeling And Designing Accounting Systems Using Access To Build A Database

Authors: Laura R. Ingraham, C. Janie Chang

1st Edition

0471450871, 978-0471450870

More Books

Students explore these related Accounting questions

Question

How flying airoplane?

Answered: 3 weeks ago