Answered step by step
Verified Expert Solution
Question
1 Approved Answer
( 4 points ) Derive d e l J d e l W i j . ( 2 points ) Write d e l J
points Derive
points Write as an outer product of two vectors. is a matrix with the same dimen
sions as ; it is just like a gradient, except that and are matrices rather than vectors.
points Derive Problem
You want to train a neural network to recognize the age and sex based on the face image. Your
training data consists of grayscale pixel images. The training labels include the age and sex
of each person. Your neural network consists of an input layer with units, a hidden
layer with units, and an output layer with units. You use the ReLU activation function for
the hidden units and no activation function for the outputs or inputs You train your network
with the cost function Please use the following notations to answer questions:
is a training image input vector with a component appended to the end as a bias is
a training label input vector, and is the output vector. All vectors are column vectors.
max is the ReLU activation function, is its derivative if other
wise and is applied componentwise to a vector.
is the vector of hidden unit values before the ReLU activation functions are applied, and
is the vector of hidden unit values after they are applied but we append a com
ponent to the end of as the bias term
is the weight matrix mapping the input layer to the hidden layer;
is the weight matrix mapping the hidden layer to the output layer;
Questions:
points Calculate the number of parameters weights in this network. You can leave your
answer as an expression.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started