Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Question 3 . Consider a neural network model with one hidden layer and the following properties: The size of the input is 2 . The

Question 3. Consider a neural network model with one hidden layer and the following
properties:
The size of the input is 2.
The size of hidden layer is 3 with each node has activation function Sigmoid:
S(x)=11+e-x.
The output has size 1 with activation function tanh(x).
(a) Draw a computational graph for this neural network.
(b) The binary cross-entropy loss function for a label vector y and the prediction vector
hat(y) is defined as
L(y,hat(y))=-1li=1l[yi*log2(hat(y)i)+(1-yi)*log2(1-hat(y)i)]
Note that hat(y)i is the output of the neural network for input xi.
Calculate the gradient of L with respect to the weights of the first layer W(1). You
can use a matrix notation.
image text in transcribed

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Logics For Databases And Information Systems

Authors: Jan Chomicki ,Gunter Saake

1st Edition

1461375827, 978-1461375821

More Books

Students also viewed these Databases questions

Question

You have

Answered: 1 week ago