Question

1 Approved Answer

Posted on Sep 22, 2024

Ex 5.4: Activation and weight scaling. Consider the two hidden unit network shown in Figure 5.62, which uses ReLU activation functions and has no additive bias parameters. Your task is to find a set of weights that will fit the function

y = |x_{1} + 1.1 x_{2}|.

Can you guess a set of weights that will fit this function?

Starting with the weights shown in column b, compute the activations for the hidden and final units as well as the regression loss for the nine input values (x_{1}, x_{2}) \in \{-1, 0, 1\} \times \{-1, 0, 1\}.

Now compute the gradients of the squared loss with respect to all six weights using the backpropagation chain rule equations (5.65-5.68) and sum them up across the training samples to get a final gradient.

What step size should you take in the gradient direction, and what would your updated squared loss become?

Repeat this exercise for the initial weights in column ( ) of Figure 5.62.

Given this new set of weights, how much worse is your error decrease, and how many iterations would you expect it to take to achieve a reasonable solution?
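The computations the exercise asks for can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the weights from Figure 5.62 are not reproduced on this page, so `W0` and `v0` below are hypothetical placeholders, the output unit is assumed to be linear, and the loss is assumed to be half the summed squared error. The guess `W_exact`, `v_exact` relies on the identity |z| = ReLU(z) + ReLU(-z).

```python
from itertools import product

def relu(z):
    return max(0.0, z)

def target(x):
    # The function to fit: y = |x1 + 1.1 x2|
    return abs(x[0] + 1.1 * x[1])

def forward(W, v, x):
    # W: 2x2 hidden weights (one row per hidden unit), v: 2 output weights.
    s = [W[j][0] * x[0] + W[j][1] * x[1] for j in range(2)]  # pre-activations
    h = [relu(sj) for sj in s]                               # hidden activations
    y = v[0] * h[0] + v[1] * h[1]                            # linear output (assumption)
    return s, h, y

def loss_and_grad(W, v, inputs):
    # Loss is 0.5 * sum of squared errors (assumed convention);
    # gradients are summed over all training samples.
    loss = 0.0
    gW = [[0.0, 0.0], [0.0, 0.0]]
    gv = [0.0, 0.0]
    for x in inputs:
        s, h, y = forward(W, v, x)
        e = y - target(x)
        loss += 0.5 * e * e
        for j in range(2):
            gv[j] += e * h[j]
            active = 1.0 if s[j] > 0 else 0.0  # ReLU derivative (0 at the kink)
            for i in range(2):
                gW[j][i] += e * v[j] * active * x[i]
    return loss, gW, gv

def apply_step(W, v, gW, gv, alpha):
    # One gradient descent update with step size alpha.
    W_new = [[W[j][i] - alpha * gW[j][i] for i in range(2)] for j in range(2)]
    v_new = [v[j] - alpha * gv[j] for j in range(2)]
    return W_new, v_new

inputs = list(product([-1.0, 0.0, 1.0], repeat=2))  # the nine training inputs

# A guessed exact fit: one hidden unit per sign of x1 + 1.1 x2.
W_exact = [[1.0, 1.1], [-1.0, -1.1]]
v_exact = [1.0, 1.0]
loss_exact, _, _ = loss_and_grad(W_exact, v_exact, inputs)

# Hypothetical starting weights (NOT the values from Figure 5.62).
W0 = [[1.0, 1.0], [-1.0, -1.0]]
v0 = [1.0, 1.0]
loss0, gW0, gv0 = loss_and_grad(W0, v0, inputs)

# Crude step-size search: try a few candidates and keep the best one.
candidates = [0.01, 0.03, 0.1, 0.3]
trial = {a: loss_and_grad(*apply_step(W0, v0, gW0, gv0, a), inputs)[0]
         for a in candidates}
best_alpha = min(trial, key=trial.get)

print("loss at guessed weights:", loss_exact)
print("loss at start:", loss0)
print("summed gradient (hidden, output):", gW0, gv0)
print("best candidate step:", best_alpha, "loss after step:", trial[best_alpha])
```

Substituting the actual column weights for `W0` and `v0` should give the quantities the exercise asks for. The candidate search stands in for a proper line search and is only one possible way to pick a step size.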

Figure 5.63 Function optimization: ( ) the contour plot of f(x, y) = x^{2} + 20 y^{2}, with the function being minimized at (0, 0); ( ) ideal gradient descent optimization that quickly converges towards the minimum at x = 0, y = 0.

Would batch normalization help in this case?
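To get a feel for why scaling matters, the following sketch runs fixed-step gradient descent on the function from Figure 5.63, f(x, y) = x^2 + 20 y^2. The starting point, step sizes, and iteration count are illustrative assumptions. The rescaled variant, which substitutes v = sqrt(20) y, is only an analogy for what normalization aims at; it is not batch normalization itself.

```python
def f(p):
    return p[0] ** 2 + 20.0 * p[1] ** 2

def grad_f(p):
    return (2.0 * p[0], 40.0 * p[1])

def g(p):
    # Same bowl after rescaling the second axis: v = sqrt(20) * y.
    return p[0] ** 2 + p[1] ** 2

def grad_g(p):
    return (2.0 * p[0], 2.0 * p[1])

def grad_descent(grad, p, alpha, steps):
    # Plain gradient descent with a fixed step size alpha.
    for _ in range(steps):
        d = grad(p)
        p = tuple(pi - alpha * di for pi, di in zip(p, d))
    return p

start = (1.0, 1.0)  # illustrative starting point

# The steep y direction limits the step: each iteration multiplies y by
# (1 - 40 * alpha), so the iteration is stable only for alpha < 0.05.
p_small = grad_descent(grad_f, start, 0.04, 50)  # stable, but x shrinks slowly
p_large = grad_descent(grad_f, start, 0.06, 50)  # y oscillates and diverges

# With equal curvature in both directions, alpha = 0.5 reaches the
# minimum in a single step.
p_scaled = grad_descent(grad_g, (1.0, 20.0 ** 0.5), 0.5, 1)

print("alpha=0.04:", f(p_small))
print("alpha=0.06:", f(p_large))
print("rescaled, one step:", g(p_scaled))
```

The mismatch between the curvatures along x and y is what forces the small step. A similar mismatch between weight scales in the network exercise is the reason the question asks about batch normalization.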


