Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 22, 2024

Please help with this for deep learning. CODE IN PHYTON. Problem 3 : For a given number of parameters P , let m P (

Please help with this for deep learning. CODE IN PHYTON.

Problem

3

: For a given number of parameters

P,

let

m_{P} (k)

be the number of nodes per layer such that Params

(k, m) = P (

or as close to

P

as possible

) .

A network with

k

layers and

m_{P} (k)

nodes per layer should therefore have approximately

P

total parameters. For such a network, we can define trainLoss

(k, P)

and testLoss

(k, P),

for each

k

from

1

k_{P} .

Identify

10

values of

P,

sufficiently distinct so that the resulting network shapes and scale. How did you make your choice? Use the linear softmax model as a baseline.

Note, because of the integer

/

rounding issues, your

P

values should be distinct enough so that you don't accidentally create networks with the same

m

and

k

for two distinct

P

values.

Plot

(

overlaying the curves for each

P)

trainLoss

(k, P),

with the

x -

axis as

\frac{k}{k_{P}}

from

0

1 .

Note, you probably do not want to test every possible

k

value, due to time constraints. But test enough

k

values so that the trend is clear.

Plot

(

overlaying the curves for each

P)

test Los

(k, P),

with the

x -

axis as

\frac{k}{k_{P}}

from

0

1 .

How do the results compare to the baseline performance of the linear softmax model?

What do you notice about the underlying trends? Is there a point where layers become too narrow to be useful, and if so

,

where is it

?

What seems to be the sweet spot, if any, for network shape? How does it depend on

P ?

Create a plot showing

(

overlaying the curves for each

P)

total training time in terms of passes through the data. Set the

x -

axis to go over

\frac{k}{k_{P}}

for ease of comparison. Does this change your assessment of the network shape tradeoff at all?

Bonus:

Is total parameters

P

a fair comparison point? Try to find a better one. Justify it

.

Does introducing regularization

(

weight decay

)

or normalization layers help?

Problem

4

: For a

P

of your choice and the optimal network shape as determined above

-

try to find an even better network shape

(

layers of unequal size, for instance

)

that gives better results for the same

(

approximate

)

total number of parameters. Is it better to have uniform layers? Layers of decreasing size? Increasing size? Experiment with it

,

and summarize your results. You may want to save the best model you find.

Bonus: Does regularization help?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Privacy In Statistical Databases International Conference Psd 2022 Paris France September 21 23 2022 Proceedings Lncs 13463

Authors: Josep Domingo-Ferrer ,Maryline Laurent

1st Edition

3031139445, 978-3031139444

More Books

Students also viewed these Databases questions

Question

★★★★★

Where could a researcher working for the U.S. Marine Corps (www.marines.com/home) find information that would identify the most productive areas of the U.S. in which to recruit? What would you...

Answered: 1 week ago

Question

★★★★★

______ are dollar denominated claims issued by U.S. banks representing ownership of shares of a foreign companys stock held on deposit by the U.S. bank in the issuing firms home country. Foreign...

Answered: 1 week ago

Question

★★★★★

Which philosopher proposed that nerve pathways allowed for reflexes? a. Socrates b. Ren Descartes c. John Locke

Answered: 1 week ago

Question

★★★★★

Business transactions completed by Hannah Venedict during the month of September are as follows: a. Venedict invested $90,000 cash along with office equipment valued at $21,000 in exchange for common...

Answered: 1 week ago

Question

★★★★★

Please help with this for deep learning. CODE IN PHYTON. Problem 3 : For a given number of parameters P , let m P ( k ) be the number of nodes per layer such that Params ( k , m ) = P ( or as close...

Answered: 1 week ago

Question

★★★★★

(The following information applies to the questions displayed below.] Kirkland Theater sells season tickets for six events at a price of $189. For the 2016 season, 1,200 season tickets were sold....

Answered: 1 week ago

Question

★★★★★

In the Deacon process for the manufacture of C12, a dry mixture of HCl and air is passed over a heated catalyst that promotes the oxidation of HCl to Cl2. The Deacon process can be reversed and HCl...

Answered: 1 week ago

Question

★★★★★

day & date time kWh reading kWh used hours elapsed avg. kW used day & date time kWh reading kWh used hours elapsed avg. kW used day & date time kWh reading kWh used hours elapsed avg. kW used Day 1...

Answered: 1 week ago

Question

★★★★★

X P(x) 0 0.2 1 0.25 2 0.3 3 0.25 Find the standard deviation of this probability distribution. Give your answer to at least 2 decimal places

Answered: 1 week ago

Question

★★★★★

QUESTION 9 Consider the following production process. The process consists of 35 steps and operates at a cycle time of 1.19 min/unit. Their most error prone operation is step 17. Units are inspected...

Answered: 1 week ago

Question

★★★★★

3. A hydrogen electron absorbs energy and travels from n= 3 to n= 5. What is the energy of the transition and what is its corresponding frequency

Answered: 1 week ago

Question

★★★★★

3 The Amish forgive those involved in conflict with them because of their deeply held religious beliefs. Are there other reasons for forgiveness that can be useful in a conflict?

Answered: 1 week ago

Question

★★★★★

4 What is your reaction to the information on Amish forgiveness in light of what we learned about Amish shunning earlier in the chapter?

Answered: 1 week ago

Question

★★★★★

Think of an attitude you have about conflict that is making it difficult for you to talk productively about disagreements with someone in your life. For example, do you believe that discussing...

Answered: 1 week ago

Previous Question Next Question