
Question


4 Under-Parameterization and Over-Parameterization
In the previous section, we had more data points than features in our data, i.e., we were looking at N > 100. This tends to be the ideal situation: we need to find an unknown weight for each feature, and having more data points than unknowns gives us enough information to determine each weight (just as two data points are enough to determine the slope and intercept, the two unknowns, of a line).
Sometimes, however, we may have fewer data points than features, which makes it difficult to determine how the underlying model should depend on each feature: we simply don't have enough data. In the following problems, consider a training data set of size N = 50 and a test data set of size N = 50.
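For intuition, here is a minimal sketch (in Python/NumPy, using made-up synthetic data standing in for the course data set) of what goes wrong when there are fewer data points than features: ordinary least squares becomes underdetermined, so it can fit the training set exactly while telling us little about the true weights.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical synthetic data: 50 points, 101 features (more unknowns than equations).
N, d = 50, 101
X = rng.standard_normal((N, d))
w_true = rng.standard_normal(d)
y = X @ w_true + 0.1 * rng.standard_normal(N)

# lstsq returns the minimum-norm solution when the system is underdetermined.
w_hat, *_ = np.linalg.lstsq(X, y, rcond=None)

print("training MSE:", np.mean((X @ w_hat - y) ** 2))  # essentially zero
print("rank of X:", np.linalg.matrix_rank(X))          # 50 < 101: underdetermined
```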
Problem 8: Let A be a matrix of random values, with k rows and 101 columns, where each entry is sampled from an N(0,1) distribution. Note that for any input vector x, Ax will be a vector of k values. We could then consider performing linear regression on the data points (Ax, y) rather than (x, y). Note that if k ≤ 50, this transformed data set will have no more input features than we have data points, and thus we restore linear regression to working order.
Plot, over k from 1 to 50, the testing error when, for a given k, you pick a random A to transform the input vectors by, then do linear regression on the result. You'll need to repeat the experiment over a number of choices of A, for each k, to get a good plot. What do you notice? Does this seem to be a reasonable trend?
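One way this experiment could be set up (a sketch only: the data below is synthetic, and the helper name test_error_for_k is made up for illustration):

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)

# Assumed synthetic stand-in for the course data: 50 train / 50 test points, 101 features.
d = 101
X_train = rng.standard_normal((50, d))
X_test = rng.standard_normal((50, d))
w_true = rng.standard_normal(d)
y_train = X_train @ w_true + rng.standard_normal(50)
y_test = X_test @ w_true + rng.standard_normal(50)

def test_error_for_k(k, trials=100):
    """Average test MSE over random projections A with k rows."""
    errs = []
    for _ in range(trials):
        A = rng.standard_normal((k, d))   # random N(0,1) projection matrix
        Z_train = X_train @ A.T           # each row is Ax for one training point
        Z_test = X_test @ A.T
        w, *_ = np.linalg.lstsq(Z_train, y_train, rcond=None)
        errs.append(np.mean((Z_test @ w - y_test) ** 2))
    return np.mean(errs)

ks = range(1, 51)
plt.plot(list(ks), [test_error_for_k(k) for k in ks])
plt.xlabel("k (projected dimension)")
plt.ylabel("average test MSE")
plt.show()
```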
Problem 9: Notice that there's nothing stopping us from continuing to increase k. This puts us in a regime of over-parameterization (we have more features in our data than data points), and in fact increasing over-parameterization if we were bold enough to take k > 100. One possible solution, when performing linear regression on the transformed Ax data, is to do ridge regression, introducing the ridge penalty λ||w||² into the loss we are minimizing.
Continue the experiment for k = 50, 51, 52, ..., 200, plotting the resulting testing error (averaged over multiple choices of A). How did you choose a good value of λ? (Note that the number of weights we need to find changes with k; should this influence λ?) What do you notice?
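A sketch of how the ridge variant could be run, using the same assumed synthetic setup as above (λ = 1.0 is an arbitrary placeholder here; choosing it well is the point of the problem):

```python
import numpy as np
import matplotlib.pyplot as plt

rng = np.random.default_rng(0)

# Same assumed synthetic stand-in data as in the previous sketch.
d = 101
X_train, X_test = rng.standard_normal((50, d)), rng.standard_normal((50, d))
w_true = rng.standard_normal(d)
y_train = X_train @ w_true + rng.standard_normal(50)
y_test = X_test @ w_true + rng.standard_normal(50)

def ridge_fit(Z, y, lam):
    """Closed-form ridge regression: argmin_w ||Zw - y||^2 + lam * ||w||^2."""
    return np.linalg.solve(Z.T @ Z + lam * np.eye(Z.shape[1]), Z.T @ y)

def avg_test_error(k, lam, trials=50):
    """Test MSE averaged over random choices of the projection A."""
    errs = []
    for _ in range(trials):
        A = rng.standard_normal((k, d))
        w = ridge_fit(X_train @ A.T, y_train, lam)
        errs.append(np.mean((X_test @ A.T @ w - y_test) ** 2))
    return np.mean(errs)

ks = range(50, 201)
plt.plot(list(ks), [avg_test_error(k, lam=1.0) for k in ks])
plt.xlabel("k (projected dimension)")
plt.ylabel("average test MSE (ridge, lam = 1.0)")
plt.show()
```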
Bonus: Why does this happen?
