Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Aug 30, 2024

Your task: Recall that Naive Bayes assumes all xi s are conditionally independent given y . However, in this assignment, you will be building a

"Your task:

Recall that Naive Bayes assumes all xi

s are conditionally independent given y

.

However, in this assignment, you will be building a not

-

so

-

Naive Bayes model where now xi

s are correlated. In particular, we will only consider two features x

1

and x

2,

where x

1

and x

2

are not conditionally independent given y

.

We assume f

(

x

1,

x

2 |

y

=

k

)

is a bi

-

variate Gaussian

1

distribution N

(\

mu

1

k

, \

mu

2

k

, \

sigma

1, \

sigma

2, \

rho

),

where

\

mu

1

k and

\

mu

2

k are means of x

1

and x

2

given y

=

k

, \

sigma

1

and

\

sigma

2

are standard deviations of x

1

and x

2,

and

\

rho is the correlation between x

1

and x

2 .

Notethat

\

mu

1

k and

\

mu

2

k dependonthevaluekofy,but

\

sigma

1, \

sigma

2,

and

\

rho donotdependony. Also recall that the density of a bi

-

variate Gaussian distribution, given

(\

mu

1

k

, \

mu

2

k

, \

sigma

1, \

sigma

2, \

rho

),

is:

\

sigma

2 (

x

1 \

mu

1

k

) 2 + \

sigma

12 (

x

2 \

mu

2

k

) 2 2 \

rho

\

sigma

1 \

sigma

2 (

x

1 \

mu

1

k

) (

x

2 \

mu

2

k

) 2 (1 \

rho

2) \

sigma

2 \

sigma

2

1 2

f

(

x

1,

x

2 |

y

=

k

)

P

(

y

=

k

)

P

(

y

=

k

|

x

1,

x

2) =

Pi f

(

x

1,

x

2 |

y

=

i

)

P

(

y

=

i

)

Build a not

-

so

-

Naive

-

Bayes model that uses x

1

and x

2

as features to predict y

.

Use train data to build your model and make predictions for the test set. When making predictions use the optimal Bayes classifier idea, i

.

e

.,

if P

(

y

= 1 |

x

1,

x

2) > = 0.5,

classify the test instance as

1,

else classify the test instance as

0 .

Finally, report the accuracy of your model in the TEST set.

Function Parameters:

train: This is the training data set used to train your model. Particularly, use train set to determine the necessary parameters of the model

\

mu

1

k

, \

mu

2

k

, \

sigma

1, \

sigma

2, \

rho and P

(

y

=

i

) .

DO NOT use the training set to make predictions and compute the accuracy of your model as you should do it using the test set. train should be a pandas dataframe, that has three columns x

1,

x

2,

y

.

Please note that you should use the entire train data to build your model, in other words, you DON

T need to further split the training data to create another test set. You can find a sample training set in HW

5

traindata.csv

.

test: This is the test data set used to make predictions and test the performance of your model. Please note that test data set should not be used during the training of the model and only be used to make predictions and compute the accuracy. test is also a pandas dataframe, that has three columns x

1,

x

2,

y

.

A sample test data set can be found in HW

5

testdata.csv

Output: Your function should return class predictions for the test set and overall accu

-

racy of your model in the test set. The type of predictions should be a numpy array and the type of the reported accuracy should be a numpy float" solve it without using import multivariable

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Relational Database And SQL

Relational Database And SQL

Authors: Lucy Scott

3rd Edition

1087899699, 978-1087899695

More Books

Students also viewed these Databases questions

Question

★★★★★

In March, Kelly Company had the following unit production costs: materials $10 and conversion costs $8. On March 1, it had zero work in process. During March, Kelly transferred out 22,000 units. As...

Answered: 1 week ago

Question

★★★★★

The income elasticity of demand for automobiles in the United States was estimated by a government agency to be between 2.5 and 3.9. a. What does this mean? b. If incomes rise by 10 percent, what...

Answered: 1 week ago

Question

★★★★★

=+ (d) Suppose that ? is uncountable, let , consist of the countable and the cocountable sets, and take P( A) to be 0 or 1 as A is countable or cocountable.

Answered: 1 week ago

Question

★★★★★

Solve these various time value of money scenarios. 1. Suppose you invest a sum of $ 3,500 in an interest- bearing account at the rate of 10% per year. What will the investment be worth six years from...

Answered: 1 week ago

Question

★★★★★

Prepare journal entries to record transactions a through h. a. Raw materials purchased on credit, $82,000. b. Direct materials used, $37,500. Indirect materials used, $17,200. c. Direct labor used,...

Answered: 1 week ago

Question

★★★★★

Sgt. Barnes is a veteran police officer filing as Head of Household. Currently, her taxable income is $50,000, but has been asked to sit for the Leutenant's exam. She is excited about the opportunity...

Answered: 1 week ago

Question

★★★★★

The option writer is the Multiple Choice individual who obtains the rights. seller of an option O buyer of an option. underlying asset of the option An arbitrageur is someone who Multiple Choice...

Answered: 1 week ago

Question

★★★★★

A thyroid cancer patient ingests a single dose of radioactive iodine-131 to kill the cancer cells. Iodine-131 has an effective half-life of approximately 7.71 days--some is lost to radioactive decay,...

Answered: 1 week ago

Question

★★★★★

Question 9 3 pts Currently, the Bright Ltd is an all-equity company. Earnings before interest and taxes (EBIT) for the company is expected to be $73,084 forever, and the cost of capital is currently...

Answered: 1 week ago

Question

★★★★★

A state fisheries commission wants to estimate the number of bass caught in a given lake during a season in order to restock the lake with the appropriate number of young fish. The commission could...

Answered: 1 week ago

Question

★★★★★

QUESTION 4 The following Trial Balance was extracted from the books of Mega Enterprise as at 31 December 2020. Particulars Debit (RM) Credit (RM) Capital 215,000 Duty on purchases 2,300 Insurance...

Answered: 1 week ago

Question

★★★★★

As the safety manager on a construction site, you have many roles. At the site, we are working we must approve the trenching and excavation plans for the subcontractors performing the work. Looking...

Answered: 1 week ago

Question

★★★★★

6. On June 12 every year, some U.S. Americans celebrate Loving Day to commemorate: a. Your legal right to love someone of another race b. Your legal right to love someone of the same sex c. Your...

Answered: 1 week ago

Question

★★★★★

4. Which of the following is not the name of a Native American tribe? a. Seminole b. Apache c. Arapaho d. Illini

Answered: 1 week ago

Question

★★★★★

8. Sometimes viewed as a Scandinavian tortilla, these potato flatcakes are often sold in areas with high Scandinavian American populations: a. Lefse b. Lutefisk c. Aquavit d. Fiskepudding

Answered: 1 week ago

Previous Question Next Question