Question

Posted on Sep 09, 2024

Need Gaussian naive Bayes Python code for the digits dataset

In this exercise, you are to implement only one of two possible classifiers (your choice): kNN or Naive Bayes. Note that you are not to use modules which provide these functions, since that would be too easy (no sklearn.naive_bayes, for example); create them yourselves instead. Students who implement both classifiers will be given bonus credit. Your implementation will be evaluated on classifier functionality (whether you correctly implement kNN or Naive Bayes for the data set) rather than efficiency. The data set to use is the digit recognition data set available from the sklearn module; the demonstration linked here should provide some guidance. You are expected to use Jupyter notebooks and Python on this assignment, but can ask for exceptions. Your goal is to use the first half of the data set to train your model and the last half for prediction.
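The first-half/last-half split described above could be sketched as follows. The helper name `split_first_last_half` and the toy data are hypothetical, chosen only for illustration; the function works on any sliceable sequence, so it should also accept the arrays returned by sklearn's digits loader.

```python
def split_first_last_half(features, labels):
    """Split samples in order: first half for training, last half for prediction."""
    if len(features) != len(labels):
        raise ValueError("features and labels must have the same length")
    mid = len(features) // 2  # with an odd count, the extra sample goes to the last half
    return features[:mid], labels[:mid], features[mid:], labels[mid:]


# Tiny stand-in data (hypothetical) so the sketch runs without sklearn.
toy_X = [[0.0, 1.0], [1.0, 0.0], [0.0, 2.0], [2.0, 0.0], [0.0, 3.0]]
toy_y = [0, 1, 0, 1, 0]
X_train, y_train, X_test, y_test = split_first_last_half(toy_X, toy_y)
print(len(X_train), len(X_test))
```

For the actual assignment, the same call would presumably be made on `digits.data` and `digits.target` after `digits = sklearn.datasets.load_digits()`; only the data loader comes from sklearn, not the classifier.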

b) Gaussian Naive Bayes

$x$ represents the image vector $(x_1, x_2, x_3, \ldots, x_{64})$.
$c_k$ represents class $k$, that is, one of the 10 digits for recognition.

Recall, we're looking for the highest $p(c_k \mid x)$ by using this fact:

$$p(c_k \mid x) = \frac{p(x \mid c_k)\, p(c_k)}{p(x)}$$

Let's step through the parts:

- $p(c_k)$ is simply the proportion of that class in the training data. E.g., if there are 20 fives out of 200 digits in the training sample, $p(\text{five}) = 20/200 = 0.1$.
- $p(x \mid c_k)$ is more complicated.
  - The main assumption of naive Bayes is that the features should be treated independently (which is why it's "naive"). This means $p(x \mid c_k) = p(x_1 \mid c_k)\, p(x_2 \mid c_k) \cdots p(x_{64} \mid c_k)$.
  - For each class $k$ in the training data:
    - Calculate the mean $\mu$ and variance $\sigma^2$ of each pixel location for that class.
    - Use that and the formula for a Gaussian probability to calculate $p(x_i \mid c_k)$:

$$g(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2}$$

- $p(x)$ is the normalization term. You don't need to calculate this, since you just want to pick the largest $p(c_k \mid x)$, and $p(x)$ is the same denominator in calculating $p(c_k \mid x)$ for every class.
  - However, if you want $p(c_k \mid x)$ to provide a true estimate of the probability, you can use the following formula to calculate $p(x)$:

$$p(x) = \sum_k p(x, c_k) = \sum_k p(x \mid c_k)\, p(c_k)$$

The predicted class is the one with the largest $p(c_k \mid x)$ for each image.

1. Report the overall accuracy of your prediction.
2. Show the classification matrix.
3. Note which errors are more common. In what way does that match your intuitions?
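A minimal sketch of the steps above, written in plain Python with only the standard library (the exercise grades functionality rather than efficiency). Two choices here are assumptions and not part of the exercise: the scores are computed as log-probabilities to avoid numerical underflow when 64 small densities are multiplied, and a small constant `VAR_SMOOTHING` is added to every variance because some pixels may have zero variance within a class. All function names and the toy data are hypothetical.

```python
import math
from collections import defaultdict

VAR_SMOOTHING = 1e-3  # assumption: not in the exercise; guards against zero variance


def fit_gaussian_nb(X, y, var_smoothing=VAR_SMOOTHING):
    """Return {class: (log_prior, means, variances)} estimated from training data."""
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    model = {}
    n_total = len(X)
    for label, rows in by_class.items():
        n = len(rows)
        n_features = len(rows[0])
        means = [sum(r[i] for r in rows) / n for i in range(n_features)]
        variances = [
            sum((r[i] - means[i]) ** 2 for r in rows) / n + var_smoothing
            for i in range(n_features)
        ]
        # log p(c_k): proportion of the class in the training data
        model[label] = (math.log(n / n_total), means, variances)
    return model


def log_joint(model, x):
    """Return log p(x | c_k) + log p(c_k) for every class k."""
    scores = {}
    for label, (log_prior, means, variances) in model.items():
        total = log_prior
        for xi, mu, var in zip(x, means, variances):
            # log of the Gaussian density g(x_i) with mean mu and variance var
            total += -0.5 * math.log(2 * math.pi * var) - (xi - mu) ** 2 / (2 * var)
        scores[label] = total
    return scores


def predict(model, x):
    """Pick the class with the largest score; p(x) is not needed for this."""
    scores = log_joint(model, x)
    return max(scores, key=scores.get)


def posterior(model, x):
    """Normalised p(c_k | x), dividing by p(x) = sum_k p(x | c_k) p(c_k)."""
    scores = log_joint(model, x)
    top = max(scores.values())  # subtract the max before exponentiating
    unnorm = {k: math.exp(s - top) for k, s in scores.items()}
    z = sum(unnorm.values())
    return {k: v / z for k, v in unnorm.items()}


def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def confusion_matrix(y_true, y_pred, n_classes):
    """Rows are true classes, columns are predicted classes."""
    matrix = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    return matrix


# Toy two-class data (hypothetical) standing in for the digits.
train_X = [[0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [5.0, 5.0], [5.0, 6.0], [6.0, 5.0]]
train_y = [0, 0, 0, 1, 1, 1]
model = fit_gaussian_nb(train_X, train_y)
test_X = [[0.5, 0.5], [5.5, 5.5]]
test_y = [0, 1]
pred = [predict(model, x) for x in test_X]
print(accuracy(test_y, pred))
print(confusion_matrix(test_y, pred, n_classes=2))
```

On the digits data, the same functions would be called with the first half of the samples for `fit_gaussian_nb`, the last half for `predict`, and `n_classes=10`. The value of `VAR_SMOOTHING` is a guess and likely needs tuning, since the result can be sensitive to it. The off-diagonal cells of the confusion matrix show which digit pairs are confused most often.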
