Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 07, 2024

lab test : Need gaussian naive byes python code for digits dataset In this exercise, you are to implement only one of two possible classifiers

lab test : Need gaussian naive byes python code for digits dataset

In this exercise, you are to implement only one of two possible classifiers (your choice). Note, you are not to use modules which provide these functions - that would be too easy (no sklearn.naive_bayes, for example) but rather create them yourselves. Students who implement the logic for writing code for both the classifiers will be given bonus credit. The performance of your classifier implementation will be evaluated for the classifier functionality (whether you correctly implement kNN or Nave Bayes for the dataset) rather than efficiency. The data set to use is the digit recognition data set available from the sklearn module; the demonstration linked here should provide some guidance. You are expected to use Jupyter notebooks and Python on this assignment, but can ask for exceptions. Your goal is to take the first half of the data set to train your model, and the last half is used for prediction.

image text in transcribed

b) Gaussian Naive Bayes x represents the image vector (x1,x2,x3,x64) ck represents class k= that is, one of the 10 digits for recognition Recall, we're looking for the highest p(ckx) by using this fact: p(ckx)=p(xck)p(ck)/p(x) Let's step through the parts: - p(ck) is simply the proportion of that class in the training data. E.g. if there are 20 fives out of 200 digits in the training sample p( five )=20/200=0.1 - p(xck) is more complicated - The main assumption of naive Bayes is that the features should be treated independently (which is why it's "naive"). This means p(xck)=p(x1ck)p(x2ck)p(x64ck) For each class, k, in the training data: - Calculate the mean and variance of each pixel location for that class - Use that and the formula for a gaussian probability to calculate p(xick) g(x)=21e21(x)2. - p(x) is the normalization term. You don't need to calculate this, since you just want to pick the largest p(ckx), and p(x) is the same denominator in calculating p(ckx) for every class. - However, if you want p(ckx) to provide a true estimate of the probability, you can use the following formula to calculate p(x) : p(x)=kp(x,ck)=kp(xck)p(ck) The predicted class is the largest p(ckx) for each image 1. Report the overall accuracy of your prediction. 2. Show the classification matrix. 3. Note which errors are more common. In what way does that match your intuitions

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Databases On The Web Designing And Programming For Network Access

Databases On The Web Designing And Programming For Network Access

Authors: Patricia Ju

1st Edition

1558515100, 978-1558515109

Students also viewed these Databases questions

Question

★★★★★

238/92U releases an average of 2.5 neutrons per fission compared to 2.9 for 239/94Pu. Pure samples of which of these two nuclei do you think would have the smaller critical mass? Explain.

Answered: 1 week ago

Question

★★★★★

Garden Glory is a partnership that provides gardening and yard maintenance services to individuals and organizations. Garden Glory is owned by two partners. They employ two office administrators and...

Answered: 1 week ago

Question

★★★★★

4. Use the results of part 2 to evaluate the probability of 7 out of 365 evenings result- ing in a loss of the total $1000 stake. The technique of simulating a process that contains random elements...

Answered: 1 week ago

Question

★★★★★

Pedro Bourbone is the founder and owner of a highly successful small business and, over the past several years, has accumulated a significant amount of personal wealth. His port-folio of stocks and...

Answered: 1 week ago

Question

★★★★★

Finance, or financial management, requires the knowledge and precise use of the language of the field. Match the terms relating to the basic terminology and concepts of the time value of money on the...

Answered: 1 week ago

Question

★★★★★

Holloway Company started operations on January 1, Year 1. During Year 1. Holloway earned $5,200 of service revenue and collected $4,420 cash from accounts receivable. Required Based on this...

Answered: 1 week ago

Question

★★★★★

2.36 Consider the following code: LDURB X10, [X11, #0] STUR X10, [X11, #8] Assume that the register X11 contains the address 0x10000000 and the data at address is 0x1122334455667788. 2.36.1 [5] What...

Answered: 1 week ago

Question

★★★★★

The GoT cups are a fast seller and you need to ensure that you have enough rolls of paper to fulfill demand. The first stage in the process is to determine the total cost of the current inventory...

Answered: 1 week ago

Question

★★★★★

Using the cognitive model and the BRUSO model discussed in class, write survey items for the following general questions. It may take more than one question. Responding to a survey item is itself a...

Answered: 1 week ago

Question

★★★★★

Read the following text carefully, then answer the question. Exciting New Gym Opening Soon!!! Northside Gym is opening soon, providing a new level of gym experience. The gym will meet all your...

Answered: 1 week ago

Question

★★★★★

Complete the self-scoring "Followership Questionnaire" in Ch. 13 (p. 487) of Leadership: Theory and Practice . DueThursday respond to the following: Based on your results from the "Followership...

Answered: 1 week ago

Question

★★★★★

Read the source of spotlight on the law 9.8 and compare their decisions over reasonable adjustment with more recent cases. Has the position changed as more decisions have been made at higher courts?

Answered: 1 week ago

Question

★★★★★

Annualised hours (see case study 7.1 and focus on research 7.1) appear to have considerable advantages for the employer. Read the article and book chapter on which these extracts are based and...

Answered: 1 week ago

Question

★★★★★

Specify which techniques of training are best suited to the following: Learning to drive a car Students needing a basic understanding of the business cycle Teaching teenagers about personal...

Answered: 1 week ago

Previous Question Next Question