Question
2 Multiclass Naive Bayes with Bag of Words

A group of artists wish to use the Naive Bayes algorithm to classify a given artwork into three different categories given a text description of the painting. These descriptions are short and simple and have already been passed through a feature function, which returned the following key features based on the counts of certain words used to describe the painting. The three categories are related to the overall color scheme and are as follows: Warm, Cool, and Neutral. A set of these descriptions has been sampled, and each was classified by the artists based on its feature vector. The data collected so far is given in the table below:

a. (1 pt) What is the probability $\phi_y$ of each label $y \in \{\text{Warm}, \text{Neutral}, \text{Cool}\}$?

b. (3 pts) The parameter $\theta_{y,j}$ is the probability of a token $j$ appearing with label $y$. It is defined by the following equation, where $V$ is the size of the vocabulary set and $\text{count}(y,j)$ represents the frequency of word $j$ appearing with label $y$ over all data points:

$$\theta_{y,j} = \frac{\text{count}(y,j)}{\sum_{j'=1}^{V} \text{count}(y,j')}$$

The probability of a count of words $x$ and a label $y$ is defined as follows:

$$p(x, y; \phi, \theta) = p(y; \phi)\, p(x \mid y; \theta) = p(y; \phi) \prod_{j=1}^{V} \theta_{y,j}^{\,x_j}$$

Here, the words are the names of colors that appear in the text description of the artwork, and a word-count vector indicates the occurrence of each of the words in the text description for a given artwork. Find the most likely label $\hat{y}$ for the word-count vector $x = (0,1,0,1,1,0,0,1)$ using $\hat{y} = \arg\max_y \log p(x, y; \phi, \theta)$. Show the final log (base-10) probabilities for each label, rounded to 3 decimals. Treat $\log(0)$ as $-\infty$. (Hint: read more about binary multinomial Naive Bayes in Jurafsky & Martin, Chapter 4, as well as Hiroshi Shimodaira's note: https://www.inf.ed.ac.uk/teaching/courses/inf2b/learnnotes/inf2b-learn-note07-2up.pdf.)

c. (3 pts) When calculating $\arg\max_y$, if $\theta_{y,j} = 0$ for a label-word pair, the label $y$ is no longer considered. This is an issue, especially for smaller datasets where a feature may not be present in all documents for a certain label. One approach to mitigating this high variance is to smooth the probabilities. Using add-1 smoothing, which redefines $\theta_{y,j}$ as below, again find the most likely label $\hat{y}$ for the word-count vector $x = (0,1,0,1,1,0,0,1)$ using $\hat{y} = \arg\max_y \log p(x, y; \phi, \theta)$. Make sure to show the final log probabilities.

$$\text{add-1 smoothing:} \quad \theta_{y,j} = \frac{1 + \text{count}(y,j)}{V + \sum_{j'=1}^{V} \text{count}(y,j')}$$
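The counts table referenced above is missing from this copy of the question, so the final numbers cannot be reproduced here. As a minimal sketch of the computation parts (a)-(c) ask for, the Python below estimates $\phi_y$ and $\theta_{y,j}$, then scores $x$ under both the unsmoothed and add-1 smoothed models. The `counts` matrix and per-label document counts `n_docs` are hypothetical placeholders, not the question's data.

import numpy as np

# --- Hypothetical placeholder data -------------------------------------
# The question's actual counts table is not reproduced above, so these
# numbers are illustrative only. counts[y, j] = count(y, j), the frequency
# of word j appearing with label y over all data points; V = 8 to match
# the length of the query vector x.
labels = ["Warm", "Neutral", "Cool"]
counts = np.array([
    [2, 0, 3, 1, 0, 2, 1, 0],   # Warm    (placeholder)
    [1, 1, 0, 2, 1, 0, 0, 1],   # Neutral (placeholder)
    [0, 2, 1, 0, 3, 1, 2, 2],   # Cool    (placeholder)
], dtype=float)
n_docs = np.array([4.0, 3.0, 5.0])  # documents per label (placeholder)

V = counts.shape[1]
phi = n_docs / n_docs.sum()  # part (a): phi_y = p(y) for each label

def log_joint(x, k=0.0):
    """log10 p(x, y; phi, theta) for every label, with add-k smoothing
    (k = 0 gives the unsmoothed estimate of part (b), k = 1 part (c))."""
    theta = (counts + k) / (counts.sum(axis=1, keepdims=True) + k * V)
    with np.errstate(divide="ignore"):
        log_theta = np.log10(theta)  # log10(0) -> -inf, as the question asks
    # A word with x_j = 0 contributes theta^0 = 1 (log contribution 0)
    # even when theta = 0, so sum only over words that occur in x.
    mask = x > 0
    return np.log10(phi) + (x[mask] * log_theta[:, mask]).sum(axis=1)

x = np.array([0, 1, 0, 1, 1, 0, 0, 1], dtype=float)

for name, k in [("part (b), unsmoothed", 0.0), ("part (c), add-1", 1.0)]:
    lp = log_joint(x, k)
    print(f"{name}: "
          + ", ".join(f"{lab}={v:.3f}" for lab, v in zip(labels, lp))
          + f"  ->  y_hat = {labels[int(np.argmax(lp))]}")

Swapping the real table values into `counts` and `n_docs` yields the base-10 log probabilities the question asks for, rounded to 3 decimals by the `:.3f` format.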