Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Problem #4 and #5 use the following features to learn the concept Good Movie, which predicts whether a movie is good or not based on

image text in transcribedimage text in transcribed

Problem #4 and #5 use the following features to learn the concept Good Movie, which predicts whether a movie is good or not based on several attributes: Movie's budget = { Low, Medium, High } Genre = {Documentary, Drama, Comedy } Famous actors = {No, Yes } Director = {Unknown, Great } The training data for Good Movie is: # Budget 1 Low 2 Low 3 Medium 4 High 5 High 6 High 7 Medium 8 Medium 9 Low 10 Low 11 Low 12 High 13 Low 14 Medium 15 Medium 16 High Genre Documentary Documentary Documentary Drama Comedy Comedy Comedy Documentary Comedy Drama Comedy Drama Drama Drama Documentary Drama Famous Actors Director Yes Unknown Yes Great Yes Unknown Yes Unknown No Unknown No Great No Great No Unknown Yes Great Yes Unknown No Unknown No Unknown No Great Yes Great No Unknown Yes Great Good Movie No No Yes Yes No Yes No No Yes No No Yes Yes Yes Yes Yes Problem #5 [20 points] Based on the GrowTree and BestSplit-Class algorithms and using the entropy impurity function, create a decision tree to learn the Good Movie concept. Show how each node is decided (based on comparing impurity measures), then draw the full decision tree. If there are ties in impurity measures, give higher priority to attributes and values according to their order on page 1: that is, Budget is the highest priority feature and Director is the lowest priority; within Budget, Low is the highest priority value, followed by Medium and then High. [Normally these might be randomly chosen, but we'll use this "inductive bias."] If there are ties to determine a leaf's label, give priority to Yes. Now apply this learned concept (decision tree) to the three test data sets posted on the Assignments page, and list the correctly and incorrectly classified examples (by number), for each test data set. What is the error rate of the decision tree on the training data? On each test data set? Problem #4 and #5 use the following features to learn the concept Good Movie, which predicts whether a movie is good or not based on several attributes: Movie's budget = { Low, Medium, High } Genre = {Documentary, Drama, Comedy } Famous actors = {No, Yes } Director = {Unknown, Great } The training data for Good Movie is: # Budget 1 Low 2 Low 3 Medium 4 High 5 High 6 High 7 Medium 8 Medium 9 Low 10 Low 11 Low 12 High 13 Low 14 Medium 15 Medium 16 High Genre Documentary Documentary Documentary Drama Comedy Comedy Comedy Documentary Comedy Drama Comedy Drama Drama Drama Documentary Drama Famous Actors Director Yes Unknown Yes Great Yes Unknown Yes Unknown No Unknown No Great No Great No Unknown Yes Great Yes Unknown No Unknown No Unknown No Great Yes Great No Unknown Yes Great Good Movie No No Yes Yes No Yes No No Yes No No Yes Yes Yes Yes Yes Problem #5 [20 points] Based on the GrowTree and BestSplit-Class algorithms and using the entropy impurity function, create a decision tree to learn the Good Movie concept. Show how each node is decided (based on comparing impurity measures), then draw the full decision tree. If there are ties in impurity measures, give higher priority to attributes and values according to their order on page 1: that is, Budget is the highest priority feature and Director is the lowest priority; within Budget, Low is the highest priority value, followed by Medium and then High. [Normally these might be randomly chosen, but we'll use this "inductive bias."] If there are ties to determine a leaf's label, give priority to Yes. Now apply this learned concept (decision tree) to the three test data sets posted on the Assignments page, and list the correctly and incorrectly classified examples (by number), for each test data set. What is the error rate of the decision tree on the training data? On each test data set

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Creating A Database In Filemaker Pro Visual QuickProject Guide

Authors: Steven A. Schwartz

1st Edition

0321321219, 978-0321321213

More Books

Students also viewed these Databases questions