The RIPPER algorithm (by Cohen [1]) is an extension of an earlier algorithm called IREP (by F¨urnkranz

Question:

The RIPPER algorithm (by Cohen [1]) is an extension of an earlier algorithm called IREP (by F¨urnkranz and Widmer [3]). Both algorithms apply the reduced-error pruning method to determine whether a rule needs to be pruned. The reduced error pruning method uses a validation set to estimate the generalization error of a classifier. Consider the following pair of rules:
R1: A ˆ’ †’ C
R2: A ˆ§ B ˆ’ †’ C
R2 is obtained by adding a new conjunct, B, to the left-hand side of R1. For this question, you will be asked to determine whether R2 is preferred over R1 from the perspectives of rule-growing and rule-pruning. To determine whether a rule should be pruned, IREP computes the following measure:

(a) Suppose R1 is covered by 350 positive examples and 150 negative examples, while R2 is covered by 300 positive examples and 50 negative examples. Compute the FOIL's information gain for the rule R2 with respect to R1.
(b) Consider a validation set that contains 500 positive examples and 500 negative examples. For R1, suppose the number of positive examples covered by the rule is 200, and the number of negative examples covered by the rule is 50. For R2, suppose the number of positive examples covered by the rule is 100 and the number of negative examples is 5. Compute vIREP for both rules. Which rule does IREP prefer?
(c) Compute vRIPPER for the previous problem. Which rule does RIPPER prefer?

Fantastic news! We've Found the answer you've been seeking!

Step by Step Answer:

Answer rating: 100% (8 reviews)

a For this problem p 0 350 n 0 150 p 1 300 and n 1 50 Therefore the FOIL...View the full answer

Answered By

Muhammad Ahtsham Shabbir

I am a professional freelance writer with more than 7 years’ experience in academic writing. I have a Bachelor`s Degree in Commerce and Master's Degree in Computer Science. I can provide my services in various subjects. I have professional excellent skills in Microsoft ® Office packages such as Microsoft ® Word, Microsoft ® Excel, and Microsoft ® PowerPoint. Moreover, I have excellent research skills and outstanding analytical and critical thinking skills; a combination that I apply in every paper I handle. I am conversant with the various citation styles, among them; APA, MLA, Chicago, Havard, and AMA. I also strive to deliver the best to my clients and in a timely manner.My work is always 100% original. I honestly understand the concern of plagiarism and its consequences. As such, I ensure that I check the assignment for any plagiarism before submission.

4.80+ 392+ Reviews 587+ Question Solved

Related Book For book-img-for-question

Introduction to Data Mining

ISBN: 978-0321321367

1st edition

Authors: Pang Ning Tan, Michael Steinbach, Vipin Kumar

See More Books

Question Posted: Apr 29, 2016 03:27 PM

See More Questions

The RIPPER algorithm (by Cohen [1]) is an extension of an earlier algorithm called IREP (by F¨urnkranz

Question:

p+(N – n) VIREP P+N

Step by Step Answer:

a For this problem p 0 350 n 0 150 p 1 300 and n 1 50 Therefore the FOIL...View the full answer

Introduction to Data Mining

Students also viewed these Statistics questions