The original association rule mining framework considers only presence of items together in the same transaction. There
Question:
In this problem, you are asked to extend the association rule framework to negative itemsets (i.e., itemsets that contain both presence and absence of items). We will use the negation symbol (¬) to refer to absence of items.
(a) A naive way for deriving negative itemsets is to extend each transaction to include absence of items as shown in Table 7.17.
i. Suppose the transaction database contains 1000 distinct items.
What is the total number of positive itemsets that can be generated from these items? (Note: A positive itemset does not contain any negated items).
Table 7.17. Example of numeric data set.
ii. What is the maximum number of frequent itemsets that can be generated from these transactions? (Assume that a frequent itemset may contain positive, negative, or both types of items)
iii. Explain why such a naive method of extending each transaction with negative items is not practical for deriving negative itemsets.
(b) Consider the database shown in Table 7.14. What are the support and
confidence values for the following negative association rules involving
regular and diet soda?
i. ¬Regular Diet.
ii. Regular ¬Diet.
iii. ¬Diet Regular.
iv. Diet ¬Regular.
Fantastic news! We've Found the answer you've been seeking!
Step by Step Answer:
Related Book For
Introduction to Data Mining
ISBN: 978-0321321367
1st edition
Authors: Pang Ning Tan, Michael Steinbach, Vipin Kumar
Question Posted: