Question
Problem 4. You have been hired as a data mining consultant for an insurance company. Your first task is to apply clustering to segment the
Problem 4. You have been hired as a data mining consultant for an insurance company. Your first task is
to apply clustering to segment the customers who are insured with the company. The customer data set
contains only 10 categorical attributes (gender, marital status, occupation, highest education level, etc).
1) You have seen how entropy is used in decision tree classifiers to split data points with categorical
attributes into homogeneous subgroups, so you decided to develop an entropy-based divisive
hierarchical clustering algorithm that can handle categorical attributes. The algorithm starts with 1
cluster containing all the points and chooses an attribute that gives the highest information gain to
partition the instances into its smaller clusters. It repeats this partitioning step on each cluster until
every cluster contains only a single point. You present the idea to your supervisor, who immediately
shoots it down. Explain why such an approach conceptually will not work.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started