Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Question 1 . According to the authors, what is the role of data mining in healthcare? Question 2 . What unique challenges are presented by
Question
According to the authors, what is the role of data mining in healthcare?
Question
What unique challenges are presented by data in healthcare? You may also want to take a look at
Future Directions section at the end of the paper.
Question
What is the CRISPDM methodology in data mining and how is it used in healthcare domain? See
Healthcare Data Mining Applications section.
Question
On Figure of Page we are shown the model that classifies whether a certain patient is diabetic or not. According to the authors, what is the advantage of using a decision tree model for
healthcare domain?
Note. The Figure has a typo; the root node should have N and D where D should
represent patients that are diabetic.
Question
According to the authors, what is the attribute that is considered the most important in classifying
whether a patient is diabetic or not? What is the second most important attribute? Can you guess
how these are inferred from Figure
Hint: Focus on the second column of Page and see how the authors are able to find which
nodes of the decision tree are important.
Question
According to the authors, what are the limitations of data mining applications in healthcare domain? In particular, comment on
What role does a data warehouse provide in data mining for healthcare?
the type of data problems present for healthcare.
limitations in the predictive modeling process for healthcare data.
importance of domain experts
scope of investment needed for applications
Question
After reading the paper, you propose to apply hierarchical clustering on the same dataset. The list
of attributes in the dataset include seven variables of particular interest: gender, age, body mass
index BMI waisthip ratio WHR smoking status, the number of times a patient exercises per
week, and onset of diabetes.
Explain why imputation is crucial for healthcare data. Why do you think using imputation
can be problematic no matter which method is chosen? Hint: You may want to use your
answers for Question to motivate your answer for this part.
Explain why scaling the data is necessary before applying any clustering algorithm.
Explain how clustering obtained from a hierarchical clustering and clusterings obtained from
a kmeans clustering can be qualitatively different.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started