Answered step by step
Verified Expert Solution
Question
1 Approved Answer
(a) Consider the following unlabelled dataset of patients no ID Fever Dry Cough Runny Nose Shortness of breath 1 yes yes no no 2 yes
(a) Consider the following unlabelled dataset of patients no ID Fever Dry Cough Runny Nose Shortness of breath 1 yes yes no no 2 yes yes no yes 3 yes yes no yes 4 yes no no 5 yes yes no no 6 yes yes yes no 7 yes 8 yes no 9 yes yes no 10 yes yes yes no no no no no no yes Table 2: The unlabelled dataset for questions This is essentially the same dataset as in Table except that we don't have access to the labels. Suppose we are interested in grouping the patients into two disjoint clus- ters based on their symptoms. For this, we choose the K-means algorithm with K=2. Apply only two rounds of updates of the K-means algorithm to this dataset. As for your two starting points for cluster centre candidates, choose data points of Sample 1 and Sample 10. In the first round, break any ties in favour of the cluster whose centre is Sample 10. In the second round, you can break ties arbitrarily, if you face any. You need to provide the detail of your work. Moreover, your final answer should clearly state the resulting clusters (after two rounds), in particular, the cluster centres and the association of each point to each of the two clusters. (a) Consider the following unlabelled dataset of patients no ID Fever Dry Cough Runny Nose Shortness of breath 1 yes yes no no 2 yes yes no yes 3 yes yes no yes 4 yes no no 5 yes yes no no 6 yes yes yes no 7 yes 8 yes no 9 yes yes no 10 yes yes yes no no no no no no yes Table 2: The unlabelled dataset for questions This is essentially the same dataset as in Table except that we don't have access to the labels. Suppose we are interested in grouping the patients into two disjoint clus- ters based on their symptoms. For this, we choose the K-means algorithm with K=2. Apply only two rounds of updates of the K-means algorithm to this dataset. As for your two starting points for cluster centre candidates, choose data points of Sample 1 and Sample 10. In the first round, break any ties in favour of the cluster whose centre is Sample 10. In the second round, you can break ties arbitrarily, if you face any. You need to provide the detail of your work. Moreover, your final answer should clearly state the resulting clusters (after two rounds), in particular, the cluster centres and the association of each point to each of the two clusters
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started