Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

able b. Hierarchical clustering is sometimes used generate K clusters (K>1) by taking the clusters at Kth level of the dendrogram (Root is at level

image text in transcribed

able b. Hierarchical clustering is sometimes used generate K clusters (K>1) by taking the clusters at Kth level of the dendrogram (Root is at level 1). By looking at the clusters produced in this way, we can evaluate the behavior of hierarchical clustering on different types of data and clusters, and also compare hierarchical approaches to K-Means The following is a set of 1-dimensional points: {6, 12, 18, 24, 30, 42,48) 1) For each of the following sets of initial centroids, create two clusters by assigning each point to the nearest centroid, and then calculate the total squared error for each set of two clusters. Show both the clusters and the total squared error for each set of centroids. (3, 7.53; (15, 25) Do both sets of the centroids represent stable solutions, i.e., if the K-Means algorithm was run on this set of points using the given centroids as starting centroids, would there be any change in the clusters generated? What are the two clusters produced by MIN? (MIN is single-link clustering, also called minimum method) Which technique, K-Means or MIN, seems to produce the "most natural clustering in this situation? What well-known characteristic of the K-Means algorithm explains the previous behavior? 2) 3) 4) 5) able b. Hierarchical clustering is sometimes used generate K clusters (K>1) by taking the clusters at Kth level of the dendrogram (Root is at level 1). By looking at the clusters produced in this way, we can evaluate the behavior of hierarchical clustering on different types of data and clusters, and also compare hierarchical approaches to K-Means The following is a set of 1-dimensional points: {6, 12, 18, 24, 30, 42,48) 1) For each of the following sets of initial centroids, create two clusters by assigning each point to the nearest centroid, and then calculate the total squared error for each set of two clusters. Show both the clusters and the total squared error for each set of centroids. (3, 7.53; (15, 25) Do both sets of the centroids represent stable solutions, i.e., if the K-Means algorithm was run on this set of points using the given centroids as starting centroids, would there be any change in the clusters generated? What are the two clusters produced by MIN? (MIN is single-link clustering, also called minimum method) Which technique, K-Means or MIN, seems to produce the "most natural clustering in this situation? What well-known characteristic of the K-Means algorithm explains the previous behavior? 2) 3) 4) 5)

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Marketing The New Profit Frontier

Authors: Ed Burnett

1st Edition

0964535629, 978-0964535626

More Books

Students also viewed these Databases questions

Question

=+ Why have these changes occurred?

Answered: 1 week ago

Question

=+90 percent of all oil refineries) into several smaller companies

Answered: 1 week ago