Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

A quick but detailed soultion please, the last answer to this question did not have enough detail, i have done part a just need part

A quick but detailed soultion please, the last answer to this question did not have enough detail, i have done part a just need part b and c in detail. I.E the exact working out for each step, when calcuating the clutser, i have got this answer which seems incorrect can you point my mistake out, so c2 is 150 and you subtract it from c1, c4 and c7 respectively which are 200,250,100 and you get the answer 50,100,50 you then choose the lowest one at random and i chose for c2 to be assigned with c1 so the k1 cluster you then calculate the new mean which is 200+150 divide by 2 then you assign c3 to a cluster which is 300 so you subtract 300-175 which is c1 and c2 mean then 300-the mean of k2 which is c4250 and k3 which is c7 and 100 the answers you get is 125,50,200 you assign c3 to c4 and calculate the mean which is 250+300 divide by 2 equals 275 now for c5 which is 350, k1 which is 175 which is subtracted from 350 then k2 which is 275 subtracted from 350 then k3 which is 100 subtracted from 350 equals 50 assign to k2 and calculate new mean which is 250+300+275 divide by 3 giving 275 now for c6 which is 400 try with k1400-175 equals 225 then for k2400-275 equals 125 and similarly for k3400-100=300 assign to k2 new mean for k2 is now 275+400+300+250 divide by 4 equals 306.25, now for c8 which is 80 k1 which is 175 subtract 80 equals 95 k2: 306.25-80=226.25, k3 which is 100-80 equals 20 answer is 20 assign to c7, why has the algorithm converged show reasons why, why can a second epoch not occur?
Question 4B - Clustering [25%]
You have been provided with a dataset containing information about customer spending habits. Your task is to use the k-means algorithm with Euclidean distance to cluster the following 8 examples into 3 clusters:
\table[[Customer,Spending (in $)],[C1,200],[C2,150],[C3,300]]
\table[[C4,250],[C5,350],[C6,400],[C7,100],[C8,80]]
Suppose that the initial seeds (centres of each cluster) are C1, C4, and C7. Run the k-means algorithm for 1 epoch only.
In particular:
a) Fill the distance matrix based on the Euclidean distance of the points given above:
\table[[,C1,C2,C3,C4,C5,C6,C7,C8],[C1,0,,,,,,,],[C2,,0,,,,,,],[C3,,,0,,,,,],[C4,,,,0,,,,],[C5,,,,,0,,,],[C6,,,,,,0,,],[C7,,,,,,,0,],[C8,,,,,,,,0]]
b) Calculate the cluster assignment at the end of the first epoch:
a. The new cluster assignment (i.e., contents of each cluster)
b. The centroids of the new clusters
c) How many more iterations are needed to converge? Show cluster assignments and updated centroids for each of the remaining epochs.
In your report, you need to include the appropriate cluster assignment and centroids.
image text in transcribed

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Pro SQL Server Wait Statistics

Authors: Enrico Van De Laar

1st Edition

1484211391, 9781484211397

More Books

Students also viewed these Databases questions