Question: In this exercise, we will learn about the mathematical details underlying many spectral clustering methods (Section 9.4.3). Given an n n n n

In this exercise, we will learn about the mathematical details underlying many spectral clustering methods (Section 9.4.3). Given an n×nn×n similarity matrix WW whose elements are the similarities between the corresponding data tuples (i.e., W(i,j)W(i,j) measures the similarity between tuples ii and j)j). We wish to partition the data tuples into two clusters. Let qq be a cluster membership vector of length n:q(i)=1n:q(i)=1 if data tuple ii belongs to Cluster A; and q(i)=1q(i)=1 if it belongs to Cluster B. One way to find these two clusters is to minimize the so-called cut-size, which measures the total similarities across different clusters:

9.4.3 Spectral clustering You may hear that spectral clustering methods have been used in many applications.

q=argminq{1,1}nJ=14ni,j=1(q(i)q(j))2W(i,j)q=argminq{1,1}nJ=14i,j=1n(q(i)q(j))2W(i,j)

a. Prove that the cut-size J=12qT(DW)qJ=12qT(DW)q, where DD is the degree matrix of W:D(i,i)=W:D(i,i)= nj=1W(i,j)j=1nW(i,j) and D(i,j)=0D(i,j)=0 for jiji; and TT is the vector transpose.

b. It is very difficult to directly optimize Eq. (9.48) since the cluster membership vector qq is a binary vector. In practice, we relax qq and allow it to take real numbers, and aim to solve the following optimization problem instead. Prove that the optimal solution of Eq. (9.49) is given by the eigenvector of DWDW that corresponds to the second smallest eigenvalue.

q* = arg min_q" (D  W)q  M st. ()==n =| (9.49)

9.4.3 Spectral clustering You may hear that spectral clustering methods have been used in many applications. What is the major idea behind spectral clustering? As discussed in Section 9.2.1, due to the curse of dimensionality, measuring pairwise distance in the original data space of a high-dimensional data set may not be reliable for clustering analysis. Fig. 9.13 illustrates the intuition. The figure shows two clusters of points, one in white and the other in black. Points a and b belong to two different clusters. However, in the original data space, the distance between a and b is shorter than that between a and some other points within the white cluster, such as c. Therefore if we rely on the pairwise distances in the original data space to construct clusters directly, the results may not be meaningful. FIGURE 9.13 0 00 b

Step by Step Solution

3.49 Rating (146 Votes )

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Data Mining Concepts And Techniques Questions!