Compare the SCAN algorithm (Section 9.5.3) with DBSCAN (Section 8.4.1). What are their similarities and differences? 9.5.3
Question:
Compare the SCAN algorithm (Section 9.5.3) with DBSCAN (Section 8.4.1). What are their similarities and differences?
Transcribed Image Text:
9.5.3 Graph clustering methods Let us consider how to conduct clustering on a graph. We first describe the intuition behind graph clustering. We then discuss two general categories of graph clustering methods. The intuition of finding clusters in a graph is to cut the graph into pieces, each piece being a cluster, such that the vertices within a cluster are well connected, and the vertices in different clusters are connected in a much weaker way. Formally, for a graph, G = (V, E), a cut, C = (S, T), is a partitioning of the set of vertices V in G, that is, V=SUT and SnT = . The cut set of a cut is the set of edges, {(u, v) E Elu ES, v E T}. The size of the cut is the number of edges in the cut set. For weighted graphs, the size of a cut is the sum of the weights of the edges in the cut set. "What kinds of cuts are good for deriving clusters in graphs?" In graph theory and some network applications, a minimum cut is of importance. A cut is minimum if the size of the cut is not greater than the size of any other cut. There are polynomial time algorithms to compute minimum cuts of graphs. Can we use those algorithms in graph clustering? Example 9.19. Cuts and clusters. Consider graph G in Fig. 9.17. The graph has two clusters: (a, b, c, d, e, f) and {g, h, i, j, k), and one outlier vertex, 1. Consider cut C = ({a, b, c, d, e, f, g, h, i, j, k}, {}). Only one edge, namely, (e, 1), crosses the two partitions created by C. Therefore the cut set of C is {(e,1)), and the size of C is 1. (Note that the size of any cut in a connected graph cannot be smaller than 1.) As a minimum cut, C does not lead to a good clustering because it only separates the outlier vertex, 1, from the rest of the graph. Cut C = ({a, b, c, d, e, f,l}, {g, h, i, j, k}) leads to a much better clustering than C. The edges in the cut set of C are those connecting the two "natural clusters" in the graph. Specifically, for edges (d, h) and (e, k) that are in the cut set, most of the edges connecting d, h, e, and k belong to one cluster.
Fantastic news! We've Found the answer you've been seeking!
Step by Step Answer:
Answer rating: 100% (2 reviews)
Based on the information from the images provided we can compare SCAN Structural Clustering Algorithm for Networks and DBSCAN DensityBased Spatial Clu...View the full answer
Answered By
Sumit kumar
Education details:
QUATERNARY Pursuing M.Tech.(2017-2019) in Electronics and Communication Engg. (VLSI DESIGN) from
GNIOT Greater Noida
TERTIARY B.Tech. (2012-2016) in Electronics and Communication Engg. from GLBITM Greater Noida
SECONDARY Senior Secondary School Examination (Class XII) in 2012 from R.S.S.Inter College, Noida
ELEMENTARY Secondary School Examination (Class X) in 2010 from New R.J.C. Public School ,Noida
CERTIFICATION
Summer Training in ‘WIRELESS EMBEDDED SYSTEM’ from ‘XIONEE’ for the six weeks.
EMBEDDED SYSTEM Certificate issued by CETPA INFOTECH for one day workshop.
Certificate of Faculty development program on OPTICAL COMMUNICATION and NETWORKS for one week.
5.00+
1+ Reviews
10+ Question Solved
Related Book For
Data Mining Concepts And Techniques
ISBN: 9780128117613
4th Edition
Authors: Jiawei Han, Jian Pei, Hanghang Tong
Question Posted:
Students also viewed these Computer science questions
-
Defining the Problem (1). Lead is an environmental pollutant especially worthy of attention because of its damaging effects on the neurological and intellectual development of children. Morton et al....
-
Discuss the role of the client/therapist relationship from the behavior therapist's point of view. What are some of the criticisms of this relationship/ How do behavior therapists like Lazarus and...
-
The following additional information is available for the Dr. Ivan and Irene Incisor family from Chapters 1-5. Ivan's grandfather died and left a portfolio of municipal bonds. In 2012, they pay Ivan...
-
"If nominal GDP rises, velocity must rise." Is this statement true, false, or uncertain? Explain your answer.
-
A 19.5-ft ladder AB leans against a wall as shown. Assuming that the coefficient of static friction s μ is the same at A and B, determine the smallest value of s μ for which equilibrium is...
-
The article Application of Analysis of Variance to Wet Clutch Engagement (M. Mansouri, M. Khonsari, et al., Proceedings of the Institution of Mechanical Engineers, 2002:117125) presents the following...
-
7. Which of these might be valid consideration? a. A promise to do something. b. A promise to refrain from doing something. c. An action. d. All of the above.
-
Indiana Jones Corporation enters into a 6-year lease of equipment on January 1, 2011, which requires 6 annual payments of $40,000 each, beginning January 1, 2011. In addition, Indiana Jones...
-
Accounting for a retrospective change requires reissuing all prior financial statements affected by the change reporting the "catch-up" adjustment on the current income statement. adjusting the...
-
Consider partitioning clustering and the following constraint on clusters: The number of objects in each cluster must be between \(\frac{n}{k}(1-\delta)\) and \(\frac{n}{k}(1+\delta)\), where \(n\)...
-
In a large sparse graph where on average each node has a low degree, is the similarity matrix using SimRank still sparse? If so, in what sense? If not, why? Deliberate on your answer.
-
Refer to Nokias financial statements in Appendix A. Compute its cost of goods available for sale for the year ended December 31, 2009.
-
Based on the historical movement ofAUD/EUR exchange rate, predict what the rate will be on 31 December. a) Discuss the effect of Interest Rate Parity on AUD/EUR exchange rate on 31 December 2021
-
7. (1 point) Express the function y = x+6 as a composition y = f(g(x)) of two simpler functions y = f(u) and u = g(x). Generated by WeBWorK, http://webwork.maa.org, Mathematical Association of...
-
4. Examine Southwest Airlines by providing a detailed analysis of their financials ( (at a minimum, cover one ratio each from profitability, liquidity, leverage, and activity ratios). Compare...
-
Question 4: Anurika Ranches has a May 31, 2021 year end and purchases $100,000 bond investment at a price of 109 on February 2, 2021 using cash. The bond pays cash interest at 9% at a time when...
-
According the Medical Health Association (MHA) data the number of non-government not-for-Profit community hospitals comprising of 2946 and entrepreneurial investor-owned for-profits comprising of...
-
Using a tax research database, list the major Code sections for the following topics: a. Gift tax b. Capital gains c. Stock dividends d. Business energy credits
-
Establish identity. cos( + k) = (-1)k cos , k any integer
-
The thickness of a conductive coating in micrometers has a density function of 600x-2 for 100 m < x <120 m. (a) Determine the mean and variance of the coating thickness. (b) If the coating costs...
-
Suppose that contamination particle size (in micrometers) can be modeled as for Determine the mean of X.
-
Integration by parts is required. The probability density function for the diameter of a drilled hole in millimeters is for mm. Although the target diameter is 5 millimeters, vibrations, tool wear,...
-
Which is true of adding a video to a Google Slides presentation? O you need to have the URL in order to add a video directly, otherwise, you have to download it to your computer first O you can...
-
Which statement about competency models is true? OThey help HR professionals ensure that all aspects of talent management are aligned with an organization's strategy. OThey identify and describe a...
-
Good time management includes always completing your easiest task first which will create a sense of accomplishment and motivate you to tackle next the most difficult task. True False
Study smarter with the SolutionInn App