Question

Posted on Aug 26, 2024

We will study a political blog dataset first compiled for the paper: Lada A. Adamic and Natalie Glance, "The political blogosphere and the 2004 US Election", in Proceedings of the WWW-2005 Workshop on the Weblogging Ecosystem (2005). It is assumed that blog sites with the same political orientation are more likely to link to each other, thus forming a "community" or "cluster" in a graph. In this question, we will see whether or not this hypothesis is likely to be true based on the data.

The dataset nodes.txt contains a graph with n = 1490 vertices ("nodes") corresponding to political blogs. The dataset edges.txt contains edges between the vertices. You may remove isolated nodes (nodes that are not connected to any other nodes) in the pre-processing. We will treat the network as an undirected graph; thus, when constructing the adjacency matrix, make it symmetric by, e.g., setting the entry in the adjacency matrix to one whenever there is an edge between the two nodes (in either direction).
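
The pre-processing described above can be sketched as follows. This is a minimal illustration, not the expected solution: it takes an in-memory list of 0-based `(i, j)` pairs because the exact layout of edges.txt is not given here, and the helper names `build_adjacency` and `drop_isolated` are hypothetical. Ignoring self-loops is also an assumption.

```python
import numpy as np

def build_adjacency(edges, n):
    """Return a symmetric 0/1 adjacency matrix for n nodes.

    `edges` is an iterable of (i, j) pairs with 0-based node ids
    (an assumed in-memory format, not the layout of edges.txt).
    """
    A = np.zeros((n, n), dtype=int)
    for i, j in edges:
        if i == j:
            continue  # assumption: self-loops are ignored
        # An edge in either direction sets both entries to one.
        A[i, j] = 1
        A[j, i] = 1
    return A

def drop_isolated(A):
    """Remove nodes with degree zero; also return the kept indices."""
    keep = np.flatnonzero(A.sum(axis=1) > 0)
    return A[np.ix_(keep, keep)], keep

# Small usage example: node 3 only has a self-loop, node 4 has no edges.
edges = [(0, 1), (1, 0), (2, 1), (3, 3)]
A = build_adjacency(edges, n=5)
A_sub, keep = drop_isolated(A)
```

The `keep` array is returned so that the true labels can be filtered with the same indices before any comparison.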

In addition, each vertex has a 0-1 label (in the 3rd column of the data file) corresponding to the true political orientation of that blog. We will consider this as the true label and check whether spectral clustering will group nodes with the same political orientation together as much as possible.

1. (5 points) Use spectral clustering to find the k = 2, 5, 10, 30, 50 clusters in the network of political blogs (each node is a blog, and their edges are defined in the file edges.txt). Find majority labels (same as purity score from the image compression problem) in each cluster for different k values, respectively. For example, if there are k = 2 clusters, and their labels are {0, 1, 1, 1} and {0, 0, 1}, then the majority label for the first cluster is 1 and for the second cluster is 0. It is required that you implement the algorithms yourself rather than calling from a package.
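
One possible shape for a hand-written spectral clustering is sketched below. It is one common variant (normalized adjacency, top-k eigenvectors, row normalization, then k-means), not necessarily the variant the course expects. The assumptions are: `np.linalg.eigh` is used for the eigendecomposition, isolated nodes have already been removed so every degree is positive, and the deterministic farthest-first initialization is a choice made here for reproducibility. All function names are hypothetical.

```python
import numpy as np

def farthest_first_init(X, k):
    """Pick k initial centers: X[0], then repeatedly the farthest point."""
    centers = [X[0]]
    d = ((X - X[0]) ** 2).sum(axis=1)
    for _ in range(1, k):
        idx = int(d.argmax())
        centers.append(X[idx])
        d = np.minimum(d, ((X - X[idx]) ** 2).sum(axis=1))
    return np.array(centers, dtype=float)

def kmeans(X, k, n_iter=100):
    """Plain Lloyd iterations; returns one cluster index per row of X."""
    centers = farthest_first_init(X, k)
    for _ in range(n_iter):
        # Squared distance from every point to every center.
        d = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d.argmin(axis=1)
        new_centers = centers.copy()
        for c in range(k):
            members = X[labels == c]
            if len(members) > 0:  # an empty cluster keeps its old center
                new_centers[c] = members.mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

def spectral_clustering(A, k):
    """Cluster the nodes of a symmetric adjacency matrix A into k groups."""
    A = np.asarray(A, dtype=float)
    deg = A.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(deg)  # assumes no isolated nodes
    M = d_inv_sqrt[:, None] * A * d_inv_sqrt[None, :]  # D^{-1/2} A D^{-1/2}
    _, vecs = np.linalg.eigh(M)      # eigenvalues in ascending order
    Z = vecs[:, -k:]                 # eigenvectors of the k largest
    norms = np.linalg.norm(Z, axis=1, keepdims=True)
    Z = Z / np.where(norms > 0, norms, 1.0)
    return kmeans(Z, k)

# Toy graph: two disjoint 4-node cliques.
block = np.ones((4, 4)) - np.eye(4)
A_toy = np.block([[block, np.zeros((4, 4))], [np.zeros((4, 4)), block]])
toy_assignments = spectral_clustering(A_toy, k=2)
```

On the toy graph the two cliques end up in different clusters; which clique receives index 0 is arbitrary.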

Now compare the majority label with the individual labels in each cluster, and report the mismatch rate (also known as the misclassification rate) for each cluster, when k = 2, 5, 10, 30, 50. For instance, in the example above, the mismatch rate for the first cluster is 1/4 (only the first node differs from the majority), and for the second cluster it is 1/3.
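
The majority label and per-cluster mismatch rate can be computed as in the sketch below, which reproduces the worked example from the question. The function name `cluster_mismatch` is hypothetical, and breaking ties in favor of label 0 is an assumption, since the question does not say how ties are handled.

```python
import numpy as np

def cluster_mismatch(assignments, labels):
    """Map each cluster id to (majority_label, mismatch_rate).

    `assignments[i]` is the cluster of node i and `labels[i]` is its
    true 0/1 label.
    """
    assignments = np.asarray(assignments)
    labels = np.asarray(labels)
    result = {}
    for c in np.unique(assignments):
        members = labels[assignments == c]
        ones = int(members.sum())
        zeros = len(members) - ones
        majority = 1 if ones > zeros else 0  # assumption: ties go to 0
        # Nodes that disagree with the majority are the minority count.
        result[int(c)] = (majority, min(ones, zeros) / len(members))
    return result

# The example from the question: clusters {0, 1, 1, 1} and {0, 0, 1}.
example_assignments = [0, 0, 0, 0, 1, 1, 1]
example_labels = [0, 1, 1, 1, 0, 0, 1]
example_result = cluster_mismatch(example_assignments, example_labels)
```

Here `example_result[0]` is `(1, 0.25)`, and `example_result[1]` has majority label 0 with mismatch rate 1/3, matching the numbers in the text.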

2. (5 points) Tune your k and find the number of clusters that achieves a reasonably small mismatch rate. Please explain how you tune k and what mismatch rate is achieved. Please explain intuitively what this result tells us about the network community structure.
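
One way to tune k is to sweep over candidate values and compare a single overall mismatch rate, defined here as the total number of nodes that disagree with their cluster's majority label divided by the total number of nodes. This is a sketch under assumptions: `cluster_fn` is a hypothetical callable that returns cluster assignments for a given k (for example a wrapper around your own spectral clustering), and the overall rate is one reasonable summary, not one prescribed by the question.

```python
import numpy as np

def overall_mismatch(assignments, labels):
    """Fraction of nodes whose label differs from their cluster majority."""
    assignments = np.asarray(assignments)
    labels = np.asarray(labels)
    mismatches = 0
    for c in np.unique(assignments):
        members = labels[assignments == c]
        ones = int(members.sum())
        mismatches += min(ones, len(members) - ones)
    return mismatches / len(labels)

def sweep_k(cluster_fn, labels, ks):
    """Return {k: overall mismatch rate} for each candidate k."""
    return {k: overall_mismatch(cluster_fn(k), labels) for k in ks}

# Usage with a stand-in clustering function (hypothetical, for illustration).
demo_labels = [0, 1, 1, 1, 0, 0, 1]

def demo_cluster_fn(k):
    # k = 2 reproduces the example from the question; any other k puts
    # every node in its own cluster.
    return [0, 0, 0, 0, 1, 1, 1] if k == 2 else list(range(7))

demo_rates = sweep_k(demo_cluster_fn, demo_labels, ks=[2, 7])
```

Note that when every node is its own cluster the rate is trivially zero, so a low rate at a small k says more about community structure than a low rate at a very large k.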
