Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

8.2 Clustering the Users Who Posted Tweets About the Public Figure We want to categorize (cluster) the users of the tweets about the Public Figure

8.2 Clustering the Users Who Posted Tweets About the Public Figure
We want to categorize (cluster) the users of the tweets about the Public Figure based on the descriptions provided in their Twitter account. Descriptions in the users' Twitter profiles give a
short piece of information about the Twitter handle. To cluster users, build a document term matrix by using the user descriptions of the tweets (note, not tweets themselves) you downloaded in section 8.1.
1. Use rtweet packages users_data(tweets) function to extract users data from tweets data object you downloaded in Section 8.1. Store the unique author result in variable, authors.
2. Clean the data by pre-processing and then create a weighted Term Document Matrix using unique users descriptions.
3. Compute the appropriate number of clusters using the elbow method. Use cosine distance.
4. Cluster the users and visualize the clusters in two dimensional vector space.
5. Display the count of users in each cluster
6. List a maximum of 10 screen names of users in each cluster.
7. List the top 10 words in each cluster
8. Display the description of the first five users in each cluster.
9. Comment on your findings.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Practical Database Programming With Visual Basic.NET

Authors: Ying Bai

1st Edition

0521712351, 978-0521712354

More Books

Students also viewed these Databases questions