Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Follow the instructions below and write an essay that covers the following tasks. R script. R screenshot/s, your results and explanations should be included

Follow the instructions below and write an essay that covers the following tasks. R script. R screenshot/s,

Table 1. Account-based and Content-based Features Description Feature Name Description Account-based Features

Follow the instructions below and write an essay that covers the following tasks. R script. R screenshot/s, your results and explanations should be included for each question in the essay. 1 Load Twitter Spam dataset into R studio, use ggplot function to make density plot of Tweets' number (column: no_tweets) to compare spam and non-spam. Clarify how you make the plot and what's your observation from the density plot. 2 Use ggplot function to make scatterplots to present the relation of posted tweets number (column: no_tweets) and the number of followers (column: no_follower), displaying spammer and non-spammer with different colours. Clarify how you make the plot and what's your observation from the scatter plot. 3 Add regression lines to the scatterplot generated in Step 2 for spammer and non-spammer separately. what's your observation from the regression lines? Compared with scatterplot only, what are the advantages of adding regression lines? Table 1. Account-based and Content-based Features Description Feature Name Description Account-based Features account age no follower no following no.userfavorites no lists no tweets no. retweets no tweetfavorites no hashtag no usermention no.urls no.char no.digits The age of an account # of followers # of followings #of favourites the user received # of lists in which the user is a member of #of tweets that has been posted by the user. Content-based Features # of times this tweet has been retweeted # of favourites this tweet received # of hashtags in this tweet # of times this tweet being mentioned #of URLs contained in this tweet # of characters in this tweet # of digits in this tweet

Step by Step Solution

3.45 Rating (161 Votes )

There are 3 Steps involved in it

Step: 1

1 I loaded the TwitterSpam dataset into R studio and I used ggp... blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Income Tax Fundamentals 2013

Authors: Gerald E. Whittenburg, Martha Altus Buller, Steven L Gill

31st Edition

1111972516, 978-1285586618, 1285586611, 978-1285613109, 978-1111972516

More Books

Students also viewed these Computer Network questions