Question
USING R STUDIO
Cleaning data
You were hired as an intern at a social media think tank. As your first assignment, you were given the Tik_Tok_data.csv data set. Your supervisor needs to produce some scatter plots to give insight into what affects the popularity of some videos. Unfortunately, the data was scraped in a human-readable format rather than a machine-readable one. That's where you come in: on your resume you mentioned taking a Data Analysis class and mastering R.

Use your regex skills to clean the data set. Make sure that video duration is measured in seconds. If a video has Share as the value for the number of shares, it means that the video got no shares. If, instead of the view count, it says Participating in this. . . , that means that the video went private. Make sure to comment the code you used to clean the data, and write a note explaining what decisions you made when dealing with missing/ambiguous data.

Run a regression with views as the dependent variable and followers, likes, comments, shares and duration as the independent variables. Interpret your results.

Calculate the total number of views, likes and comments gained by each user. Produce a publication-quality table that has all of the users, ordered by the total number of views. Please use the tidyverse.
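A minimal sketch of how the cleaning and summary steps might look in R with the tidyverse. The question does not show the file's layout, so everything about the raw format here is an assumption: the column names (`user`, `followers`, `views`, `likes`, `comments`, `shares`, `duration`) are hypothetical, counts are assumed to use K/M/B suffixes or thousands separators, and durations are assumed to look like `m:ss` or `h:mm:ss`. The three-row `toy` tibble is made-up illustration data, not rows from Tik_Tok_data.csv. The missing-data choices are also just one option: `Share` becomes 0 shares as the question specifies, and a private video gets `NA` views, so `lm()` drops it by default and the per-user totals skip it.

```r
library(tidyverse)

# Turn human-readable counts ("1.2M", "45.3K", "1,024") into numbers.
# ASSUMPTION: K/M/B suffixes; anything without a leading digit becomes NA.
parse_count <- function(x) {
  x <- str_trim(as.character(x))
  num <- as.numeric(str_remove_all(str_extract(x, "^[0-9][0-9.,]*"), ","))
  suffix <- str_to_upper(str_extract(x, "[KkMmBb]$"))
  mult <- case_when(suffix == "K" ~ 1e3, suffix == "M" ~ 1e6,
                    suffix == "B" ~ 1e9, TRUE ~ 1)
  num * mult
}

# Turn "m:ss" or "h:mm:ss" (or a bare number of seconds) into seconds.
# ASSUMPTION: colon-separated durations; other formats need their own regex.
parse_duration <- function(x) {
  parts <- str_split(str_trim(as.character(x)), ":")
  map_dbl(parts, function(p) {
    v <- as.numeric(str_extract(p, "[0-9.]+"))
    if (anyNA(v)) return(NA_real_)
    sum(v * 60^rev(seq_along(v) - 1))  # rightmost part is seconds
  })
}

# Clean a raw table whose columns were all read as character.
clean_tiktok <- function(raw) {
  raw %>%
    mutate(
      # "Share" in the shares column means the video got no shares.
      shares = if_else(
        str_detect(shares, regex("^\\s*share\\s*$", ignore_case = TRUE)),
        "0", shares
      ),
      # A private video has text instead of a view count, so views becomes NA.
      across(c(views, followers, likes, comments, shares), parse_count),
      duration_sec = parse_duration(duration)
    )
}

# Total views, likes and comments per user, ordered by total views.
user_totals <- function(df) {
  df %>%
    group_by(user) %>%
    summarise(
      across(c(views, likes, comments), ~ sum(.x, na.rm = TRUE),
             .names = "total_{.col}"),
      .groups = "drop"
    ) %>%
    arrange(desc(total_views))
}

# Made-up rows, only to show the helpers running end to end.
toy <- tibble(
  user      = c("a", "a", "b"),
  followers = c("1.2M", "1.2M", "530"),
  views     = c("45.3K", "Participating in this. . .", "1,024"),
  likes     = c("2K", "10", "87"),
  comments  = c("15", "3", "4"),
  shares    = c("Share", "12", "1.1K"),
  duration  = c("0:45", "1:23", "1:02:03")
)
cleaned <- clean_tiktok(toy)
totals  <- user_totals(cleaned)

# With the real file (rename columns to match what it actually contains):
# raw     <- read_csv("Tik_Tok_data.csv", col_types = cols(.default = col_character()))
# cleaned <- clean_tiktok(raw)
# fit     <- lm(views ~ followers + likes + comments + shares + duration_sec, data = cleaned)
# summary(fit)
# knitr::kable(user_totals(cleaned), format.args = list(big.mark = ","))
```

Reading every column as character first keeps `read_csv()` from guessing types on mixed values such as `45.3K` and `Share`. When interpreting `summary(fit)`, each coefficient is the estimated change in views for a one-unit change in that predictor with the others held fixed. `knitr::kable()` is used for the table only as a lightweight choice; a package such as gt would also work.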