Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

This WHOLE Question must be written using rStudio For this assignment, you will create and compare two graphics that address the primary question for Project

This WHOLE Question must be written using rStudio

For this assignment, you will create and compare two graphics that address the primary question for Project One, namely: "Does limiting user posts increase 7-day retention when compared to the unlimited posting model?" Since both variables (`retention_7` and `version`) are categorical, you might consider creating a bar plot (stacked, side-by-side, or segmented) or a mosaic plot, but you are not limited to these options. Your plots should be presentation-ready, not only for your supervisor but also company leadership. This includes appropriate labels, titles, themes, etc. After creating the two plots, comment briefly on which plot is more interpretable and better represents the data and why you feel it does so.

The Data and The Questions

You are working as a data scientist for a social media startup company. The platform allows users to post text and images, viewable by thousands of other users. In an effort to drive continued engagement on the platform, the company leadership is considering changing its current model, where users can upload an unlimited number of posts, to a more limited model, where users can only make a small number of posts per day. The company leadership believes that such a change would increase the perceived quality of content, thus encouraging users to continue using the platform.

In order to determine whether this change has a positive effect on engagement, a collection of new users were randomly assigned to one of the two models, either unlimited or limited posting. The following information was recorded for each of these new users:

  1. `userid`: numeric; unique user identification number
  2. `site_visits_7`: numeric; the number of site visits in the first 7 days after registering
  3. `retention_1`: logical; whether the user accessed the platform one day after registering
  4. `retention_7`: logical; whether the user accessed the platform 7 days after registering
  5. `version`: character; which posting model was assigned to the user
  6. `day_of_week`: character; day of week when user registered

The data can be found in the file `platform_retention.csv. The primary metric of interest in this study is 7-day retention.The primary question of interest is whether limiting user posts increases 7-day retention when compared to the unlimited posting model.However, company leadership also recognizes that changing the posting model for existing users may have negative effects on retention for those users. As such, your supervisor has asked you to avoid claiming a significant improvement unless the data strongly supports the claim, both in statistical and practical significance.

Here is what platform_retention.csv looks like:

userid site_visits_7 retention_1 retention_7 version day_of_week
1711136 15 FALSE FALSE unlimited Saturday
2964858 18 TRUE FALSE unlimited Thursday
3433251 19 FALSE FALSE unlimited Thursday
8452399 23 TRUE TRUE unlimited Sunday
2854126 27 TRUE TRUE unlimited Friday
2209313 23 TRUE FALSE unlimited Thursday
4386206 13 TRUE FALSE unlimited Friday
2475965 17 FALSE FALSE unlimited Monday
5360385 24 TRUE FALSE unlimited Monday
9171008 17 TRUE TRUE unlimited Sunday
3747153 25 FALSE FALSE unlimited Thursday
5584591 21 FALSE FALSE unlimited Monday
9308466 27 TRUE FALSE unlimited Sunday
4912334 17 FALSE FALSE unlimited Saturday
5402319 22 FALSE FALSE unlimited Thursday
2165188 16 FALSE TRUE unlimited Friday
7203436 18 TRUE FALSE unlimited Sunday
4093021 24 FALSE FALSE unlimited Tuesday
7491353 13 FALSE FALSE unlimited Monday
8082559 25 NA NA unlimited Monday
224388 25 FALSE FALSE unlimited Monday
6491807 18 FALSE FALSE limited Tuesday
2302174 17 TRUE FALSE unlimited Tuesday
7810742 28 FALSE FALSE unlimited Tuesday
5970580 16 TRUE FALSE unlimited Sunday
6053011 15 FALSE FALSE unlimited Monday
6807369 24 NA NA unlimited Thursday
2807863 17 FALSE FALSE unlimited Monday
1703365 20 FALSE TRUE unlimited Wednesday
3318235 27 FALSE FALSE unlimited Monday
5102107 12 TRUE FALSE unlimited Wednesday
8273320 23 FALSE FALSE unlimited Sunday
5937557 21 FALSE FALSE unlimited Friday
6060402 25 FALSE FALSE unlimited Tuesday
1467045 17 FALSE FALSE unlimited Wednesday
7074997 19 FALSE FALSE unlimited Saturday
4823328 16 TRUE FALSE unlimited Monday
981047 26 TRUE FALSE unlimited Sunday
3533785 16 TRUE FALSE unlimited Wednesday
8801315 23 FALSE FALSE unlimited Sunday
7983687 24 FALSE FALSE unlimited Tuesday
1906184 19 FALSE FALSE unlimited Monday
6133236 23 FALSE FALSE unlimited Saturday
8884908 19 TRUE FALSE unlimited Tuesday
7522391 25 FALSE FALSE unlimited Friday
4816064 16 FALSE FALSE unlimited Tuesday
3055424 24 TRUE FALSE unlimited Friday
3719801 14 TRUE FALSE unlimited Sunday
3697758 17 TRUE FALSE unlimited Sunday

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Calculus (Multi Variable)

Authors: Michael Sullivan

1st Edition

1464142890, 9781464142895

More Books

Students also viewed these Mathematics questions