Question
The data set "email50" is a subset of the "email" data set in the OpenIntro.org textbook. Each case is an email from David Diez's Gmail account in the early months of 2012. David had 3,921 emails total during that time period, 367 of which were spam and 3,554 of which were not spam.
The variable "spam" indicates whether the email was spam: "0" means the email was not spam, and "1" means the email was spam.
The variable "exclaim_mess" contains the number of exclamation marks ("!") in the email.
Download email50.csv: https://www.openintro.org/data/index.php?data=email50
The Statistics
There are 5 spam emails, with an average of 1.4 exclamation marks per email, and a standard deviation of 0.8944272 exclamation marks.
There are 45 non-spam emails, with an average of 4.755556 exclamation marks per email, and a standard deviation of 8.00896 exclamation marks.
Question
Use the statistics from the email50 dataset to answer the following question:
"At the 10% significance level, is there evidence of a difference in the average number of exclamation marks per email between spam and non-spam emails among all of David's early 2012 emails?"
*If any conditions are not met, please state which one(s) and proceed with the inference.
PLEASE SHOW RSTUDIO COMMANDS
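One possible way to set this up in RStudio, working only from the summary statistics quoted above, is a two-sample t-test for a difference in means with unpooled variances. This is a sketch, not a graded solution. The variable names (n_spam, n_ham, and so on) are made up for illustration, and two choices of degrees of freedom are shown because courses differ: the conservative min(n1, n2) - 1 rule and the Welch-Satterthwaite approximation.

```r
# Summary statistics quoted in the question (email50 sample)
n_spam <- 5;  mean_spam <- 1.4;      sd_spam <- 0.8944272
n_ham  <- 45; mean_ham  <- 4.755556; sd_ham  <- 8.00896   # "ham" = non-spam (illustrative name)

# H0: mu_spam - mu_ham = 0   versus   HA: mu_spam - mu_ham != 0
alpha <- 0.10

# Point estimate and unpooled standard error of the difference in means
diff_means <- mean_spam - mean_ham
v_spam <- sd_spam^2 / n_spam
v_ham  <- sd_ham^2 / n_ham
se     <- sqrt(v_spam + v_ham)

# Test statistic
t_stat <- diff_means / se

# Degrees of freedom: conservative rule and Welch-Satterthwaite approximation
df_cons  <- min(n_spam, n_ham) - 1
df_welch <- (v_spam + v_ham)^2 /
  (v_spam^2 / (n_spam - 1) + v_ham^2 / (n_ham - 1))

# Two-sided p-values under each df choice
p_cons  <- 2 * pt(-abs(t_stat), df = df_cons)
p_welch <- 2 * pt(-abs(t_stat), df = df_welch)

# 90% confidence interval for mu_spam - mu_ham (conservative df)
t_crit <- qt(1 - alpha / 2, df = df_cons)
ci     <- diff_means + c(-1, 1) * t_crit * se

# Decisions at the 10% level
reject_cons  <- p_cons  < alpha
reject_welch <- p_welch < alpha

print(c(t = t_stat, df_cons = df_cons, df_welch = df_welch))
print(c(p_cons = p_cons, p_welch = p_welch))
print(ci)
```

With these inputs the test statistic comes out near -2.66, and the two-sided p-value falls below 0.10 under either choice of degrees of freedom, so the sketch points toward rejecting the null hypothesis at the 10% level. On the conditions: independence is plausible if email50 is a random sample of David's emails, but the near-normality condition is doubtful. The spam group has only 5 emails, and the non-spam standard deviation is larger than its mean, which for a nonnegative count suggests strong right skew. Treat the conclusion with corresponding caution.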