Question
Reference: MAT 2375 Introduction to statistics and MAT3378 Analysis of experience plans Project: Part I Data generation Set. seed(123) # For reproducibility # Generate random
Reference: MAT 2375 Introduction to statistics and MAT3378 Analysis of experience plans
Project: Part I Data generation
Set. seed(123) # For reproducibility
# Generate random data
n< 100#
regions <- c"North" "South" "East" "0uest"
categories_expendits <- ("Power", "Clothing", "Electronics")
data <- data. frame (
Region = sample (regions, n, replace= True),
Income = rnorm(n, mean = 50000, sd = 10000),
Age = sample(20:70, n, replace = TRUE),
Expendits = rnorm(n, mean = 300, sd = 50) ,
Category = sample (catgories _dpenses, n, replace = TRUE)
)
Using the above dataset,
1. Perform a basic exploration (a few graphs).
2. Is the data normally distributed?
3. Is there a difference in average expenses between the pairs of category? Note: It means Is that you make a pair test using a t-test for all combinations, i.e.
H_0; average_North=MediumSouth vs H_1: not equal
H_0; average_East=Middle_South vs H_1: not equal.....
.... All combinations per pair (6 tests in total);
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started