Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Exercise 1 ( 5 0 points ) Create the following visualizations using R ( ggplot 2 ) and dataset ( acs _ ny . csv
Exercise points
Create the following visualizations using ggplot and dataset acsnycsv which is a subset of the
American Community Survey ACS for the New York state.
a Create a bar chart for income levels $ or less, $ $$$
and more than $ and language spoken at home.
b b Create a visualization to show the relationship between familyIncome $ or less,
$ $$ $ and more than $ and average HouseCost.
c Create scatterplots using to visualize relationship between the dependent variable
FamilyIncome and each independent variable.
d Using the dataset, create a multiple regression model in The purpose of this model is to
predict the household income.
Create and compare at least four models.
Perform Stepwise regression to find an optimal set of predictors.
Explain your results including squared and values.
Which model provides better accuracy? Why?
Does your model conform to the assumptions of linear regression?
Exercise points
Using the provided subset acsnycsv of the American Community Survey ACS for New York
state, create a logistic regression model in R The purpose of this model is to predict whether a
household has an income greater than $
a Explain your results including deviance residuals, coefficients, and AIC.
b Create coefficient plot for logistic regression on family income greater than $
Hint: You will need to create a new binary variable with TRUE for income above $ and FALSE for
income below. It will be helpful to create density plot of family income to see distribution.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started