Question
Need urgent help with the following Data Analysis question. Either R or Python can be used for this exploratory analysis. Imagine that you have been
Need urgent help with the following Data Analysis question. Either R or Python can be used for this exploratory analysis.
Imagine that you have been hired by a philanthropist group to analyze the relationship between house values and neighborhood characteristics. For example, they would like to know whether houses in neighborhoods with desirable characteristics command a higher price. Moreover, they are specifically interested in environmental features, such as proximity to water (i.e. lake, river, or ocean) and air quality. The group has obtained information from tens of thousands of neighborhoods throughout the United States.
I am given a subset of this data, contained in house values.csv, along with the variable descriptions in house values description.txt.
Here is the public link containing both files: https://drive.google.com/drive/folders/1ULhcDkb6HPP0oU6xGj8UY50vckOqWekJ?usp=sharing
My task is to perform a statistical analysis on this data to answer the philanthropist group's questions. I am also required to build a statistical model that allows me to test hypotheses of interest to the group. Additionally, need to include a discussion of statistical issues that may be caused by omitted variables.
I need to make sure all following elements are met in my answer:
- A comprehensive analysis of data quality and integrity
- A discussion of any observations you delete from the dataset, including implications for the final model results.
- A discussion of any data imputation technique you use, including implications for the final model results.
- A thorough exploratory analysis of each variable (and combinations of variables)
- An explanation of how the exploratory data analysis is linked to modeling choices
- An assessment and formal test of all key model assumptions
- A table of regression results that shows multiple model specifications
- A detailed discussion of the model results (in terms of answering the business questions posed by the philanthropist group)
- A discussion of biases caused by omitted variables
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started