Question

I need urgent help with the following data analysis question. Either R or Python can be used for this exploratory analysis.

Imagine that you have been hired by a philanthropist group to analyze the relationship between house values and neighborhood characteristics. For example, they would like to know whether houses in neighborhoods with desirable characteristics command a higher price. They are specifically interested in environmental features, such as proximity to water (i.e., a lake, river, or ocean) and air quality. The group has obtained information from tens of thousands of neighborhoods throughout the United States.

I am given a subset of this data, contained in house values.csv, along with the variable descriptions in house values description.txt.

Here is the public link containing both files: https://drive.google.com/drive/folders/1ULhcDkb6HPP0oU6xGj8UY50vckOqWekJ?usp=sharing

My task is to perform a statistical analysis on this data to answer the philanthropist group's questions. I am also required to build a statistical model that allows me to test hypotheses of interest to the group. Additionally, I need to include a discussion of the statistical issues that may be caused by omitted variables.
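
To make the omitted-variable concern concrete, here is a minimal sketch on simulated data, not on the actual house values.csv. The variable names (`near_water` as a continuous proximity score, `income`) and the coefficients 2.0 and 3.0 are assumptions chosen purely for illustration. It compares a "short" regression that omits income with a "long" regression that controls for it, using only the Python standard library.

```python
import random

def slope(x, y):
    """OLS slope of y on a single regressor x (intercept included)."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

def residualize(y, x):
    """Residuals of y after regressing it on x (intercept included)."""
    b = slope(x, y)
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    return [yi - my - b * (xi - mx) for xi, yi in zip(x, y)]

random.seed(0)
n = 5000

# Hypothetical data: income is positively correlated with proximity to water.
income = [random.gauss(0, 1) for _ in range(n)]
near_water = [0.6 * inc + random.gauss(0, 1) for inc in income]

# Assumed true model: both variables raise house value.
TRUE_WATER, TRUE_INCOME = 2.0, 3.0
value = [TRUE_WATER * w + TRUE_INCOME * inc + random.gauss(0, 1)
         for w, inc in zip(near_water, income)]

# Short regression: income is omitted, so near_water absorbs part of its effect.
b_short = slope(near_water, value)

# Long regression via the Frisch-Waugh-Lovell theorem:
# partial income out of both the outcome and the regressor, then regress.
b_long = slope(residualize(near_water, income), residualize(value, income))

# Textbook bias formula: TRUE_INCOME * cov(near_water, income) / var(near_water).
predicted_bias = TRUE_INCOME * slope(near_water, income)

print(f"short regression slope: {b_short:.3f}")
print(f"long regression slope:  {b_long:.3f}")
print(f"predicted bias:         {predicted_bias:.3f}")
```

In this setup the short regression overstates the water coefficient because the omitted variable is positively correlated with both the regressor and the outcome. On the real data, the sign and size of any such bias depend on which variables are missing, which cannot be known from the simulation.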

I need to make sure my answer includes all of the following elements:

  1. A comprehensive analysis of data quality and integrity
  2. A discussion of any observations you delete from the dataset, including implications for the final model results
  3. A discussion of any data imputation technique you use, including implications for the final model results
  4. A thorough exploratory analysis of each variable (and combinations of variables)
  5. An explanation of how the exploratory data analysis is linked to modeling choices
  6. An assessment and formal test of all key model assumptions
  7. A table of regression results that shows multiple model specifications
  8. A detailed discussion of the model results (in terms of answering the business questions posed by the philanthropist group)
  9. A discussion of biases caused by omitted variables
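
As a starting point for items 1 to 3, a data-quality audit might look something like the sketch below. The column names (`house_value`, `distance_to_water`, `air_quality_index`), the inline sample rows, and the list of missing-value tokens are all hypothetical stand-ins, since the real names are defined in house values description.txt. The sketch uses only the standard library; in practice pandas would be the more usual tool for this.

```python
import csv
import io

# Tiny stand-in for house values.csv; column names and rows are hypothetical.
SAMPLE = """house_value,distance_to_water,air_quality_index
250000,1.2,41
,0.4,38
310000,NA,55
-1,3.5,47
"""

# Assumed encodings of a missing value; adjust to what the real file uses.
MISSING_TOKENS = {"", "NA", "NaN", "null"}

def audit(rows, columns):
    """Count missing and negative entries for each column."""
    report = {c: {"missing": 0, "negative": 0} for c in columns}
    for row in rows:
        for c in columns:
            raw = (row[c] or "").strip()
            if raw in MISSING_TOKENS:
                report[c]["missing"] += 1
            elif float(raw) < 0:
                report[c]["negative"] += 1
    return report

reader = csv.DictReader(io.StringIO(SAMPLE))
rows = list(reader)
report = audit(rows, reader.fieldnames)

for column, counts in report.items():
    print(column, counts)
```

Counts like these feed directly into the deletion and imputation discussion: each flagged row is either dropped, corrected, or imputed, and the write-up should state which choice was made and how it could shift the regression estimates.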
