Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Need urgent help with the following Data Analysis question. Either R or Python can be used for this exploratory analysis. Imagine that you have been

Need urgent help with the following Data Analysis question. Either R or Python can be used for this exploratory analysis.

Imagine that you have been hired by a philanthropist group to analyze the relationship between house values and neighborhood characteristics. For example, they would like to know whether houses in neighborhoods with desirable characteristics command a higher price. Moreover, they are specifically interested in environmental features, such as proximity to water (i.e. lake, river, or ocean) and air quality. The group has obtained information from tens of thousands of neighborhoods throughout the United States.

I am given a subset of this data, contained in house values.csv, along with the variable descriptions in house values description.txt.

Here is the public link containing both files: https://drive.google.com/drive/folders/1ULhcDkb6HPP0oU6xGj8UY50vckOqWekJ?usp=sharing

My task is to perform a statistical analysis on this data to answer the philanthropist group's questions. I am also required to build a statistical model that allows me to test hypotheses of interest to the group. Additionally, need to include a discussion of statistical issues that may be caused by omitted variables.

I need to make sure all following elements are met in my answer:

  1. A comprehensive analysis of data quality and integrity
  2. A discussion of any observations you delete from the dataset, including implications for the final model results.
  3. A discussion of any data imputation technique you use, including implications for the final model results.
  4. A thorough exploratory analysis of each variable (and combinations of variables)
  5. An explanation of how the exploratory data analysis is linked to modeling choices
  6. An assessment and formal test of all key model assumptions
  7. A table of regression results that shows multiple model specifications
  8. A detailed discussion of the model results (in terms of answering the business questions posed by the philanthropist group)
  9. A discussion of biases caused by omitted variables

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access with AI-Powered Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Financial Accounting An Integrated Approach

Authors: Ken Trotman, Michael Gibbins, Elizabeth Carson

6th Edition

0170349683, 9780170349680

Students also viewed these Mathematics questions