Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

EDA for Case Competition ( 1 0 0 points ) * This part of the assignment uses the case competition data: insurance _ data.csv .

EDA for Case Competition (100 points)
*This part of the assignment uses the case competition data: insurance_data.csv. See the Case Competition folder in Blackboard for the data and data dictionary.
Case competition background
Describe the business problem.
o What exactly is the business problem to be solved?
o Is the problem a supervised or unsupervised problem? Classification or regression problem?
Data Exploration (Keep this concise.)
Use exploratory data analysis to present the most interesting insights about the data (e.g., summary statistics, histograms, plots of continuous variables by response etc.) It is up to you to determine the most appropriate and interesting insights to be included in the report.
o EDA
What is the response variable? Provide 1 appropriate table or visualization showing the distribution of the response variable.
What are the number of observations and predictor variables?
Which variables are categorical, continuous, or date variables?
Consider what values seem appropriate for each variable using the data dictionary. Are there missing values, NULL values, outliers, or any inconsistencies?
Data visualization: For each of the following plots, describe the insights gained from each plot. Do not simply state what the plot contains.
Explore both continuous and categorical variables. Provide 3 interesting plots.
Provide 2 interesting plots investigating the distribution of the response variable in relation to predictor variables.
o Example visualizations to provide: If the response is categorial, provide 2-way tables of response and categorical predictor variables, or boxplot of continuous predictor variable (y-axis) by categorical response (x-axis). If the response is continuous, provide scatterplots with continuous variables or boxplot of continuous response variable (y-axis) by categorical predictor variable (x-axis).
ChatGPT
It seems like you've shared a detailed part of an assignment related to exploratory data analysis (EDA) for a case competition using insurance_data.csv. To assist you effectively, I'll need more specific questions or areas where you require help. Feel free to ask about a particular aspect of the assignment or any specific analysis you're struggling with, and I'll do my best to assist you.
User
EDA for Case Competition (100 points)
*This part of the assignment uses the case competition data: insurance_data.csv. See the Case Competition folder in Blackboard for the data and data dictionary.
Case competition background
Describe the business problem

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Marketing The New Profit Frontier

Authors: Ed Burnett

1st Edition

0964535629, 978-0964535626

More Books

Students also viewed these Databases questions

Question

Does it use a maximum of two typefaces or fonts?

Answered: 1 week ago