Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

A recruiting firm ( Gazelle Recruiting ) has just received a data file from a college campus with over two hundred graduates who have received

A recruiting firm (Gazelle Recruiting) has just received a data file from a college campus with over two hundred graduates who have received job offers. The data file gives a good deal of information about these graduates, ranging from how well they did in grade school, high school, college etc. The file also contains the salary the graduates were offered for their jobs and whether they were placed or not.
The recruiting firm has no experience reading datasets. They dont have an idea of who these graduates are in terms of demographics. Senior management have asked for your guidance. They have provided a data file and a data dictionary. The data dictionary contains a list of fields in the dataset and an explanation for each one.
To help our client, here are the steps to take:
Cleanse
Look for null data in the table, and use the appropriate strategy to handle null data for each column. Explain why this was the strategy that you used.
Outliers
Search the data for outliers and remove them. Use the appropriate outlier method(s). Show all work.
Who are these graduates ?
Create 4-5 visualizations providing useful and explanatory information about these graduates. The visualizations should show trends, correlations, and other useful patterns that the data provides.
Remember, we are dealing with senior management. The charts must be readable, meaningful and at a summary level.
Use at least one group by and bin in your analysis.
Who are most likely to get placed?
Create visualizations to show management qualities specific to those who are likely to get placed at jobs. Show three meaningful visualizations.
Use at least one group/by or bin in your analysis.
Prepare the data for a regression analysis
The Gazelle management team has heard that it might be possible to use their data to make predictions. They dont know much about data analytics. We are hoping that an example of what is possible will help them understand better.
Prepare the data file to run with a regression analysis. Use the techniques and methods discussed in class.
Run the regression analysis
Show the regression scoring. Split the data into training and test sets. Show a confusion matrix for both.
Two files are attached to this assignment. The data file and the Data dictionary which explains every column in the set.
The code should be commented appropriately

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Systems Design Implementation And Management

Authors: Carlos Coronel, Steven Morris

14th Edition

978-0357673034

More Books

Students also viewed these Databases questions

Question

Distinguish between merit pay, bonus, spot bonuses, and piecework.

Answered: 1 week ago