Question
Please follow the instructions below and answer the question below.
DSCI 352 Assignment 4 - Regression
First, you will need to download the data file "employee-perf.csv" found here: https://drive.google.com/open?id=14Ksw7uUdKDs_CLxqkCke5eLgadrGzp0l
Please place this file in a folder where you can access it with R, and then open up R or R Studio.
Assume you own a software company. When you bring on new employees your hiring process is 1) to give them a technical aptitude test, 2) to bring them in for an interview (conducted by a team of three hiring managers), and if preliminarily hired, 3) put them through a 6-week training course. At the end of the course, you have the option to let the employee go with no further costs or keep them on board. You keep a record of all your employees, including their interview scores (an average score from 1-5 based on three hiring managers' input from the interview), their aptitude test score, the number of training classes they missed, and their annual performance score. These records are found in the included file "employee-perf.csv." You are hiring and want to identify good candidates before having to keep them on the payroll for a full year (in order to get a performance review).
To begin, first, set your working directory to where your "employee-perf.csv" is located and read the data into a variable called employee.dat by executing the following command: > employee.dat <- read.csv("employee-perf.csv")
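As a rough illustration of the loading step, the sketch below writes a tiny stand-in CSV to a temporary folder and reads it back. The column headers and the three rows are assumptions made up for the demo, not the contents of the real "employee-perf.csv". It shows that read.csv replaces spaces in headers with dots by default.

```r
# Hypothetical stand-in for employee-perf.csv; the real file's columns
# and values may differ.
csv.path <- file.path(tempdir(), "employee-perf-demo.csv")
writeLines(c(
  "Employee,Annual Performance Rating,Aptitude Test Score,Interview Score,Missed Training Classes",
  "1,72,80,3.9,2",
  "2,65,61,4.2,5",
  "3,88,93,4.6,0"
), csv.path)

# For the real file you would first call setwd() on the folder that holds it.
# read.csv() uses check.names = TRUE by default, so "Interview Score"
# is read in as Interview.Score.
employee.demo <- read.csv(csv.path)
print(names(employee.demo))
str(employee.demo)
```

Checking names() and str() right after loading is a cheap way to confirm the exact spelling of each column before writing a model formula.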
- Using the data in employee.dat, construct a linear regression model to predict the Annual Performance Rating from all other relevant variables. DO NOT use any variable selection methods such as stepwise selection, etc. (10 points)
1) Place your regression model into a variable called employee.mod.
2) You may or may not need to leave out certain variables (i.e. columns) that are irrelevant. (This does not mean you need to find the best combination of variables, only that some columns may clearly be irrelevant.)
3) In your model, make sure to list any variables (columns) in the same order they appear in the spreadsheet. Also, don't forget to replace spaces ' ' with dots '.' when specifying the variables in your model (i.e. Interview Score becomes Interview.Score).
4) Please use either no spaces or consistent spacing. If you use spaces, make sure to use no more than one space between characters.
Please copy and paste your R command to build the model and assign it to employee.mod below (this should be a single line of R code using the lm function). Your input should be along the following lines: employee.mod <- lm (...
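The shape of such an lm call can be sketched on synthetic data. Everything below is an assumption for illustration: the data are randomly generated rather than taken from "employee-perf.csv", the column names and their order are guessed from the answer options, and Employee is treated as a row identifier and left out of the formula.

```r
# Synthetic stand-in data; NOT the real employee-perf.csv.
set.seed(352)
n <- 40
demo.dat <- data.frame(
  Employee = seq_len(n),
  Aptitude.Test.Score = round(runif(n, 30, 100)),
  Interview.Score = round(runif(n, 1, 5), 2),
  Missed.Training.Classes = sample(0:8, n, replace = TRUE)
)
# Made-up relationship, used only so the demo model has something to fit.
demo.dat$Annual.Performance.Rating <- 20 + 0.5 * demo.dat$Aptitude.Test.Score +
  3 * demo.dat$Interview.Score - 2 * demo.dat$Missed.Training.Classes +
  rnorm(n, sd = 5)

# Response on the left of ~, predictors on the right.
# Employee is assumed to be an ID column, so it is not a predictor.
demo.mod <- lm(Annual.Performance.Rating ~ Aptitude.Test.Score + Interview.Score +
                 Missed.Training.Classes, data = demo.dat)
print(coef(demo.mod))
```

With the real data, the same formula pattern would be written on a single line and assigned to employee.mod, as the assignment requests.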
1. Please select all variables that should be used as the dependent variables in your model.
- Employee
- Annual Performance Rating
- Aptitude Test Score
- Interview Score
2. Please select all variables that should be used as the independent variables in your model.
- Employee
- Annual Performance Rating
- Aptitude Test Score
- Interview Score
- Missed Training Classes
3. In the model you created (employee.mod), which single metric tells you whether your model has any predictive validity? (10 points)
- The coefficient
- The p-value corresponding to the intercept
- The p-value corresponding to the F-statistic
- The adjusted R-squared
- The F-statistic
- The multiple R-squared
- The p-value corresponding to the coefficient
- The Intercept
4. Specify the value in your model of the metric you identified in question 3. (10 points)
Answer: ......................................................................................................................
5. Does your model have any predictive validity? (10 points)
- Yes, with a high degree of certainty
- Yes, with a moderate degree of certainty
- No, with a high degree of certainty
- Inconclusive
- No, with a moderate degree of certainty
6. How much variance in Annual Performance Rating is explained by your model? Please round your answer to two decimal places. (10 points)
Answer: .................................................................................
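The overall-fit numbers these questions refer to can be read from summary() of a fitted lm object. The sketch below is a self-contained illustration on synthetic data, with assumed column names and a made-up relationship, so its printed values say nothing about the real employee model.

```r
# Synthetic stand-in data; NOT the real employee-perf.csv.
set.seed(352)
n <- 40
demo.dat <- data.frame(
  Aptitude.Test.Score = round(runif(n, 30, 100)),
  Interview.Score = round(runif(n, 1, 5), 2),
  Missed.Training.Classes = sample(0:8, n, replace = TRUE)
)
demo.dat$Annual.Performance.Rating <- 20 + 0.5 * demo.dat$Aptitude.Test.Score +
  3 * demo.dat$Interview.Score - 2 * demo.dat$Missed.Training.Classes +
  rnorm(n, sd = 5)
demo.mod <- lm(Annual.Performance.Rating ~ Aptitude.Test.Score + Interview.Score +
                 Missed.Training.Classes, data = demo.dat)

s <- summary(demo.mod)

# summary.lm stores the F-statistic as a named vector: value, numdf, dendf.
f <- s$fstatistic
# The p-value printed by summary() is the upper tail of the F distribution.
f.pvalue <- unname(pf(f["value"], f["numdf"], f["dendf"], lower.tail = FALSE))

cat("F-statistic:", round(unname(f["value"]), 2), "\n")
cat("p-value of the F-statistic:", format.pval(f.pvalue), "\n")
cat("Multiple R-squared:", round(s$r.squared, 2), "\n")
cat("Adjusted R-squared:", round(s$adj.r.squared, 2), "\n")
```

The multiple R-squared is the share of variance in the response explained by the fitted model, and the F-statistic's p-value tests whether the predictors jointly explain more than an intercept-only model would.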
7. Suppose you have the following candidates that have just made it through the training course and you can either retain them or let them go. Their information is as follows:
Candidate | Aptitude test score | Interview score | Missed training classes |
Bill Gates | 84 | 3.55 | 6 |
Mark Zuckerberg | 37 | 4.72 | 1 |
Sergey Brin | 86 | 4.61 | 3 |
This information is contained in the following spreadsheet, which you can download and make predictions on using R's predict function: https://drive.google.com/open?id=1yN8v21QefygI1gPEk2PT4LrLeZwUjfAt
What is the predicted Annual Performance Rating for Bill Gates? Round your answer to two decimal places. (10 points)
Answer: ..........................................................................................................
8. What is the predicted Annual Performance Rating for Mark Zuckerberg? Round your answer to two decimal places. (10 points)
Answer: ..............................................................................................................
9. What is the predicted Annual Performance Rating for Sergey Brin? Round your answer to two decimal places. (10 points)
Answer:...........................................................................................
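Predictions for new candidates come from predict() with a newdata data frame whose column names match the model's predictors. In the sketch below the three candidate rows are copied from the table in question 7, but the model is fitted on synthetic data with assumed column names, so the printed ratings are illustrative only and are not the answers to questions 7-9.

```r
# Synthetic training data; NOT the real employee-perf.csv.
set.seed(352)
n <- 40
demo.dat <- data.frame(
  Aptitude.Test.Score = round(runif(n, 30, 100)),
  Interview.Score = round(runif(n, 1, 5), 2),
  Missed.Training.Classes = sample(0:8, n, replace = TRUE)
)
demo.dat$Annual.Performance.Rating <- 20 + 0.5 * demo.dat$Aptitude.Test.Score +
  3 * demo.dat$Interview.Score - 2 * demo.dat$Missed.Training.Classes +
  rnorm(n, sd = 5)
demo.mod <- lm(Annual.Performance.Rating ~ Aptitude.Test.Score + Interview.Score +
                 Missed.Training.Classes, data = demo.dat)

# Candidate values from the assignment table; column names are assumed
# to match the predictors used in the model formula.
candidates <- data.frame(
  Candidate = c("Bill Gates", "Mark Zuckerberg", "Sergey Brin"),
  Aptitude.Test.Score = c(84, 37, 86),
  Interview.Score = c(3.55, 4.72, 4.61),
  Missed.Training.Classes = c(6, 1, 3)
)

# predict() ignores extra columns such as Candidate.
candidates$Predicted.Rating <- round(predict(demo.mod, newdata = candidates), 2)

# Sort from highest to lowest predicted rating.
ranked <- candidates[order(-candidates$Predicted.Rating),
                     c("Candidate", "Predicted.Rating")]
print(ranked)
```

With the real model, the same call would take employee.mod and the downloaded candidate spreadsheet read in with read.csv.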
10. If you have to choose two to hire based on your model, who would they be? (10 points)
- Bill Gates
- Sergey Brin
- Mark Zuckerberg
11. If you have to choose one to fire based on your model, who would it be? (10 points)
- Bill Gates
- Mark Zuckerberg
- Sergey Brin
12. By examining the model results, which cost-cutting policies (if any) would you recommend, and why? (10 points)
- Eliminate the training classes, because they show no significant impact on employee performance
- All three measures have a significant impact on employee performance; don't eliminate any.
- Eliminate the interview, because it shows no significant impact on employee performance
- Eliminate the aptitude test and interview, because in combination they show no significant impact on employee performance
- Eliminate the aptitude test and training classes, because in combination they show no significant impact on employee performance
- Eliminate the aptitude test, because it shows no significant impact on employee performance
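Whether an individual measure shows a significant effect is usually judged from the per-coefficient p-values in the model summary. The sketch below pulls them out on synthetic data in which Interview.Score is deliberately given no true effect. The data, column names, and the 0.05 threshold are all assumptions for illustration and do not reflect the real employee records.

```r
# Synthetic stand-in data; NOT the real employee-perf.csv.
set.seed(352)
n <- 40
demo.dat <- data.frame(
  Aptitude.Test.Score = round(runif(n, 30, 100)),
  Interview.Score = round(runif(n, 1, 5), 2),
  Missed.Training.Classes = sample(0:8, n, replace = TRUE)
)
# Interview.Score is intentionally absent from the made-up relationship.
demo.dat$Annual.Performance.Rating <- 20 + 0.5 * demo.dat$Aptitude.Test.Score -
  2 * demo.dat$Missed.Training.Classes + rnorm(n, sd = 5)
demo.mod <- lm(Annual.Performance.Rating ~ Aptitude.Test.Score + Interview.Score +
                 Missed.Training.Classes, data = demo.dat)

# coef(summary(...)) is a matrix with one row per term and the columns
# Estimate, Std. Error, t value, Pr(>|t|).
coef.table <- coef(summary(demo.mod))
print(round(coef.table, 4))

# Per-predictor p-values, dropping the intercept row.
p.values <- coef.table[-1, "Pr(>|t|)"]
alpha <- 0.05  # assumed significance threshold
cat("Predictors with p >= alpha:", names(p.values)[p.values >= alpha], "\n")
```

A predictor with a large p-value is one whose coefficient cannot be distinguished from zero given the other predictors in the model, which is the evidence question 12 asks you to weigh.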
Please follow the instructions provided and use R to solve the problems. Then answer questions 1-12.