Question
Players (Do not use this column in data analysis. Player's names are just for your information.) Club Head Speed (mph) Driving Distance (yds) Aaron Wise
Players (Do not use this column
in data analysis. Player's names
are just for your information.)
Club Head Speed
(mph)
Driving Distance
(yds)
Aaron Wise 115.05 302.20
Abraham Ancer 111.12 290.00
Adam Hadwin 113.70 294.60
Adam Long 111.48 289.60
Adam Scott 119.74 307.70
Billy Horschel 114.56 293.10
Brendan Steele 118.04 299.80
Brendon Todd 105.27 275.80
Brian Harman 110.51 292.40
Brian Stuard 105.82 281.20
Bryson DeChambeau 132.83 320.80
Cameron Percy 110.84 283.80
Cameron Smith 114.21 296.80
Charles Howell III 119.22 297.90
Charley Hoffman 118.10 307.30
Chris Kirk 110.84 290.40
Collin Morikawa 115.02 293.90
Corey Conners 116.11 294.50
Daniel Berger 117.33 297.90
Denny McCarthy 111.59 295.10
Doug Ghim 112.18 295.20
Dustin Johnson 119.56 313.30
Dylan Frittelli 120.05 310.60
Harold Varner III 115.42 301.80
Harry Higgs 118.21 297.40
J.T. Poston 115.89 301.00
James Hahn 114.86 299.80
Jason Day 117.26 300.60
Jason Kokrak 119.43 304.10
Jhonattan Vegas 122.17 304.70
Joaquin Niemann 119.72 312.70
Jon Rahm 119.28 306.10
Jordan Spieth 115.17 297.50
Justin Thomas (Winner) 117.90 300.40
Keegan Bradley 115.68 300.00
Kramer Hickok 114.47 293.10
Kyoung-Hoon Lee 114.22 298.50
Lanto Griffin 121.11 303.80
Lee Westwood 114.58 295.20
Louis Oosthuizen 111.65 297.20
Lucas Glover 113.28 294.40
Martin Laird 115.24 296.00
Matt Fitspatrick 116.03 287.10
Matt Jones 118.28 305.30
Michael Thomson 108.75 289.30
Nate Lashley 110.71 292.80
Nick Taylor 111.63 294.10
Patrick Reed 115.60 287.40
Patrick Rodgers 120.05 306.10
Patton Kizzire 115.15 301.60
Paul Casey 117.03 302.10
Phil Michelson 119.20 301.40
Rory Sabbatini 110.42 288.40
Russell Knox 112.08 291.10
Ryan Armour 106.86 279.50
Ryan Moore 108.36 288.00
Ryan Palmer 117.84 302.90
Scott Brown 116.41 297.50
Scott Harrington 120.78 306.70
Scott Piercy 111.60 291.90
Sergio Garcia 121.07 308.60
Shane Lowry 115.69 299.40
Si Woo Kim 114.57 297.80
Sungjae Im 113.95 297.40
Talor Gooch 118.75 300.20
Tom Hoge 112.13 289.20
Tyler McCumber 122.44 308.40
Victor Perez 119.17 295.00
Will Zalatoris 120.47 307.90
Zach Johnson 110.82 290.60
Excel demo videos. Please check. Correct application andinterpretation
are the purposes of this project.
Please check followings first:
? Please make sure that your Excel is full-version equipped withData Analysis. If not
sure, check the file “Bring Add-Ins Analysis Toolpack to Excel”from Canvas.
? Download and save both problem (Word) file and data (Excel) file.Work on them:
type your answer under each problem in the Word file and make yourExcel output in
the Excel file. SAVE your work! You will submit (post) these twofiles as your
project. Both will be graded.
? Submit (post) both files to the submission link of Project modulein Canvas by
11:55pm, Sunday, Dec. 5, 2021.
? You can post your files multiple times by due time, but the mostrecent posting is not
added to but “replaces” previous posting. Therefore, please makesure to post your
complete version. Grading your files will begin after duedate.
? Point assignment (Details for each output and problem are frompage 2):
Total 40 points from Excel output
Total 60 points from your answers to Q1 ~ Q9.
Grand Total 100 points
+ Extra credit 5 points from Q10.
? Data: project data are stored in Excel “Project Data” file.Download and save it as
yours and run analyses in it. Save your output. Details are fromnext page.
Content of data: “Club Head Speed” and “Driving* Distance” ofselected 70 players
who competed in 2021 THE PLAYERS Championship, Mar. 11 ~ 14,2021.
This tournament is randomly selected out of many internationaltournaments for
male professional golfers.
(*Driving distance: how far does a player hit a golf ball with thelongest golf club
called driver.)1
Project FA21 DSCI 2305
This project requires the understanding of whole course andespecially
more of chapters 4.8, and 13.
Excel Demo videos in Canvas week 13 module show how to runeach
Excel data analysis using practice data. Practice data aredifferent from
project data. Practice and apply the data analysis procedure to theproject.
Instructions on how to interpret Excel output are both on lecturevideos and
Excel demo videos. Please check. Correct application andinterpretation
are the purposes of this project.
Please check followings first:
? Please make sure that your Excel is full-version equipped withData Analysis. If not
sure, check the file “Bring Add-Ins Analysis Toolpack to Excel”from Canvas.
? Download and save both problem (Word) file and data (Excel) file.Work on them:
type your answer under each problem in the Word file and make yourExcel output in
the Excel file. SAVE your work! You will submit (post) these twofiles as your
project. Both will be graded.
? Submit (post) both files to the submission link of Project modulein Canvas by
11:55pm, Sunday, Dec. 5, 2021.
? You can post your files multiple times by due time, but the mostrecent posting is not
added to but “replaces” previous posting. Therefore, please makesure to post your
complete version. Grading your files will begin after duedate.
? Point assignment (Details for each output and problem are frompage 2):
Total 40 points from Excel output
Total 60 points from your answers to Q1 ~ Q9.
Grand Total 100 points
+ Extra credit 5 points from Q10.
? Data: project data are stored in Excel “Project Data” file.Download and save it as
yours and run analyses in it. Save your output. Details are fromnext page.
Content of data: “Club Head Speed” and “Driving* Distance” ofselected 70 players
who competed in 2021 THE PLAYERS Championship, Mar. 11 ~ 14,2021.
This tournament is randomly selected out of many internationaltournaments for
male professional golfers.
(*Driving distance: how far does a player hit a golf ball with thelongest golf club
called driver.)
2
Follow the order of problems which makes the project moreorganized.
The significance level in all analyses of this project is 0.05(i.e. ?=0.05).
Let’s describe each variable in following analysis. We call thisunivariate analysis:
(Excel output: 5 pts x 2 = 10 pts) Open your Excel data file. Thisis the part you work in
“Project Data” file. From Excel demo video, you learned how to runDescriptive Statistics in
Data Analysis. Using Excel, run “Descriptive Statistics” for ‘ClubHead Speed” and “Driving
Distance”. In Descriptive Statistics options, set confidence levelaccordingly with correct %
value. The significance level in all analyses of this project is0.05 (i.e. ?=0.05). As we learned
from chapter 9, the confidence level in % value is 100 (1-?).
(If you are not sure about the procedure of Descriptive Statistics,look up and practice with Excel
demo videos posted in Canvas. From this output, you will find manyfamiliar measures learned
in this course. A few are beyond the scope of this course. Afteryou get output, check the value
for “count’. If you included all of data, count value should be 70which is the sample size or total
observations.)
Use the output to answer Q1 and Q2.
Q1. (10 pts) Review the output of driving distance. What is samplemean? When we estimate
the true mean of driving distance with 95% confidence intervalusing this sample, what is the
margin of error?
Interpret your margin of error in the relation between mean of thissample (data from one
randomly chosen tournament) and the true mean of population (allpossible top player’s
performance).
Q2. (5 pts) With the answer of Q1, calculate 95% confidenceinterval, [low endpoint, upper
endpoint] for true mean of driving distance of top professionalmale golfers. Interpret the
interval.
3
Let’s describe the association between two characteristics. We callthis bivariate analysis.
? (Excel output: 10 pts) Run Excel Scatter Plot of “Club HeadSpeed” vs. “Driving Distance”.
? (Excel output: 5 pts) Run Excel Correlation between “Club HeadSpeed” vs. “Driving
Distance”.
Use the Scatter plot and correlation coefficient answeringQ3.
Q3. (10 pts) Do your output verify the common sense (faster ballflies farther)? Explain the
shape (direction and how tightly clustered) of the scatter plot of“Club Head Speed” vs. “Driving
Distance”. Does correlation coefficient support what scatter plotshows? Explain correlation
coefficient’s sign and magnitude and relate them with shape ofscatter plot.
Let’s check more details of the relation. To predict DrivingDistance based on Club Head
Speed, following linear model is examined using regressionanalysis. (Assumptions about error
term in simple linear regression are checked to besatisfied.)
yˆ=b0+b1x
Q4. (4 pts) We are going to use two variables, “driving distance”and “club head speed” in
regression, to predict “driving distance” based on “club headspeed”. Then, out of these two
variables, which is Y and which is X?
? (Excel output: 15 pts) Using driving distance” and “club headspeed”, run regression in
Excel data analysis. In regression option assignment, assignvariables for X and Y as you
answered in Q4.
Use regression output for answering Q5 through Q10.
Q5. (6 pts) Find the estimated y-intercept and estimated slope.Round your answer to three
decimal places.
? y-intercept:
? slope:
? Using all information above, write down the full equation of theregression model:
4
Q6. (5 pts) According to the estimated linear model, when the clubhead speed increases by
one mile per hour, what happens to the estimated driving distance(in yards)? Use the
estimated value from regression output.
Q7. (5 pts) Let’s say you are going to predict/estimate certainplayer’s driving distance using this
linear model. If the player’s club speed is 110.00 mph, what is thepredicted driving distance?
Round your answer to three decimal places.
Q8. (7 pts) Let’s say there is a man who just started practicingand his club head speed is 83
mph. Would it be reasonable to predict his driving distance usingthe regression model you
figured out above? Why or why not?
Q9. (8 pts) Report the coefficient of determination in threedecimal places from regression
output. What does this value mean? “Interpret” the value ofcoefficient of determination.
Q10. (for extra credit 5 pts) You used club head speed as the “solepredictor” of driving
distance in this “simple” linear regression. What is (are) thepossible limit(s) of this simple linear
regression in predicting Driving Distance? State in LESS than 200words.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started