Question
Using Rstudio: Analytics is used in many different sports and has become popular with the Money Ball movie. The pgatour2006.csv dataset contains data about 196
Using Rstudio:
Analytics is used in many different sports and has become popular with the Money Ball movie. The pgatour2006.csv dataset contains data about 196 tour players in 2006. The variables in the dataset are:
Players name
PrizeMoney = average prize money per tournament And a set of metrics that evaluate the quality of a players game.
DrivingAccuracy = percent of times a player is able to hit the fairway with his tee shot
GIR = percent of time a player was able to hit the green within two or less than par (Greens in
Regulation)
BirdieConversion = percentage of times a player makes a birdie or better after hitting the green
in regulation
PuttingAverage = putting performance on those holes where the green was hit in regulation.
PuttsPerRound= average number of putts per round (shots played on the green)
You are asked to build a model for PrizeMoney using the remaining predictors, and to evaluate the relative importance of each different aspects of a players game on the average prize money.
For the non golfers in the class, you can refer to this page for an explanation of the terms:
http://en.wikipedia.org/wiki/Glossary_of_golf
a) Create scatterplots to visualize the associations between PrizeMoney and the other five variables. Discuss the patterns displayed by the scatterplot. Do the associations appear to be linear? (you can create scatterplots or a matrix plot) [1 pt R code, 1 pt scatterplots, 2 pts answer = 4 pts]
b) Analyze distribution of PrizeMoney, and discuss if the distribution is symmetric or skewed. [1 pt R code, 1 pt answer = 2 pts]
c) Apply a log transformation to PrizeMoney and compute the new variable ln_Prize=log(PrizeMoney). Analyze distribution of ln_Prize, and discuss if the distribution is symmetric or skewed. [2 pts R code, 1 pt answer = 3 pts]
d) Fit a regression model of ln_Prize using the remaining predictors in your dataset. Apply your knowledge of regression analysis to define a valid model to predict ln_Prize. Hint: use scatterplots and correlation [3 pts R code, 1 pt answer = 4 pts]
If necessary remove not significant variables. Remember to remove one variable at a time
(variable with largest p-value is removed first) and refit the model, until all variables are
significant. [2 pts R code, 1 pt answer = 3 pts]
Analyze residual plots to check if the regression model is valid for your data. [1 pt R code, 1
pt answer = 2 pts]
Analyze if there are any outliers and influential points. If there are points in the dataset that need to be investigated, give one or more reason to support each point chosen. [1 pt R code, 1 pt answer = 2 pts]
pgatour2006_small.csv :-
Name,PrizeMoney,DrivingAccuracy ,GIR,PuttingAverage,BirdieConversion,PuttsPerRound Aaron Baddeley,60661,60.73,58.26,1.745,31.36,27.96 Adam Scott,262045,62,69.12,1.767,30.39,29.28 Alex Aragon,3635,51.12,59.11,1.787,29.89,29.2 Alex Cejka,17516,66.4,67.7,1.777,29.33,29.46 Arjun Atwal,16683,63.24,64.04,1.761,29.32,28.93 Arron Oberholser,107294,62.53,69.27,1.775,29.2,29.56 Bart Bryant,50620,72.76,68.67,1.812,24.95,30.06 Ben Crane,57273,63.51,62.01,1.736,32.28,28.46 Ben Curtis,86782,66.61,65.25,1.798,26.36,29.5 Bernhard Langer,23396,62.41,65.66,1.778,28.94,29.17 Bill Haas,29567,57.71,64.24,1.786,29.39,29.04 Billy Andrade,44080,62.84,66.44,1.788,27.7,29.31 Billy Mayfair,47172,70.14,66.55,1.796,26.65,29.36 Bo Van Pelt,49640,60.72,66.45,1.772,31.09,29.24 Bob Estes,53610,64.01,66.73,1.777,28.07,29.16 Bob May,26129,61.9,69.09,1.785,27.86,29.47 Bob Tway,11989,61.16,64.17,1.771,27.68,29.08 Brad Faxon,20911,57.56,58.05,1.771,28.13,28.5 Brandt Jobe,28658,61.12,67.23,1.79,28.02,29.59 Brent Geiberger,19683,67.04,64.95,1.778,28.04,28.96 Brett Quigley,79316,58.85,67.83,1.779,29.04,29.19 Brett Wetterich,120927,61.46,67.82,1.766,32.23,29.3 Brian Bateman,24814,59.78,64.53,1.808,25.53,29.29 Brian Davis,27224,70.06,65.37,1.79,28.34,29.21 Brian Gay,33471,68.64,63.19,1.727,31.05,28.05 Briny Baird,33782,71.39,70.2,1.777,28.87,29.64 Bubba Dickerson,20322,62.81,65.63,1.777,29.89,29.32 Bubba Watson,37751,51.5,66.74,1.797,30.66,29.56 Camilo Villegas,60073,58.06,65.29,1.78,30.55,29.29 Carl Pettersson,94571,61.01,62.31,1.776,29.62,28.84 Carlos Franco,15668,55.71,65.18,1.814,27.98,29.9 Chad Campbell,112443,59.77,67.43,1.795,28.02,29.51 Charles Howell III,51770,56.39,65.89,1.823,26.23,29.75 Charles Warren,37735,64.09,67.96,1.797,27.51,29.41 Charley Hoffman,38455,58.01,66.61,1.753,31.94,28.94 Chris Couch,50249,60.98,66.15,1.793,30.53,29.83 Chris DiMarco,59151,64.34,66.45,1.772,29.22,29.22 Chris Riley,18345,70.55,62.47,1.755,30.03,28.61 Chris Smith,8734,66.02,69.32,1.806,27.01,30.18 Corey Pavin,56873,67.66,63.57,1.815,23.43,28.91 Craig Barlow,45752,58.04,65.75,1.769,30.54,29.41 D.A. Points,14499,62.37,62.92,1.773,30.35,28.95 D.J. Trahan,31371,64.04,64.01,1.78,29.03,29.26 Daisuke Maruyama,38275,71.75,66.24,1.783,28.5,29.23 Daniel Chopra,46377,56.77,63.5,1.712,34.75,28.21 Danny Ellis,16630,60.58,62.59,1.764,28.93,28.84 Darron Stiles,10504,68.23,69.67,1.789,27.17,29.67 David Duval,13262,51.09,60.06,1.767,30.91,28.87 David Howell,65174,62.38,63.21,1.73,32.88,28.89 David McKenzie,15187,67.41,65.83,1.777,28.69,29.19 David Toms,132327,69.18,66.18,1.759,31.43,29.01 Davis Love III,119444,59.89,66.59,1.751,31.53,29 Dean Wilson,73819,63.47,65.27,1.765,29.26,28.83 Doug Barron,13865,67.49,60.43,1.781,28.59,28.58 Dudley Hart,26301,58.13,63.79,1.783,28.99,28.92 Duffy Waldorf,22340,67.92,66.24,1.803,28.35,29.47 Eric Axley,43951,63.68,65.73,1.748,31.14,28.8 Ernie Els,129234,57.62,63.72,1.759,30.51,28.72 Frank Lickliter II,57092,66.62,67.98,1.763,29.87,29.04 Fred Couples,45904,54.36,65.33,1.816,27.01,30.02 Fred Funk,54477,78.01,66.35,1.785,27.1,29.33 Fredrik Jacobson,43820,62.45,63.99,1.76,29.61,28.79 Geoff Ogilvy,217748,61.89,62.94,1.769,31.44,28.93 Greg Chalmers,5402,53.9,59.04,1.764,29.06,28.63 Greg Kraft,10528,70.01,62.74,1.77,27.22,28.95 Greg Owen,54862,67.13,68.54,1.812,28.87,29.64 Harrison Frazar,30656,60.94,64.04,1.748,32.7,28.55 Heath Slocum,39356,74.67,67.8,1.792,27.84,29.58 Henrik Bjornstad,15840,62.55,64.39,1.8,27.3,29.32 Hidemichi Tanaka,2240,58.88,57.91,1.824,23.92,29.26 Hunter Mahan,38188,68.31,67.4,1.777,31.06,29.65 Ian Leggatt,13031,61.1,63.23,1.793,27.93,29.28 Ian Poulter,103594,70.19,66.03,1.772,28.8,29.06 J.B. Holmes,57216,54.13,65.3,1.802,31.56,29.75 J.J. Henry,82196,60.03,68.16,1.788,28.12,29.69 J.L. Lewis,25804,64.2,62.67,1.802,26.37,29.23 J.P. Hayes,36918,66.88,64.24,1.753,32.6,28.84 James Driscoll,7583,50.54,61.32,1.817,28.83,29.39 Jason Bohn,57824,66.78,65.87,1.797,27.89,29.18 Jason Gore,24724,59.95,63.03,1.796,29.97,29.5 Jason Schultz,5265,59.43,63.19,1.808,27.73,29.45 Jeff Brehaut,16927,66.17,67.3,1.787,29.35,29.67 Jeff Gove,27361,65.65,72.03,1.813,25.8,30.11 Jeff Maggert,55014,71.86,65.36,1.789,27.5,29.33 Jeff Overton,20612,58.28,65.29,1.793,28.41,29.18 Jeff Sluman,43173,64.81,65.24,1.763,29.54,29.05 Jerry Kelly,56058,70.26,67.11,1.781,28.46,29.17 Jerry Smith,19594,69.66,66.55,1.78,28.79,29.35 Jesper Parnevik,54513,61.37,64.42,1.745,30.9,28.58 Jim Furyk,300555,73.85,70.71,1.742,30.47,28.85 Jimmy Walker,7331,49.75,61.97,1.764,28.84,29.03 Joe Durant,100398,78.43,69.75,1.785,28.67,29.81 Joe Ogilvie,37004,64.07,65.08,1.757,30.99,28.7 Joey Sindelar,27673,65.58,68.64,1.799,27.09,30 John Cook,29296,70.65,66.33,1.789,27.07,29.31 John Daly,9149,52.48,60.88,1.79,31.47,29.37 "John Engler, Jr.",2692,58.07,59.78,1.809,25.28,29.36 John Huston,15964,58.74,62.58,1.777,30.24,28.92 John Rollins,53530,64.22,66.12,1.755,30.73,29.15 John Senden,58953,64.95,71.15,1.793,27.23,29.84 Jon Mills,2426,57.05,62.71,1.835,24.36,29.65 Jonathan Byrd,70421,64.15,69.61,1.75,33.09,29.06 Jonathan Kaye,18085,65.37,68.42,1.828,25.8,29.99 Jose Maria Olazabal,117801,59.68,65.52,1.77,28.89,29.1 Justin Leonard,30068,67.24,63.36,1.779,26.34,28.87 Justin Rose,58189,64,67.89,1.759,31.72,29 K.J. Choi,91406,65.03,68.12,1.792,27.53,29.51 Kenny Perry,37214,68.45,68.75,1.813,26.46,30.19 Kent Jones,26899,67.95,65.46,1.762,29.54,29.2 Kevin Sutherland,25918,64.52,68.29,1.796,27.15,29.5 Kirk Triplett,42589,72.76,65.28,1.757,29.05,28.79 Kris Cox,18494,59.31,66.39,1.783,28.66,29.3 Larry Mize,12110,74.33,64.16,1.803,23.17,29.18 Lee Janzen,18721,56.39,63.08,1.765,30.58,29.23 Len Mattiace,3025,56.71,58.85,1.773,29.21,28.75 Lucas Glover,83483,63.69,67.33,1.763,32.02,29.15 Luke Donald,176523,66.86,66.05,1.752,30.19,28.54 Marco Dawson,20188,61.17,64.94,1.752,30.74,28.67 Mark Brooks,5777,69.35,62.92,1.799,23.86,29.27 Mark Calcavecchia,26123,65.3,62.07,1.792,30.67,29.37 Mark O'Meara,11315,57.35,59.44,1.778,27.66,28.7 Mark Wilson,18513,67.23,67.98,1.758,29.85,29.28 Mathew Goggin,41390,63.5,67.14,1.824,27.78,30.16 Mathias Gronberg,22467,62.51,66.54,1.8,29.17,29.39 Matt Hansen,7490,59.85,66.92,1.839,27.36,30.15 Michael Allen,18838,58.01,62.7,1.746,31.86,28.48 Michael Connell,4444,60.99,64.12,1.819,26.4,30 Mike Sposa,5285,65.71,63.16,1.823,26.93,29.37 Mike Weir,78489,64.1,66.47,1.752,30.73,29.06 Nathan Green,56693,63.1,63.03,1.757,31.32,28.56 Nicholas Thompson,8272,67.44,65.35,1.829,27.17,30.12 Nick Watney,42890,61.78,65.38,1.788,31.72,29.08 Olin Browne,25135,73.56,64.32,1.764,29.06,28.86 Omar Uresti,26532,74.3,66.38,1.807,25.89,29.34 Padraig Harrington,89312,66.14,65.22,1.758,31.18,29.04 Pat Perez,37869,59.31,64.67,1.771,30.05,29.04 Patrick Sheehan,11376,64.99,63.89,1.788,26.92,28.99 Paul Azinger,23403,62.65,62.39,1.769,28,28.38 Paul Goydos,37100,72.9,65.28,1.793,26.98,29.17 Paul Stankowski,14527,61.23,65.87,1.811,26.9,29.71 Peter Lonard,38046,64.19,63.16,1.824,25.35,29.51 Phil Mickelson,224027,58.61,68.28,1.731,35.66,28.91 Retief Goosen,145414,57.28,65.46,1.771,30.26,28.98 Rich Beem,24379,63.24,66.2,1.787,29.25,29.6 Richard S. Johnson,53634,67.99,66.02,1.764,28.43,28.94 Robert Allenby,68345,69.33,67.89,1.746,31.09,29.1 Robert Damron,14558,62.93,63.55,1.781,27.87,28.93 Robert Gamez,16455,69.51,64.83,1.788,27.67,29.26 Robert Garrigus,19200,57.25,64.66,1.792,31.43,29.28 Rod Pampling,111028,60.46,62.5,1.745,30.65,28.44 Roger Tambellini,4667,57.53,65.39,1.833,26.43,30.18 Ron Whittaker,10715,68.57,66.03,1.792,26.2,29.63 Rory Sabbatini,119240,56.49,64.52,1.781,30.18,28.92 Ryan Hietala,3816,51.03,60.08,1.809,26.96,29.36 Ryan Moore,51005,66.96,63.25,1.773,30.27,28.69 Ryan Palmer,36428,62.03,64.2,1.764,30.93,28.8 Ryuji Imada,32843,58.62,64.81,1.764,28.04,28.87 Scott Gutschewski,19973,64.98,66.67,1.77,29.06,29.53 Scott Verplank,69173,75.23,66.02,1.755,29.9,28.72 Sean O'Hair,47046,63.7,64.74,1.8,29.74,29.56 Sergio Garcia,91808,61.11,67.47,1.802,30.43,29.78 Shane Bertsch,20502,67.35,63.89,1.751,30.28,28.71 Shaun Micheel,56305,61.79,67.62,1.777,28.05,29.53 Shigeki Maruyama,38471,61.37,63.59,1.749,30.86,28.58 Skip Kendall,19997,72.08,67.21,1.77,27.24,29.15 Stephen Ames,114055,62.98,66.49,1.752,30.41,29.05 Stephen Leaney,27657,65.81,67.13,1.755,27.88,28.95 Steve Elkington,15012,73.91,69.61,1.817,24.96,29.88 Steve Flesch,42958,65.59,65.99,1.771,28.43,28.88 Steve Jones,11421,64.51,62.8,1.809,26.24,29.33 Steve Lowery,36289,62.52,65.46,1.782,28.33,29.14 Steve Stricker,106577,66.98,68.01,1.734,30.56,28.26 Stewart Cink,105997,59.72,64.9,1.756,30.99,28.75 Stuart Appleby,150889,59.62,63.53,1.743,31.53,28.63 Tag Ridings,15098,54.1,61.22,1.766,31.72,28.79 Ted Purdy,36861,65.86,67.05,1.796,28.62,29.66 Thomas Levet,9062,64.01,65.9,1.851,23.71,30.1 Tiger Woods,662771,60.71,74.15,1.756,35.26,29.38 Tim Clark,89770,67.27,65.96,1.763,30.66,29.03 Tim Herron,65783,58.28,62.62,1.802,27.17,29.44 Tim Petrovic,20064,63.54,61.29,1.799,27.35,29.01 Tjaart van der Walt,11187,69.69,64.52,1.808,25.45,29.42 Todd Fischer,11309,64.83,62.23,1.744,29.27,28.59 Todd Hamilton,6117,56.32,56.87,1.792,27.38,29.01 Tom Lehman,84604,60.96,65.93,1.827,24.37,30.08 "Tom Pernice, Jr.",72623,65.28,64.53,1.756,31.7,28.88 Tommy Armour III,14098,66.74,63.76,1.752,29.97,29.08 Trevor Immelman,160175,62.08,69.06,1.771,30.06,29.26 Troy Matteson,55581,59.45,66.34,1.803,28.12,29.52 Vance Veazey,10354,57.58,61.58,1.752,30.02,28.93 Vaughn Taylor,68613,63.31,64.08,1.725,34.26,28.67 Vijay Singh,170460,59.4,67.83,1.753,31.77,28.89 "Wes Short, Jr.",12803,57.78,64.18,1.782,30.56,29.7 Will MacKenzie,30344,62.91,67.99,1.809,28.65,29.97 Woody Austin,38043,61.11,62.67,1.773,30.49,28.72 Zach Johnson,90824,69.63,66.86,1.774,26.94,29.32
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started