Note: Solve all the above questions using Python. Use Pandas, Seaborn, Sklearn, etc. libraries for all...
Fantastic news! We've Found the answer you've been seeking!
Question:
Transcribed Image Text:
Note: Solve all the above questions using Python. Use Pandas, Seaborn, Sklearn, etc. libraries for all the following analysis. Consider data given in file HW5DataA ¹. Consider the following data description: Field Gender Table 1: Data Description Description Gender of the student Location Home City of the student Quiz-1 Quiz-2 Quiz-3 Quiz-4 Major-1 Major-2 Major-3 Score of the student in Quiz-1 Score of the student in Quiz-2 Score of the student in Quiz-3 Score of the student in Quiz-4 Score of the student in Major-1 Score of the student in Major-2 Score of the student in Major-3 Final Score of the student in the final exam. Gender Male Female Female Male Female Female Male Female Male Male Male Male Male Female Male Female Female Female Female Female Female Male Male Female Female Male Female Female Female Female Female Female Male Male Male Female Quiz-1 + 17 10 7 22 21 14 21 16 20 15 13 9 10 14 2 16 15 18 8 15 17 18 12 15 20 1 9 13 9 10 20 19 LO 00 Quiz-2 17 21 21 10 15 24 2 13 23 20 13 12 17 22 8 7 16 5 5 9 SE 8 18 14 13 20 13 10 17 11 24 FER Quiz- 24 14 23 21 17 5 21 6 5 11 5 19 14 13 20 14 15 21 15 14 14 5 14 9 10 19 11 22 7 21 23 Quiz-4 16 8 12 18 14 21 25 12 21 11 10 11 10 18 196 20 8 8 19 17 11 9 12 13 0 4 9 6 6 21 18 980 Major- 100 16 62 52 19 28 87 94 46 39 83 61 41 41 79 47 31 34 57 46 Major-2 30 27 62 26 Major-3 78 88 88 95 58 82 64 62 18 46 90 28 42 63 84 33 42 95 62 44 50 74 31 18 32 52 31 29 76 62 71 85 33 36 84 18 85 43 37 81 69 40 46 FERR 52 29 21 82 B-5: Correlation Analysis. Do the following: • Calculate the correlation between all the score columns of HW5DataA. • Identify top 3 variables that are highly correlated with 'Final' score column. • Which pair of score columns are strongly correlated? Note: Solve all the above questions using Python. Use Pandas, Seaborn, Sklearn, etc. libraries for all the following analysis. Consider data given in file HW5DataA ¹. Consider the following data description: Field Gender Table 1: Data Description Description Gender of the student Location Home City of the student Quiz-1 Quiz-2 Quiz-3 Quiz-4 Major-1 Major-2 Major-3 Score of the student in Quiz-1 Score of the student in Quiz-2 Score of the student in Quiz-3 Score of the student in Quiz-4 Score of the student in Major-1 Score of the student in Major-2 Score of the student in Major-3 Final Score of the student in the final exam. Gender Male Female Female Male Female Female Male Female Male Male Male Male Male Female Male Female Female Female Female Female Female Male Male Female Female Male Female Female Female Female Female Female Male Male Male Female Quiz-1 + 17 10 7 22 21 14 21 16 20 15 13 9 10 14 2 16 15 18 8 15 17 18 12 15 20 1 9 13 9 10 20 19 LO 00 Quiz-2 17 21 21 10 15 24 2 13 23 20 13 12 17 22 8 7 16 5 5 9 SE 8 18 14 13 20 13 10 17 11 24 FER Quiz- 24 14 23 21 17 5 21 6 5 11 5 19 14 13 20 14 15 21 15 14 14 5 14 9 10 19 11 22 7 21 23 Quiz-4 16 8 12 18 14 21 25 12 21 11 10 11 10 18 196 20 8 8 19 17 11 9 12 13 0 4 9 6 6 21 18 980 Major- 100 16 62 52 19 28 87 94 46 39 83 61 41 41 79 47 31 34 57 46 Major-2 30 27 62 26 Major-3 78 88 88 95 58 82 64 62 18 46 90 28 42 63 84 33 42 95 62 44 50 74 31 18 32 52 31 29 76 62 71 85 33 36 84 18 85 43 37 81 69 40 46 FERR 52 29 21 82 B-5: Correlation Analysis. Do the following: • Calculate the correlation between all the score columns of HW5DataA. • Identify top 3 variables that are highly correlated with 'Final' score column. • Which pair of score columns are strongly correlated?
Expert Answer:
Answer rating: 100% (QA)
First lets import the necessary libraries and load the data into a Pandas DataFrame import pandas as pd import seaborn as sns import matplotlibpyplot as plt from sklearnlinearmodel import LinearRegres... View the full answer
Related Book For
Applied Regression Analysis and Other Multivariable Methods
ISBN: 978-1285051086
5th edition
Authors: David G. Kleinbaum, Lawrence L. Kupper, Azhar Nizam, Eli S. Rosenberg
Posted Date:
Students also viewed these programming questions
-
Calculate Vo and in the given circuit. Assume the source voltage V= 220 V. V 70 92 EN 20 V The value of Vis The value of lo is www V. mA. +201 www www 30 5
-
The data in Question 28 are converted to z scores here, thus allowing you to compute the standardized beta coefficients for each predictor variable from the data given in the previous question. Using...
-
The data for the visibility chart in Discussion Question are shown in Table. The visibility standard is set at 100. Readings below 100 indicate that air pollution has reduced visibility and readings...
-
Consider an asset allocation problem with one risky asset and one risk-free asset .There are four investors .Each investor maximizes a mean-variance utility function to make their optimal investment...
-
a. Mulroney recalled from her CFA studies that the constant-growth discounted dividend model was one way to arrive at a valuation for a companys common stock. She collected current dividend and stock...
-
Determine the moments at A and B, then draw the moment diagram for the beam. EI is constant. 2400 lb 200 lb/ft 'A +10 ft 30 ft-
-
What types of relevant evidence are excluded based on policy reasons? What are the policy reasons behind excluding such evidence?
-
The following table shows some simple student data as of the date 06/20/2015: The following transactions occur on 06/21/2015: Student 004 changes major from Math to Business. Student 005 is deleted...
-
A coupon bond has a face value of $1,000.with a 4.83% coupon rate. It matures in 7 years and has a yield to maturity of 7.33%. What is the price of the bond?
-
1. The model should list the given financial information for all potential projects. 2. The model should associate with each proposed project a cell that is 1 if the project is approved and 0 if it...
-
The following are the yearly returns for companies A and B: Year Company A Company B 2000 12% 14% 2001 3% 6% 2002 11% 12% 2003 12% 7% 2004 14% 10% 2005 12% 14% 2006 8% 8% 2007 11% 14% 2008 14% 13%...
-
1. Define conductors, semi-conductors and insulators. 2. Explain the laws of electrical charges
-
A steam turbine operates at steady-state with 1.8 MPa and 350C steam at its inlet and wet- steam (vapor-liquid mixture, x = 0.90) at 95C at its exit. The mass flow rate of the steam is 38 kg/s, and...
-
You work for a large investment management firm. The analysts with your firm have made the following forecasts for the returns of stock A and stock B: VERY VERY WEAK VERY WEAK WEAK AVERAGE STRONG...
-
NNH&V Cable TV operates cable TV systems in larger cities in the northern two-thirds of New Hamp- shire and Vermont. The franchise agreement with the cities requires an annual reporting to each city...
-
Where would the following opportunities and threats fall in a PEST Analysis when addressing an FQHC? Opportunities: Grants, new ways to provide new services, integration of new EHR system, expanding...
-
Toluene, C6H5CH3. The molar mass of toluene is 92g/mol and its density is 0.867 g/ml. A solution is prepared by dissolving 156 grams of benzene C6H6 (Molar Mass= 78 g/mol) in 2120 ml of toluene....
-
Government is advised to tax goods whose demand curves are inelastic if the goal is to raise tax revenues. If the goal is to discourage consumption, then it ought to tax goods whose demand curves are...
-
This time including SEX as a predictor (coded SEX = 1 if female, SEX = 0 if male). a. Examine a plot of the studentized or jackknife residuals versus the predicted values. Are any regression...
-
Random samples of 100 persons awaiting trial on felony charges were selected from rural, urban, and suburban court locations in each of two states, one (state 1) in the Northeast and the other (state...
-
Assume that the following ANOVA table came from balanced two-way fixed-effects ANOVA. Show the formula used (with numbers filled in) and the numerical value of each letter in the table. Also,...
-
The unadjusted trial balance of The Rock Industries Ltd. at January 31, 2020, appears below. Adjustment data: a. Accrued service revenue at January \(31, \$ 2,000\) b. Prepaid rent expired during the...
-
During 2020, Schubert Inc. earned revenues of \(\$ 19\) million from the sale of its products. Schubert ended the year with net income of \(\$ 4\) million. Schubert collected cash of \(\$ 20\)...
-
Journalize the adjusting entry needed on December 31, 2020, the end of the current accounting period, for each of the following independent cases affecting Lee Computer Systems Inc. (LCSI). Include...
Study smarter with the SolutionInn App