Question

Hello, please see the attachment and due dates. Please let me know if you're able to do this.

Part A: Exploratory Data Analysis

Preparation

Open the files for the Course Project and the data set.

For each of the five variables, process, organize, present and summarize the data. Analyze each variable by itself using graphical and numerical techniques of summarization. Use Excel as much as possible, explaining what the results reveal. Some of the following graphs may be helpful: stem-leaf diagram, frequency/relative frequency table, histogram, boxplot, dotplot, pie chart, bar graph. Caution: not all of these are appropriate for each of these variables, nor are they all necessary. More is not necessarily better.

In addition be sure to find the appropriate measures of central tendency, the measures of dispersion, and the shapes of the distributions (for the quantitative variables) for the above data. Where appropriate, use the five number summary (the Min, Q1, Median, Q3, Max). Once again, use Excel as appropriate, and explain what the results mean.

Analyze the connections or relationships between the variables. There are ten (10) possible pairings of two (2) variables. Use graphical as well as numerical summary measures. Explain the results of the analysis. Be sure to consider all 10 pairings. Some variables show clear relationships, whereas others do not.

Report Requirements

From the variable analysis above, provide the analysis and interpretation for three individual variables. This would include no more than one graph for each, one or two measures of central tendency and variability (as appropriate), the shapes of the distributions for quantitative variables, and two or three sentences of interpretation.

For the 10 pairings, identify and report only on three of the pairings, again using graphical and numerical summary (as appropriate), with interpretations. Please note that at least one pairing must include a qualitative variable, and at least one pairing must not include a qualitative variable.
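The five-number summary and the basic measures of centre and spread named above can also be checked outside Excel. What follows is a minimal Python sketch, not part of the assignment (which asks for Excel). The helper name `five_number_summary` is hypothetical, the sample is only the first eight Sales values from the data set, and the quartile convention is an assumption: the `inclusive` method interpolates the same way as Excel's QUARTILE.INC, and other conventions give slightly different quartiles.

```python
import statistics
from typing import Sequence, Tuple


def five_number_summary(
    values: Sequence[float],
) -> Tuple[float, float, float, float, float]:
    """Return (min, Q1, median, Q3, max) using inclusive quartiles.

    Assumption: quartiles are linearly interpolated over the sorted data
    (the "inclusive" method), which is one of several common conventions.
    """
    if len(values) < 2:
        raise ValueError("need at least two observations")
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    return (min(values), q1, median, q3, max(values))


# Illustration only: the first eight Sales values from the data set,
# not the full column.
sales_sample = [51, 34, 49, 45, 47, 47, 38, 44]

print("five-number summary:", five_number_summary(sales_sample))
print("mean:", statistics.mean(sales_sample))
print("sample standard deviation:", statistics.stdev(sales_sample))
```

The same helper can be applied to the full SALES, CALLS, TIME, or YEARS columns; TYPE is qualitative, so a frequency table is the appropriate summary there instead.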
Prepare the report in Microsoft Word, integrating graphs and tables with text explanations and interpretations. Be sure to include graphical and numerical back up for the explanations and interpretations. Be selective in what is included in the report to meet the requirements of the report without extraneous information.

All DeVry University policies are in effect, including the plagiarism policy.

Project Part A report is due by the end of Week 2.

Project Part A is worth 100 total points. See the grading rubric below.

Submission: The report, including all relevant graphs and numerical analysis along with interpretations

Format for report:
A. Brief Introduction
B. Discuss the first individual variable, using graphical, numerical summary and interpretation.
C. Discuss the second individual variable, using graphical, numerical summary and interpretation.
D. Discuss the third individual variable, using graphical, numerical summary and interpretation.
E. Discuss the first pairing of variables, using graphical, numerical summary and interpretation.
F. Discuss the second pairing of variables, using graphical, numerical summary and interpretation.
G. Discuss the third pairing of variables, using graphical, numerical summary and interpretation.
H. Conclusion

Part A: Grading Rubric
Three individual variables, 12 points each (36 points): Graphical analysis, numerical analysis (when appropriate), and interpretation
Three relationships, 15 points each (45 points): Graphical analysis, numerical analysis (when appropriate), and interpretation
Communication skills (19 points): Writing, grammar, clarity, logic, cohesiveness, adherence to the above format
Total: 100 points

A quality paper will meet or exceed all of the above requirements.

Part B: Hypothesis Testing and Confidence Intervals

The data file includes four hypotheses labeled a. - d.
a. Mean sales per week exceeds 41.5 per salesperson
b. Proportion receiving online training is less than 55%
c. Mean calls made among those with no training is less than 145
d. Mean time per call is greater than 15 minutes

1. Using the same data set from Part A, perform the hypothesis test for each speculation in order to see if there is evidence to support the manager's belief. Use the Seven Elements of a Test of Hypothesis from Section 7.1 of your textbook, as well as the p-value calculation from Section 7.3, and explain your conclusion in simple terms.
2. Compute confidence intervals (the required confidence level is included with the speculations) for each of the variables described in A-D, and interpret these intervals.
3. Write a report about the results, distilling down the results in a way that would be understandable to someone who does not know statistics. Clear explanations and interpretations are critical.
4. All DeVry University policies are in effect, including the plagiarism policy.
5. Project Part B report is due by the end of Week 6.
6. Project Part B is worth 100 total points. See grading rubric below.

Format for report:
Summary Report (about one paragraph on each of the speculations a. - d.)
Appendix with the calculations of the Seven Elements of a Test of Hypothesis, the p-values, and the confidence intervals; include the Excel formulas used in the calculations.

Part B: Grading Rubric
Addressing each speculation, 20 points each (80 points): Hypothesis test, interpretation, confidence interval, and interpretation
Summary report clarity (20 points): One paragraph on each of the speculations
Total: 100 points

A quality paper will meet or exceed all of the above requirements.

Part C: Regression and Correlation Analysis

Use the dependent variable (labeled Y) and the independent variables (labeled X1, X2, and X3) in the data file. Use Excel to perform the regression and correlation analysis to answer the following.

1. Generate a scatterplot for the specified dependent variable (Y) and the X1 independent variable, including the graph of the "best fit" line. Interpret.
2. Determine the equation of the "best fit" line, which describes the relationship between the dependent variable and the selected independent variable.
3. Determine the coefficient of correlation. Interpret.
4. Determine the coefficient of determination. Interpret.
5. Test the utility of this regression model. Interpret results, including the p-value.
6. Based on the findings in Steps 1-5, analyze the ability of the independent variable to predict the designated dependent variable.
7. Compute the confidence interval for β1 (the population slope) using a 95% confidence level. Interpret this interval.
8. Using an interval, estimate the average for the dependent variable for a selected value of the independent variable. Interpret this interval.
9. Using an interval, predict the particular value of the dependent variable for a selected value of the independent variable. Interpret this interval.
10. What can be said about the value of the dependent variable for values of the independent variable that are outside the range of the sample values? Explain.

In an attempt to improve the model, use a multiple regression model to predict the dependent variable, Y, based on all of the independent variables, X1, X2, and X3.

11. Using Excel, run the multiple regression analysis using the designated dependent and three independent variables. State the equation for this multiple regression model.
12. Perform the Global Test for Utility (F-Test). Explain the conclusion.
13. Perform the t-test on each independent variable. Explain the conclusions and clearly state how the analysis should proceed. In particular, which independent variables should be kept and which should be discarded. If any independent variables are to be discarded, re-run the multiple regression, including only the significant independent variables, and summarize results with discussion of analysis.
14. Is this multiple regression model better than the linear model generated in parts 1-10? Explain.
15. All DeVry University policies are in effect, including the plagiarism policy.
16. Part C report is due by the end of Week 7.
17. Part C is worth 100 total points. See grading rubric below.

Summarize your results from Steps 1-14 in a three-page report. The report should explain and interpret the results in ways that are understandable to someone who does not know statistics.

Submission: The summary report and all of the work done in 1-14 (Excel output and interpretations) as an appendix

Format for report:
A. Summary Report
B. Points 1-14 should be addressed with appropriate output, graphs, and interpretations. Be sure to number each point 1-14.

Part C: Grading Rubric
Steps 1-12 and step 14, worth 5 points each (65 points): Addressed with appropriate output, graphs, and interpretations
Step 13 (15 points): Addressed with appropriate output, graphs, and interpretations
Communication skills (20 points): Writing, grammar, clarity, logic, and cohesiveness
Total: 100 points

A quality paper will meet or exceed all of the above requirements.
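The simple-regression quantities requested in Part C (the "best fit" line, the coefficient of correlation, the coefficient of determination, and the slope test) all follow from the same least-squares sums. The sketch below is an illustration in Python rather than the Excel output the assignment asks for. The names `LineFit`, `fit_line`, and `slope_interval` are hypothetical, and the critical t value is assumed to be supplied by the caller (for example from a t table, or from Excel's T.INV.2T with n - 2 degrees of freedom), because the Python standard library has no t distribution.

```python
import math
from typing import NamedTuple, Sequence, Tuple


class LineFit(NamedTuple):
    intercept: float   # b0 in y-hat = b0 + b1 * x
    slope: float       # b1
    r: float           # coefficient of correlation
    r_squared: float   # coefficient of determination
    slope_se: float    # standard error of b1
    t_stat: float      # test statistic for H0: slope = 0


def fit_line(x: Sequence[float], y: Sequence[float]) -> LineFit:
    """Least-squares fit of y on x with the usual summary statistics."""
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("x and y need the same length, at least 3")
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    syy = sum((yi - y_bar) ** 2 for yi in y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("x and y must each vary")
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    r = sxy / math.sqrt(sxx * syy)
    # Residual sum of squares; clamp tiny negative rounding errors to zero.
    sse = max(syy - slope * sxy, 0.0)
    slope_se = math.sqrt(sse / (n - 2)) / math.sqrt(sxx)
    t_stat = slope / slope_se if slope_se > 0 else math.inf
    return LineFit(intercept, slope, r, r * r, slope_se, t_stat)


def slope_interval(fit: LineFit, t_crit: float) -> Tuple[float, float]:
    """Confidence interval for the slope, given a caller-supplied t value."""
    half_width = t_crit * fit.slope_se
    return fit.slope - half_width, fit.slope + half_width


# Illustration only: the first five (Calls, Sales) rows of the data set.
calls = [167, 133, 161, 185, 176]
sales = [51, 34, 49, 45, 47]
print(fit_line(calls, sales))
```

The interval estimates in steps 8 and 9 and the multiple regression in steps 11-14 are not covered by this sketch; they need additional formulas or a statistics package.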
Sales (Y)  Calls (X1)  Time (X2)  Years (X3)  Type
51         167         14.9       5           ONLINE
34         133         17.9       4           GROUP
49         161         19         3           NONE
45         185         15.7       1           ONLINE
47         176         16.6       2           ONLINE
47         183         15.1       2           ONLINE
38         122         22.8       3           GROUP
44         171         16         3           GROUP
47         157         16.9       1           GROUP
37         148         18.5       3           GROUP
51         177         13.5       4           NONE
40         144         20.5       0           NONE
48         136         15.7       2           ONLINE
52         197         16.5       2           ONLINE
46         145         19.8       0           ONLINE
42         167         20.9       3           ONLINE
37         120         14.2       2           NONE
42         148         19.9       1           NONE
43         131         21.8       1           NONE
49         184         19.7       2           ONLINE
44         150         21.7       1           NONE
43         148         18.8       1           ONLINE
55         189         14.2       1           ONLINE
37         152         23.4       0           GROUP
44         148         15.9       3           ONLINE
43         169         15.7       4           NONE
49         188         24.1       1           ONLINE
45         164         19.7       3           NONE
45         146         14.2       3           GROUP
43         173         23.4       2           ONLINE
47         164         18.1       0           ONLINE
48         177         16.4       3           ONLINE
49         160         16         3           GROUP
51         190         13.3       1           ONLINE
42         135         19         0           NONE
37         137         21.4       1           ONLINE
51         167         19.1       1           ONLINE
44         169         10.5       0           ONLINE
46         149         21         3           NONE
42         153         18.3       2           ONLINE
45         140         13         3           GROUP
37         133         23.4       2           ONLINE
52         173         21.9       0           ONLINE
39         156         15.7       4           NONE
45         130         24.3       3           GROUP
37         130         18.4       1           ONLINE
40         125         14.4       4           NONE
44         182         18.3       4           NONE
48         165         23.4       5           ONLINE
42         154         17.5       2           ONLINE
53         178         15.6       2           ONLINE
37         142         21.8       1           NONE
46         153         16.6       1           ONLINE
43         166         20.8       3           ONLINE
45         138         22.3       2           NONE
42         167         21.2       2           NONE
48         171         15.3       2           ONLINE
39         149         22.2       1           GROUP
46         151         18.9       1           GROUP
46         162         19.1       2           ONLINE
45         158         16.4       1           ONLINE
44         188         15.2       3           GROUP
49         149         24.9       2           ONLINE
41         157         13.6       3           ONLINE
48         156         17.8       4           ONLINE
46         172         14.8       1           ONLINE
48         174         21.9       2           GROUP
47         188         19.2       1           ONLINE
54         180         13.9       4           ONLINE
45         173         20.8       2           ONLINE
58         174         17.9       1           ONLINE
42         138         19.1       2           GROUP
50         145         22.3       3           GROUP
49         149         21.4       3           ONLINE
51         152         14.3       2           GROUP
57         167         17.1       2           ONLINE
59         164         12.7       3           GROUP
53         165         16.2       2           ONLINE
49         129         17.9       3           GROUP
46         148         22.8       3           ONLINE
48         135         21.9       3           GROUP
45         140         12.7       2           GROUP
58         172         12.4       2           ONLINE
52         183         15.9       3           ONLINE
48         138         17.1       5           ONLINE
43         135         20.2       3           GROUP
55         174         18.4       3           ONLINE
44         128         20.9       4           GROUP
59         187         13.9       2           ONLINE
46         145         16         4           GROUP
41         118         18.3       2           GROUP
42         150         12.5       3           GROUP
46         138         15.5       1           GROUP
45         167         16.8       2           GROUP
43         143         17.9       3           GROUP
41         143         17.3       1           GROUP
49         152         26.3       0           ONLINE
44         169         16         1           ONLINE
49         166         19.1       0           ONLINE
37         145         21.2       3           NONE

The following is a SUMMARY of the requirements for each project. See Course home > Course project for detailed requirements that apply to each paper.

5 variables
SALES represents the number of sales made this week.
CALLS represents the number of sales calls made this week.
TIME represents the average time per call this week.
YEARS represents years of experience in the call center.
TYPE represents the type of training the employee received.

Week2/PartA paper - Descriptive statistics and graphs. Due November 6
Analyze/interpret 3 individual variables using graphical and numerical analysis
Report your findings for 3 pairs of variables using graphical and numerical analysis
1 pair must include a qualitative variable and 1 pair must NOT include a qualitative variable

Week6/PartB paper - Confidence intervals and hypothesis testing. Due December 6
4 speculations (use alpha = .10)
a. Mean sales per week exceeds 41.5 per salesperson
b. Proportion receiving online training is less than 55%
c. Mean calls made among those with no training is less than 145
d. Mean time per call is greater than 15 minutes
Report your conclusions on each of the 4 speculations. Use the seven elements of a hypothesis test AND the p-value to explain your findings in simple terms
Compute then explain/interpret confidence intervals for each of the 4 variables listed above

Week7/PartC paper - Regression analysis. Due December 7
Calculate then explain/interpret the following 14 tasks/tests using the dependent variable (labeled Y) and the independent variables (labeled X1, X2, and X3)
1. Generate a scatterplot for Y vs X1, including the graph of the "best fit" line. Interpret.
2. Determine the equation of the "best fit" line
3. Determine the coefficient of correlation. Interpret.
4. Determine the coefficient of determination. Interpret.
5. Test the utility of this regression model. Interpret results, including the p-value.
6. Based on items 1-5, analyze the ability of X1 to predict Y
7. Compute a 95% CI for β1 (the population slope). Interpret this interval.
8. Estimate the average for the dependent variable when X1 = 170 using an interval. Interpret.
9. Predict the value of the dependent variable when X1 = 170 using an interval. Interpret.
10. What can be said about the value of the dependent variable for values of X1 outside the range of the sample values? Explain.
Build a model to predict the dependent variable/Y using all of the independent variables/X1, X2, and X3
11. Prepare a multiple regression model using the designated dependent and 3 independent variables. Explain the equation for this multiple regression model in simple terms.
12. Perform the Global Test for Utility (F-Test). Explain the conclusion.
13. Perform the t-test on each independent variable. Explain the conclusions including your recommendation on which independent variables should be kept and which should be discarded. If any independent variables are to be discarded, re-run the multiple regression, including only the significant independent variables, and summarize your finding on this final model.
14. Is this multiple regression model better than the linear model generated in parts 1-10? Explain.

This has been a SUMMARY of the requirements for each project. See Course home > Course project for detailed requirements that apply to each paper.
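The Part B speculations are one-sided tests of a mean or a proportion at alpha = .10, each paired with a confidence interval. As a rough illustration of the arithmetic, here is a Python sketch that uses the large-sample normal approximation. That approximation is an assumption: for a small subgroup, such as the employees with no training, a t distribution would normally be used instead, and the textbook's procedure takes precedence. The function names are hypothetical, and the counts and values in the usage lines are made up for demonstration; they are not results from the data set.

```python
import math
from statistics import NormalDist, mean, stdev
from typing import Sequence, Tuple

STANDARD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def _tail_p_value(z: float, tail: str) -> float:
    """One-sided p-value for a z statistic."""
    if tail == "greater":
        return 1.0 - STANDARD_NORMAL.cdf(z)
    if tail == "less":
        return STANDARD_NORMAL.cdf(z)
    raise ValueError("tail must be 'greater' or 'less'")


def mean_test(values: Sequence[float], mu0: float, tail: str) -> Tuple[float, float]:
    """Large-sample test of H0: mean = mu0. Returns (z, p-value)."""
    std_error = stdev(values) / math.sqrt(len(values))
    z = (mean(values) - mu0) / std_error
    return z, _tail_p_value(z, tail)


def proportion_test(successes: int, n: int, p0: float, tail: str) -> Tuple[float, float]:
    """Large-sample test of H0: p = p0. Returns (z, p-value)."""
    std_error = math.sqrt(p0 * (1.0 - p0) / n)
    z = (successes / n - p0) / std_error
    return z, _tail_p_value(z, tail)


def mean_interval(values: Sequence[float], confidence: float) -> Tuple[float, float]:
    """Large-sample confidence interval for a mean."""
    z_crit = STANDARD_NORMAL.inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    half_width = z_crit * stdev(values) / math.sqrt(len(values))
    centre = mean(values)
    return centre - half_width, centre + half_width


ALPHA = 0.10  # significance level stated in the project summary

# Hypothetical inputs for demonstration only, not taken from the data set.
z, p = proportion_test(successes=50, n=100, p0=0.55, tail="less")
print(f"proportion: z = {z:.3f}, p-value = {p:.4f}, reject H0: {p < ALPHA}")

z, p = mean_test([40, 42, 44, 46, 48], mu0=41.5, tail="greater")
print(f"mean: z = {z:.3f}, p-value = {p:.4f}, reject H0: {p < ALPHA}")
print("90% interval:", mean_interval([40, 42, 44, 46, 48], 0.90))
```

In each case the null hypothesis is rejected only when the p-value falls below alpha; the interval gives the range of plausible values for the same parameter.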
