Answered step by step
Verified Expert Solution
Link Copied!
Question
1 Approved Answer

Cumulative Assignment #1 - Statistical Analysis Data Analysis Project Description The goal of this culminating statistics project is to identify the factors that might have

Cumulative Assignment #1 - Statistical Analysis Data Analysis Project Description The goal of this culminating statistics project is to identify the factors that might have some influence on a chosen topic/variable. You will be collecting and analysing data for three different variables using a secondary source. There will be one dependent variable and two independent variables for which data will be needed. For example, what are the factors that affect climate change? If climate change was your dependent variable, what could you pick for your independent variable? You will choose 2 factors that will create models that you can compare, to see if you can find a high correlation. Your final report and summary presentation will include the following sections. Section 1 Dependent variable: Choose the main variable that you want to measure versus other variables. This will go on the vertical axis of your scatter plot. Provide rationale for why you chose this variable and brainstorm different variables that may impact it. Research the variable on the internet and collect data for at least 10 years (the more the better) or over at least 10 different places (definition: Provinces and territories across Canada, countries in the world, or teams in a sport are some examples.) in the same year. Note: Statistics Canada is our most reliable source for data and one of the best sources to get a wide variety of data. Be sure to cite your source in proper MLA or APA format. Section 2 Independent variables: Choose the independent variables that you want to measure against the dependent variable. You will find 2 different variables for which you can create a line of best fit with to test correlation. This will be your biggest challenge of the assignment. You need to find variables such that they overlap with the data that you chose for your dependent variable. For example: If you found the years 1990 to 2010 for your dependent variable, you will need to find those same years for each of your independent variables. If you had data across the provinces and territories for your independent variable for 2016, you will need data across the provinces and territories for 2016 for each of your independent variables. Note: Be sure to cite your source in proper MLA or APA format. Section 3 One variable measures with graphs and distributions: Describe each of the 3 of your variables (two independent and one dependent) by calculating and making reference to the measures of central tendency (definition: mean, median and modal interval), spread (definition: IQR with Q1 and Q3, range and standard deviation), and creating a histogram and box plots. Describe the distribution (definition: left or right skewed, normal, uniform, bimodal, etc.) of your data as well. For each of the variables, describe the meaning of each of the measures as you state what they are. Section 4 Two variable measures with linear models and correlations: Create 2 scatterplots, one for each of your independent variables. Create the linear model to describe the relationship. Describe the strength of the correlation and the meaning of the slopes and y-intercepts. Section 5 Identify and remove relevant points: Identify one point on each scatter plot that has the greatest impact on the model by using residual plots. Describe the percentile that the point would be if the data was normally distributed, by using the mean, standard deviation and z-Score. Describe the impact of the point by comparing the new linear model to the original. Section 6 Classification of correlation type: For each of the models, classify the correlation type as cause and effect, reverse cause and effect, accidental or common cause. Explain your choice and brainstorm possible common cause factors where applicable. Section 7 Conclusions from linear models: Make conclusions from the two variable analysis taking into consideration the removal of points that are far from the line of best fit. Discuss the overall impact, if any, that the independent variables have on the dependent variable by referring to other sections of the report. Section 8 Areas for improvement and recommendations for further exploration: Identify the areas of improvement in your study. If you had more time and resources, what are you still curious about that you may want to pursue further. Identify the possible biases in your investigation and overall how confident you are with your conclusions. Note: It is perfectly fine to say that the conclusions come with many red flags and should merely be used to begin a discussion on further studies. Considerations Take a moment to consider what a successful report must look like. Choose two of the sections, and comment on how you will know if you have been successful in this section. Refer back to this discussion when selfassessing your report before submitting. Consider the following questions: What counts/matters/is important when writing this section of the report? How will you know if it has been communicated with a high degree of understanding? Search the internet and Statistics Canada for some potential variables that you will use in your investigation. You will first want to find a good dependent variable that you can get a lot of data for. It should be something you are interested in so that you get the most out of this assignment. After you find your dependent variable with as many data points as possible, search for the potential factors that may have a relationship with it. Consider a few steps ahead: What type (definition: cause and effect, reverse cause and effect, common cause, accidental) of correlation would it be, if a strong correlation is determined? Would the correlation be accidental? If so, this may not be a good choice as an independent variable. If there is no correlation, is that an interesting conclusion? If so, then it is likely a good choice for an independent variable. If you are having trouble finding 2 variables, you can think about considering a large general variable like the ones on the statistics for 42 Countries. Summarize your main points from each section to communicate the findings of your study to peers.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image
Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Making Hard Decisions with decision tools

Authors: Robert Clemen, Terence Reilly

3rd edition

538797576, 978-0538797573

More Books

Students explore these related Mathematics questions