Question: I need to see the SPSS output. You need to have all z-scores, all charts, all descriptives data from SPSS, everything you used to answer

 I need to see the SPSS output. You need to have

I need to see the SPSS output. You need to have all z-scores, all charts, all descriptives data from SPSS, everything you used to answer the questions. I am sending you what the previous tutor sent me and it was of no help after I was charged.

all z-scores, all charts, all descriptives data from SPSS, everything you used

Assignment Module 2 Solutions 1. Use the procedure: Analyze, descriptive statistics, descriptive, save standardized values. You observe that the beverages with positive standardized scores on every variable are: UA UH UR SA. This means that these variables have a mean greater than the overall mean. 2. The most extreme value on each variable Variable Z-score Price 3.7052412703410544 Cost 3.706355433894953 Calories -2.9358060795038656 Sodium 2.0085931521403975 Alcohol -3.7765077890763545 The most extreme Z-score for all the variables is (-) 3.7765077890763545 3. Alcohol 4. 68% of the data lies within 1 standard deviation of the mean if the data is normally distributed. The percentage of beverages with an absolute z-score above 1 is close to 68% meaning the data are normal. Assignment Module 2 Problem Set Due Date: Apr 13, 2016 23:59:59 Max Points: 85 Details: Understanding the data sampling procedure and the description of the data is critical to accurately interpreting the results of a study. In this assignment, you will practice describing data from an SPSS data set. General Requirements: Use the following information to ensure successful completion of the assignment: Review "SPSS Access Instructions" for information on how to access SPSS for this assignment. Access the document, "Introduction to Statistical Analysis Using IBM SPSS Statistics, Student Guide" to complete the assignment. Download the file "Drinks.sav" and open it with SPSS. Use the data to complete the assignment. Download the file "Census.sav" and open it with SPSS. Use the data to complete the assignment. Directions: Locate the data set "Drinks.sav" and open it with SPSS. Follow the steps in section 2.11 Learning Activity as written. Answer all of the questions in the activity based on your observations of the SPSS output. Type your answers into a Word document for submission to the instructor. Locate the data set "Census.sav" and open it with SPSS. Follow the steps in section 3.9 Learning Activity as written. Answer all of the questions in the activity based on your observations of the SPSS output. Type your answers into a Word document for submission to the instructor. Locate the data set "Drinks.sav" and open it with SPSS. Follow the steps in section 4.18 Learning Activity as written. Answer all of the questions in the activity based on your observations of the SPSS output. Type your answers into a Word document for submission to the instructor. Introduction to Statistical Analysis Using IBM SPSS Statistics Student Guide Course Code: 0G517 ERC 1.0 Introduction to Statistical Analysis Using IBM SPSS Statistics Licensed Materials - Property of IBM Copyright IBM Corp. 2010 0G517 Published October 2010 US Government Users Restricted Rights - Use, duplication or disclosure restricted by GSA ADP Schedule Contract with IBM Corp. IBM, the IBM logo and ibm.com are trademarks of International Business Machines Corp., registered in many jurisdictions worldwide. SPSS, SamplePower, and PASW are trademarks of SPSS Inc., an IBM Company, registered in many jurisdictions worldwide. Other product and service names might be trademarks of IBM or other companies. This guide contains proprietary information which is protected by copyright. No part of this document may be photocopied, reproduced, or translated into another language without a legal license agreement from IBM Corporation. Any references in this information to non-IBM Web sites are provided for convenience only and do not in any manner serve as an endorsement of those Web sites. The materials at those Web sites are not part of the materials for this IBM product and use of those Web sites is at your own risk. TABLE OF CONTENTS Table of Contents LESSON 0: COURSE INTRODUCTION .................................................. 0-1 0.1 INTRODUCTION ................................................................................................................... 0-1 0.2 COURSE OBJECTIVES .......................................................................................................... 0-1 0.3 ABOUT SPSS ...................................................................................................................... 0-1 0.4 SUPPORTING MATERIALS ................................................................................................... 0-2 0.5 COURSE ASSUMPTIONS ...................................................................................................... 0-2 LESSON 1: INTRODUCTION TO STATISTICAL ANALYSIS ............. 1-1 1.1 OBJECTIVES ........................................................................................................................ 1-1 1.2 INTRODUCTION ................................................................................................................... 1-1 1.3 BASIC STEPS OF THE RESEARCH PROCESS ......................................................................... 1-1 1.4 POPULATIONS AND SAMPLES ............................................................................................. 1-3 1.5 RESEARCH DESIGN ............................................................................................................. 1-3 1.6 INDEPENDENT AND DEPENDENT VARIABLES ..................................................................... 1-4 1.7 NOTE ABOUT DEFAULT STARTUP FOLDER AND VARIABLE DISPLAY IN DIALOG BOXES .. 1-4 1.8 LESSON SUMMARY ............................................................................................................. 1-5 1.9 LEARNING ACTIVITY .......................................................................................................... 1-6 LESSON 2: UNDERSTANDING DATA DISTRIBUTIONS - THEORY2-1 2.1 OBJECTIVES ........................................................................................................................ 2-1 INTRODUCTION ......................................................................................................................... 2-1 2.2 LEVELS OF MEASUREMENT AND STATISTICAL METHODS ................................................. 2-1 2.3 MEASURES OF CENTRAL TENDENCY AND DISPERSION ..................................................... 2-5 2.4 NORMAL DISTRIBUTIONS ................................................................................................... 2-7 2.5 STANDARDIZED (Z-) SCORES ............................................................................................. 2-8 2.6 REQUESTING STANDARDIZED (Z-) SCORES...................................................................... 2-10 2.7 STANDARDIZED (Z-) SCORES OUTPUT ............................................................................. 2-10 2.8 PROCEDURE: DESCRIPTIVES FOR STANDARDIZED (Z-) SCORES ...................................... 2-10 2.9 DEMONSTRATION: DESCRIPTIVES FOR Z-SCORES............................................................ 2-11 2.10 LESSON SUMMARY ......................................................................................................... 2-12 2.11 LEARNING ACTIVITY ...................................................................................................... 2-13 LESSON 3: DATA DISTRIBUTIONS FOR CATEGORICAL VARIABLES ............................................................................................ 3-1 3.1 OBJECTIVES ........................................................................................................................ 3-1 3.2 INTRODUCTION ................................................................................................................... 3-1 3.3 USING FREQUENCIES TO SUMMARIZE NOMINAL AND ORDINAL VARIABLES .................... 3-2 3.4 REQUESTING FREQUENCIES ............................................................................................... 3-3 3.5 FREQUENCIES OUTPUT ....................................................................................................... 3-3 3.6 PROCEDURE: FREQUENCIES ............................................................................................... 3-4 3.7 DEMONSTRATION: FREQUENCIES ....................................................................................... 3-6 3.8 LESSON SUMMARY ........................................................................................................... 3-10 3.9 LEARNING ACTIVITY ........................................................................................................ 3-10 i INTRODUCTION TO STATISTICAL ANALYSIS USING IBM SPSS STATISTICS LESSON 4: DATA DISTRIBUTIONS FOR SCALE VARIABLES ......... 4-1 4.1 OBJECTIVES ........................................................................................................................4-1 4.2 INTRODUCTION ...................................................................................................................4-1 4.3 SUMMARIZING SCALE VARIABLES USING FREQUENCIES...................................................4-1 4.4 REQUESTING FREQUENCIES ................................................................................................4-2 4.5 FREQUENCIES OUTPUT .......................................................................................................4-2 4.6 PROCEDURE: FREQUENCIES ................................................................................................4-4 4.7 DEMONSTRATION: FREQUENCIES .......................................................................................4-6 4.8 SUMMARIZING SCALE VARIABLES USING DESCRIPTIVES.................................................4-11 4.9 REQUESTING DESCRIPTIVES .............................................................................................4-11 4.10 DESCRIPTIVES OUTPUT ...................................................................................................4-11 4.11 PROCEDURE: DESCRIPTIVES ...........................................................................................4-11 4.12 DEMONSTRATION: DESCRIPTIVES...................................................................................4-12 4.13 SUMMARIZING SCALE VARIABLES USING THE EXPLORE PROCEDURE ...........................4-13 4.14 REQUESTING EXPLORE ...................................................................................................4-13 4.15 PROCEDURE: EXPLORE ...................................................................................................4-16 4.16 DEMONSTRATION: EXPLORE...........................................................................................4-19 4.17 LESSON SUMMARY .........................................................................................................4-24 4.18 LEARNING ACTIVITY ......................................................................................................4-25 LESSON 5: MAKING INFERENCES ABOUT POPULATIONS FROM SAMPLES ............................................................................................ 5-1 5.1 OBJECTIVES ........................................................................................................................5-1 5.2 INTRODUCTION ...................................................................................................................5-1 5.3 BASICS OF MAKING INFERENCES ABOUT POPULATIONS FROM SAMPLES ..........................5-1 5.4 INFLUENCE OF SAMPLE SIZE ...............................................................................................5-2 5.5 HYPOTHESIS TESTING .......................................................................................................5-10 5.6 THE NATURE OF PROBABILITY .........................................................................................5-11 5.7 TYPES OF STATISTICAL ERRORS .......................................................................................5-11 5.8 STATISTICAL SIGNIFICANCE AND PRACTICAL IMPORTANCE ............................................5-12 5.9 LESSON SUMMARY ...........................................................................................................5-13 5.10 LEARNING ACTIVITY ......................................................................................................5-13 LESSON 6: RELATIONSHIPS BETWEEN CATEGORICAL VARIABLES ........................................................................................... 6-1 ii TABLE OF CONTENTS 6.1 OBJECTIVES ........................................................................................................................ 6-1 6.2 INTRODUCTION ................................................................................................................... 6-1 6.3 CROSSTABS......................................................................................................................... 6-2 6.4 CROSSTABS ASSUMPTIONS................................................................................................. 6-3 6.5 REQUESTING CROSSTABS ................................................................................................... 6-3 6.6 CROSSTABS OUTPUT .......................................................................................................... 6-3 6.7 PROCEDURE: CROSSTABS ................................................................................................... 6-4 6.8 EXAMPLE: CROSSTABS ....................................................................................................... 6-5 6.9 CHI-SQUARE TEST .............................................................................................................. 6-7 6.10 REQUESTING THE CHI-SQUARE TEST ............................................................................... 6-8 6.11 CHI-SQUARE OUTPUT....................................................................................................... 6-8 6.12 PROCEDURE: CHI-SQUARE TEST ...................................................................................... 6-9 6.13 EXAMPLE: CHI-SQUARE TEST ........................................................................................ 6-10 6.14 CLUSTERED BAR CHART ................................................................................................ 6-11 6.15 REQUESTING A CLUSTERED BAR CHART WITH CHART BUILDER .................................. 6-12 6.16 CLUSTERED BAR CHART FROM CHART BUILDER OUTPUT ............................................ 6-12 6.17 PROCEDURE: CLUSTERED BAR CHART WITH CHART BUILDER ..................................... 6-13 6.18 EXAMPLE: CLUSTERED BAR CHART WITH CHART BUILDER ......................................... 6-15 6.19 ADDING A CONTROL VARIABLE ..................................................................................... 6-16 6.20 REQUESTING A CONTROL VARIABLE ............................................................................. 6-17 6.21 CONTROL VARIABLE OUTPUT ........................................................................................ 6-17 6.22 PROCEDURE: ADDING A CONTROL VARIABLE ............................................................... 6-18 6.23 EXAMPLE: ADDING A CONTROL VARIABLE ................................................................... 6-19 6.24 EXTENSIONS: BEYOND CROSSTABS ............................................................................... 6-22 6.25 ASSOCIATION MEASURES............................................................................................... 6-23 6.26 LESSON SUMMARY ......................................................................................................... 6-23 6.27 LEARNING ACTIVITY ...................................................................................................... 6-24 LESSON 7: THE INDEPENDENT- SAMPLES T TEST .......................... 7-1 7.1 OBJECTIVES ........................................................................................................................ 7-1 7.2 INTRODUCTION ................................................................................................................... 7-1 7.3 THE INDEPENDENT-SAMPLES T TEST ................................................................................ 7-1 7.4 INDEPENDENT-SAMPLES T TEST ASSUMPTIONS ................................................................ 7-2 7.5 REQUESTING THE INDEPENDENT-SAMPLES T TEST ........................................................... 7-2 7.6 INDEPENDENT-SAMPLES T TEST OUTPUT .......................................................................... 7-3 7.7 PROCEDURE: INDEPENDENT-SAMPLES T TEST .................................................................. 7-5 7.8 DEMONSTRATION: INDEPENDENT-SAMPLES T TEST.......................................................... 7-6 7.9 ERROR BAR CHART .......................................................................................................... 7-10 7.10 REQUESTING AN ERROR BAR CHART WITH CHART BUILDER ........................................ 7-11 7.11 ERROR BAR CHART OUTPUT .......................................................................................... 7-11 7.12 DEMONSTRATION: ERROR BAR CHART WITH CHART BUILDER .................................... 7-12 7.13 LESSON SUMMARY ......................................................................................................... 7-14 7.14 LEARNING ACTIVITY ...................................................................................................... 7-14 LESSON 8: THE PAIRED-SAMPLES T TEST ........................................ 8-1 iii INTRODUCTION TO STATISTICAL ANALYSIS USING IBM SPSS STATISTICS 8.1 OBJECTIVES ........................................................................................................................8-1 8.2 INTRODUCTION ...................................................................................................................8-1 8.3 THE PAIRED-SAMPLES T TEST ............................................................................................8-1 8.4 ASSUMPTIONS FOR THE PAIRED-SAMPLES T TEST .............................................................8-2 8.5 REQUESTING A PAIRED-SAMPLES T TEST ..........................................................................8-3 8.6 PAIRED-SAMPLES T TEST OUTPUT .....................................................................................8-3 8.7 PROCEDURE: PAIRED-SAMPLES T TEST..............................................................................8-4 8.8 DEMONSTRATION: PAIRED-SAMPLES T TEST .....................................................................8-4 8.9 LESSON SUMMARY .............................................................................................................8-6 8.10 LEARNING ACTIVITY ........................................................................................................8-6 LESSON 9: ONE-WAY ANOVA ............................................................... 9-1 9.1 OBJECTIVES ........................................................................................................................9-1 9.2 INTRODUCTION ...................................................................................................................9-1 9.3 ONE-WAY ANOVA ..............................................................................................................9-1 9.4 ASSUMPTIONS OF ONE-WAY ANOVA ...............................................................................9-2 9.5 REQUESTING ONE-WAY ANOVA ......................................................................................9-2 9.6 ONE-WAY ANOVA OUTPUT ..............................................................................................9-3 9.7 PROCEDURE: ONE-WAY ANOVA ......................................................................................9-4 9.8 DEMONSTRATION: ONE-WAY ANOVA .............................................................................9-6 9.9 POST HOC TESTS WITH A ONE-WAY ANOVA ...................................................................9-8 9.10 REQUESTING POST HOC TESTS WITH A ONE-WAY ANOVA ...........................................9-9 9.11 POST HOC TESTS OUTPUT.................................................................................................9-9 9.12 PROCEDURE: POST HOC TESTS WITH A ONE-WAY ANOVA..........................................9-10 9.13 DEMONSTRATION: POST HOC TESTS WITH A ONE-WAY ANOVA .................................9-12 9.14 ERROR BAR CHART WITH CHART BUILDER ...................................................................9-14 9.15 REQUESTING AN ERROR BAR CHART WITH CHART BUILDER ........................................9-14 9.16 ERROR BAR CHART OUTPUT ..........................................................................................9-14 9.17 PROCEDURE: ERROR BAR CHART WITH CHART BUILDER ..............................................9-15 9.18 DEMONSTRATION: ERROR BAR CHART WITH CHART BUILDER .....................................9-16 9.19 LESSON SUMMARY .........................................................................................................9-18 9.20 LEARNING ACTIVITY ......................................................................................................9-18 LESSON 10: BIVARIATE PLOTS AND CORRELATIONS FOR SCALE VARIABLES .......................................................................................... 10-1 10.1 OBJECTIVES ....................................................................................................................10-1 10.2 INTRODUCTION ...............................................................................................................10-1 10.3 SCATTERPLOTS ...............................................................................................................10-1 10.4 REQUESTING A SCATTERPLOT ........................................................................................10-2 10.5 SCATTERPLOT OUTPUT ...................................................................................................10-3 10.6 PROCEDURE: SCATTERPLOT ...........................................................................................10-3 10.7 DEMONSTRATION: SCATTERPLOT ...................................................................................10-4 10.8 ADDING A BEST FIT STRAIGHT LINE TO THE SCATTERPLOT ..........................................10-5 10.9 PEARSON CORRELATION COEFFICIENT...........................................................................10-7 10.10 REQUESTING A PEARSON CORRELATION COEFFICIENT................................................10-8 10.11 BIVARIATE CORRELATION OUTPUT ..............................................................................10-8 10.12 PROCEDURE: PEARSON CORRELATION WITH BIVARIATE CORRELATIONS ...................10-9 10.13 DEMONSTRATION: PEARSON CORRELATION WITH BIVARIATE CORRELATIONS ........10-10 10.14 LESSON SUMMARY .....................................................................................................10-11 10.15 LEARNING ACTIVITY ..................................................................................................10-12 iv TABLE OF CONTENTS LESSON 11: REGRESSION ANALYSIS................................................ 11-1 11.1 OBJECTIVES .................................................................................................................... 11-1 11.2 INTRODUCTION ............................................................................................................... 11-1 11.3 SIMPLE LINEAR REGRESSION ......................................................................................... 11-1 11.4 SIMPLE LINEAR REGRESSION ASSUMPTIONS ................................................................. 11-3 11.5 REQUESTING SIMPLE LINEAR REGRESSION ................................................................... 11-4 11.6 SIMPLE LINEAR REGRESSION OUTPUT ........................................................................... 11-4 11.7 PROCEDURE: SIMPLE LINEAR REGRESSION ................................................................... 11-5 11.8 DEMONSTRATION: SIMPLE LINEAR REGRESSION........................................................... 11-7 11.9 MULTIPLE REGRESSION................................................................................................ 11-11 11.10 MULTIPLE LINEAR REGRESSION ASSUMPTIONS ........................................................ 11-11 11.11 REQUESTING MULTIPLE LINEAR REGRESSION........................................................... 11-11 11.12 MULTIPLE LINEAR REGRESSION OUTPUT .................................................................. 11-11 11.13 PROCEDURE: MULTIPLE LINEAR REGRESSION ........................................................... 11-14 11.14 DEMONSTRATION: MULTIPLE LINEAR REGRESSION .................................................. 11-16 11.15 LESSON SUMMARY ..................................................................................................... 11-22 11.16 LEARNING ACTIVITY .................................................................................................. 11-22 LESSON 12: NONPARAMETRIC TESTS .............................................. 12-1 12.1 OBJECTIVES .................................................................................................................... 12-1 12.2 INTRODUCTION ............................................................................................................... 12-1 12.3 NONPARAMETRIC ANALYSES ......................................................................................... 12-2 12.4 THE INDEPENDENT SAMPLES NONPARAMETRIC ANALYSIS .......................................... 12-2 12.5 REQUESTING AN INDEPENDENT SAMPLES NONPARAMETRIC ANALYSIS ....................... 12-3 12.6 INDEPENDENT SAMPLES NONPARAMETRIC TESTS OUTPUT .......................................... 12-3 12.7 PROCEDURE: INDEPENDENT SAMPLES NONPARAMETRIC TESTS ................................... 12-5 12.8 DEMONSTRATION: INDEPENDENT SAMPLES NONPARAMETRIC TESTS .......................... 12-8 12.9 THE RELATED SAMPLES NONPARAMETRIC ANALYSIS ................................................ 12-11 12.10 REQUESTING A RELATED SAMPLES NONPARAMETRIC ANALYSIS ............................. 12-12 12.11 RELATED SAMPLES NONPARAMETRIC TESTS OUTPUT .............................................. 12-12 12.12 PROCEDURE: RELATED SAMPLES NONPARAMETRIC TESTS ...................................... 12-13 12.13 DEMONSTRATION: RELATED SAMPLES NONPARAMETRIC TESTS .............................. 12-16 12.14 LESSON SUMMARY ..................................................................................................... 12-19 12.15 LEARNING ACTIVITY .................................................................................................. 12-20 LESSON 13: COURSE SUMMARY........................................................ 13-1 13.1 COURSE OBJECTIVES REVIEW ........................................................................................ 13-1 13.2 COURSE REVIEW: DISCUSSION QUESTIONS ................................................................... 13-1 13.3 NEXT STEPS .................................................................................................................... 13-2 APPENDIX A: INTRODUCTION TO STATISTICAL ANALYSIS REFERENCES 1 1.1 INTRODUCTION .................................................................................................................. A-1 1.2 REFERENCES ...................................................................................................................... A-1 v INTRODUCTION TO STATISTICAL ANALYSIS USING IBM SPSS STATISTICS vi COURSE INTRODUCTION Lesson 0: Course Introduction 0.1 Introduction The focus of this two-day course is an introduction to the statistical component of IBM SPSS Statistics. This is an application-oriented course and the approach is practical. You'll take a look at several statistical techniques and discuss situations in which you would use each technique, the assumptions made by each method, how to set up the analysis using PASW Statistics, as well as how to interpret the results. This includes a broad range of techniques for exploring and summarizing data, as well as investigating and testing underlying relationships. You will gain an understanding of when and why to use these various techniques as well as how to apply them with confidence, and interpret their output, and graphically display the results. 0.2 Course Objectives After completing this course students will be able to: Perform basic statistical analysis using selected statistical techniques with PASW Statistics To support the achievement of this primary objective, students will also be able to: Explain the basic elements of quantitative research and issues that should be considered in data analysis Determine the level of measurement of variables and obtain appropriate summary statistics based on the level of measurement Run the Frequencies procedure to obtain appropriate summary statistics for categorical variables Request and interpret appropriate summary statistics for scale variables Explain how to make inferences about populations from samples Perform crosstab analysis on categorical variables Perform a statistical test to determine whether there is a statistically significant relationship between categorical variables Perform a statistical test to determine whether there is a statistically significant difference between two groups on a scale variable Perform a statistical test to determine whether there is a statistically significant difference between the means of two scale variables Perform a statistical test to determine whether there is a statistically significant difference among three or more groups on a scale dependent variable Perform a statistical test to determine whether two scale variables are correlated (related) Perform linear regression to determine whether one or more variables can significantly predict or explain a dependent variable Perform non-parametric tests on data that don't meet the assumptions for standard statistical tests 0.3 About SPSS SPSS Inc., an IBM Company is a leading global provider of predictive analytics software and solutions. The Company's complete portfolio of products - data collection, statistics, modeling and deployment - captures people's attitudes and opinions, predicts outcomes of future customer interactions, and then acts on these insights by embedding analytics into business processes. SPSS solutions address interconnected business objectives across an entire organization by focusing on the convergence of analytics, IT architecture and business process. Commercial, government and academic customers worldwide rely on SPSS technology as a competitive advantage in attracting, 0-1 INTRODUCTION TO STATISTICAL ANALYSIS WITH IBM SPSS STATISTICS retaining and growing customers, while reducing fraud and mitigating risk. SPSS was acquired by IBM in October 2009. For more information, visit http://www.spss.com. 0.4 Supporting Materials We use several datasets in the course because no one data file contains all the types of variables and relationships between them that are ideal for every technique we discuss. As much as possible, we try to minimize the need within one lesson to switch between datasets, but the first priority is to use appropriate data for each method. The following data files are used in this course: Bank.sav Drinks.sav Census.sav Employee data.sav SPSS_CUST.sav 0.5 Course Assumptions General computer literacy. Completion of the "Introduction to PASW Statistics" and/or "Data Management and Manipulation with PASW Statistics" courses or experience with PASW Statistics including familiarity with, opening, defining, and saving data files and manipulating and saving output. Basic statistical knowledge or at least one introductory level course in statistics is recommended. Note about Default Startup Folder and Variable Display in Dialog Boxes In this course, all of the files used for the demonstrations and exercises are located in the folder c:\\Train\\Statistics_IntroAnalysis. Note: If the course files are stored in a different location, your instructor will give you instructions specific to that location. Either variable names or longer variable labels will appear in list boxes in dialog boxes. Additionally, variables in list boxes can be ordered alphabetically or by their position in the file. In this course, we will display variable names in alphabetical order within list boxes. 1) 2) 3) 4) 5) 0-2 Select Edit...Options Select the General tab (if necessary) Select Display names in the Variable Lists group on the General tab Select Alphabetical Select OK and OK in the information box to confirm the change INTRODUCTION TO STATISTICAL ANALYSIS Lesson 1: Introduction to Statistical Analysis 1.1 Objectives After completing this lesson students will be able to: Explain the basic elements of quantitative research and issues that should be considered in data analysis To support the achievement of the primary objective, students will also be able to: Explain the basic steps of the research process Explain differences between populations and samples Explain differences between experimental and non-experimental research designs Explain differences between independent and dependent variables 1.2 Introduction The goal of this course is to enable you to perform useful analyses on your data using PASW Statistics. Keeping this in mind, these lessons demonstrate how to perform descriptive and inferential statistical analyses and create charts to support these analyses. This course guide will focus on the elements necessary for you to answer questions from your data. In this chapter, we begin by briefly reviewing the basic elements of quantitative research and issues that should be considered in data analysis. We will then discuss a number of statistical procedures that PASW Statistics performs. This is an application-oriented course and the approach will be practical. We will discuss: 1) 2) 3) 4) The situations in which you would use each technique. The assumptions made by the method. How to set up the analysis using PASW Statistics. Interpretation of the results. We will not derive proofs, but rather focus on the practical matters of data analysis in support of answering research questions. For example, we will discuss what correlation coefficients are, when to use them, and how to produce and interpret them, but will not formally derive their properties. This course is not a substitute for a course in statistics. You will benefit if you have had such a course in the past, but even if not, you will understand the basics of each technique after completion of this course. We will cover descriptive statistics and exploratory data analysis, and then examine relationships between categorical variables using crosstabulation tables and chi-square tests. Testing for mean differences between groups using T Tests and analysis of variance (ANOVA) will be considered. Correlation and regression will be used to investigate the relationships between interval/scale variables and we will also discuss some non-parametric techniques. Graphs comprise an integral part of the analyses and we will demonstrate how to create and interpret these as well. 1.3 Basic Steps of the Research Process All research projects, whether analyzing a survey, doing program evaluations, assessing marketing campaigns, doing pharmaceutical research, etc., can be broken down into a number of discrete 1-1 INTRODUCTION TO STATISTICAL ANALYSIS USING IBM SPSS STATISTICS components. These components can be categorized in a variety of ways. We might summarize the main steps as: 1) Specify exactly the aims and objectives of the research along with the main hypotheses. 2) Define the population and sample design. 3) Choose a method of data collection, design the research and decide upon an appropriate sampling strategy. 4) Collect the data. 5) Prepare the data for analysis. 6) Analyse the data. 7) Report the findings. Some of these points may seem obvious, but it is surprising how often some of the most basic principles are overlooked, potentially resulting in data that is impossible to analyze with any confidence. Each step is crucial for a successful research project and it is never too early in the process to consider the methods that you intend to use for your data analysis. In order to place the statistical techniques that we will discuss in this course in the broader framework of research design, we will briefly review some of the considerations of the first steps. Statistics and research design are highly interconnected disciplines and you should have a thorough grasp of both before embarking on a research project. This introductory chapter merely skims the surface of the issues involved in research design. If you are unfamiliar with these principles, we recommend that you refer to the research methodology literature for more thorough coverage of the issues. Research Objectives It is important that a research project begin with a set of well-defined objectives. Yet, this step is often overlooked or not well defined. The specific aims and objectives may not be addressed because those commissioning the research do not know exactly which questions they would like answered. This rather vague approach can be a recipe for disaster and may result in a completely wasted opportunity as the most interesting aspects of the subject matter under investigation could well be missed. If you do not identify the specific objectives, you will fail to collect the necessary information or ask the necessary question in the correct form. You can end up with a data file that does not contain the information that you need for your data analysis step. For example, you may be asked to conduct a survey "to find out about alcohol consumption and driving". This general objective could lead to a number of possible survey questions. Rather than proceeding with this general objective, you need to uncover more specific hypotheses that are of interest to your organization. This example could lead to a number of very specific research questions, such as: \"What proportion of people admits to driving while above the legal alcohol limit?\" \"What demographic factors (e.g., age/sex/social class) are linked with a propensity to drunk-driving?\" \"Does having a conviction for drunk-driving affect attitudes towards driving while over the legal limit?\" These specific research questions would then define the questionnaire items. Additionally, the research questions will affect the definition of the population and the sampling strategy. For example, the third question above requires that the responder have a drunk-driving conviction. Given that a 1-2 INTRODUCTION TO STATISTICAL ANALYSIS relatively small proportion of the general population has such a conviction, you would need to take that into consideration when defining the population and sampling design. Therefore, it is essential to state formally the main aims and objectives at the outset of the research so the subsequent stages can be done with these specific questions in mind. 1.4 Populations and Samples In studies involving statistical analysis it is important to be able to characterize accurately the population under investigation. The population is the group to which you wish to generalize your conclusions, while the sample is the group you directly study. In some instances the sample and population are identical or nearly identical; consider the Census of any country. In the majority of studies, the sample represents a small proportion of the population. In the example above, the population might be defined as those people with registered drivers' licenses. We could select a sample from the drivers' license registration list for our survey. Other common examples are: membership surveys in which a small percentage of members are sent questionnaires, medical experiments in which samples of patients with a disease are given different treatments, marketing studies in which users and non users of a product are compared, and political polling. The problem is to draw valid inferences from data summaries in the sample so that they apply to the larger population. In some sense you have complete information about the sample, but you want conclusions that are valid for the population. An important component of statistics and a large part of what we cover in the course involves statistical tests used in making such inferences. Because the findings can only be generalized to the population under investigation, you should give careful thought to defining the population of interest to you and making certain that the sample reflects this population. To state it in a simple way, statistical inference provides a method of drawing conclusions about a population of interest based on sample results. 1.5 Research Design With specific research goals and a target population in mind, it is then possible to begin the design stage of the research. There are many things to consider at the design stage. We will consider a few issues that relate specifically to data analysis and statistical techniques. This is not meant as a complete list of issues to consider. For example, for survey projects, the mode of data collection, question selection and wording, and questionnaire design are all important considerations. Refer to the survey research literature as well as general research methodology literature for discussion of these and other research design issues. First, you must consider the type of research that will be most appropriate to the research aims and objectives. Two main alternatives are experimental and non-experimental research. The data may be recorded using either objective or subjective techniques. The former includes items measured by an instrument and by computer such as physiological measures (e.g. heart-rate) while the latter includes observational techniques such as recordings of a specific behavior and responses to questionnaire surveys. Most research goals lend themselves to one particular form of research, although there are cases where more than one technique may be used. For example, a questionnaire survey would be inappropriate if the aim of the research was to test the effectiveness of different levels of a new drug to relieve high blood pressure. This type of work would be more suited to a tightly controlled experimental study in which the levels of the drug administered could be carefully controlled and objective measures of blood pressure could be accurately recorded. On the other hand, this type of laboratory-based work would not be a suitable means of uncovering people's voting intentions. 1-3 INTRODUCTION TO STATISTICAL ANALYSIS USING IBM SPSS STATISTICS The classic experimental design consists of two groups: the experimental group and the control group. They should be equivalent in all respects other than that those in the former group are subjected to an effect or treatment and the latter is not. Therefore, any differences between the two groups can be directly attributed to the effect of this treatment. The treatment variables are usually referred to as independent variables, and the quantity being measured as the effect is the dependent variable. There are many other research designs, but most are more elaborate variations on this basic theme. In non-experimental research, you rarely have the opportunity to implement such a rigorously controlled design. For example, we cannot randomly assign students to schools, however, the same general principles apply to many of the analyses you perform. 1.6 Independent and Dependent Variables In general, the dependent (sometimes referred to as the outcome) variable is the one we wish to study as a function of other variables. Within an experiment, the dependent variable is the measure expected to change as a result of the experimental manipulation. For example, a drug experiment designed to test the effectiveness of different sleeping pills might employ the number of hours of sleep as the dependent variable. In surveys and other non-experimental studies, the dependent variable is also studied as a function of other variables. However, no direct experimental manipulation is performed; rather the dependent variable is hypothesized to vary as a result of changes in the other (independent) variables. Correspondingly, independent (sometimes referred to as predictor) variables are those used to measure features manipulated by the experimenter in an experiment. In a non-experimental study, they represent variables believed to influence or predict a dependent measure. Thus terms (dependent, independent) reasonably applied to experiments have taken on more general meanings within statistics. Whether such relations are viewed causally, or as merely predictive, is a matter of belief and reasoning. As such, it is not something that statistical analysis alone can resolve. To illustrate, we might investigate the relationship between starting salary (dependent) and years of education, based on survey data, and then develop an equation predicting starting salary from years of education. Here starting salary would be considered the dependent variable although no experimental manipulation of education has been performed. One way to think of the distinction is to ask yourself which variable is likely to influence the other? In summary, the dependent variable is believed to be influenced by, or be predicted by, the independent variable(s). Finally, in some studies, or parts of studies, the emphasis is on exploring and characterizing relationships among variables with no causal view or focus on prediction. In such situations there is no designation of dependent and independent variables. For example, in crosstabulation tables and correlation matrices the distinction between dependent and independent variables is not necessary. It rather resides in the eye of the beholder (researcher). 1.7 Note about Default Startup Folder and Variable Display in Dialog Boxes In this course, all of the files used for the demonstrations and exercises are located in the folder c:\\Train\\Statistics_IntroAnalysis.You can set the startup folder that will appear in all Open and Save dialog boxes. We will use this option to set the startup folder. Select Edit...Options, and then select the File Locations tab Select the Browse button to the right of the Data Files text box Select Train from the Look In: drop down list, then select Statistics_IntroAnalysis from the list of folders and select Set button Click the Browse button to the right of the Other Files text box and repeat the process to set this folder to Train\\Statistics_IntroAnalysis 1-4 INTRODUCTION TO STATISTICAL ANALYSIS Figure 1.1 Set Default File Location in the Edit Options Dialog Box Note: If the course files are stored in a different location, your instructor will give you instructions specific to that location. Either variable names or longer variable labels will appear in list boxes in dialog boxes. Additionally, variables in list boxes can be ordered alphabetically or by their position in the file. In this course, we will display variable names in alphabetical order within list boxes. Select General tab Select Display names in the Variable Lists group on the General tab Select Alphabetical (Not shown) Select OK and then OK in the information box to confirm the change 1.8 Lesson Summary In this lesson, we reviewed the basic elements of quantitative research and issues that should be considered in data analysis. Lesson Objectives Review Students who have completed this lesson should now be able to: Explain the basic elements of quantitative research and issues that should be considered in data analysis To support the achievement of the primary objective, students should now also be able to: Explain the basic steps of research process Explain differences between populations and samples Explain differences between experimental and non-experimental research designs Explain differences between independent and dependent variables 1-5 INTRODUCTION TO STATISTICAL ANALYSIS USING IBM SPSS STATISTICS 1.9 Learning Activity In this set of learning activities you won't need any supporting material. 1. In each of the following scenarios, state the possible goals of the research, the type of design you can use, and the independent and dependent variables: a. The relationship between gender and whether a product was purchased. b. The difference between income categories (e.g., low, medium, and high) and number of years of education. c. The effect of two different marketing campaigns on number of items purchased. 2. In your own organization/field, are experimental studies ever done? If not, can you imagine how an experiment might be done to study a topic of interest to you or your organization? Describe that and the challenges such an experimental design would encounter. 1-6 UNDERSTANDING DATA DISTRIBUTIONS - THEORY Lesson 2: Understanding Data Distributions - Theory 2.1 Objectives After completing this lesson students will be able to: Determine the level of measurement of variables and obtain appropriate summary statistics based on the level of measurement To support the achievement of this primary objective, students will also be able to: Describe the levels of measurement used in PASW Statistics Use measures of central tendency and dispersion Use normal distributions and z-scores Introduction Ideally, we would like to obtain as much information as possible from our data. In practice however, given the measurement level of our variables, only some information is meaningful. In this lesson we will discuss level of measurement and see how this determines the summary statistics we can request. Business Context Understanding how level of measurement impacts the kind of information we can obtain is an important step before we collect our data. In addition, level of measurement also determines the kind of research questions we can answer, and so this is a critical step in the research process. Supporting Materials The file Census.sav, a PASW Statistics data file from a survey done on the general adult population. Questions were included about various attitudes and demographic characteristics. 2.2 Levels of Measurement and Statistical Methods The term levels of measurement refers to the properties and meaning of numbers assigned to observations for each item. Many statistical techniques are only appropriate for data measured at particular levels or combinations of levels. Therefore, when possible, you should determine the analyses you will be using before deciding upon the level of measurement to use for each of your variables. For example, if you want to report and test the mean age of your customers, you will need to ask their age in years (or year of birth) rather than asking them to choose an age group into which their age falls. Because measurement type is important when choosing test statistics, we briefly review the common taxonomy of level of measurement. 2-1 INTRODUCTION TO STATISTICAL ANALYSIS WITH IBM SPSS STATISTICS The four major classifications that follow are found in many introductory statistics texts. They are presented beginning with the weakest and ending with those having the strongest measurement properties. Each successive level can be said to contain the properties of the preceding types and to record information at a higher level. Nominal In nominal measurement each numeric value represents a category or group identifier, only. The categories cannot be ranked and have no underlying numeric value. An example would be marital status, coded 1 (Married), 2 (Widowed), 3 (Divorced), 4 (Separated) and 5 (Never Married); each number represents a category and the matching of specific numbers to categories is arbitrary. Counts and percentages of observations falling into each category are appropriate summary statistics. Such statistics as the mean (the average marital status?) would not be appropriate, but the mode would be appropriate (the most frequent category). Ordinal For ordinal measures the data values represent ranking or ordering information. However, the difference between the data values along the scale is not equal. An example would be specifying how happy you are with your life, coded 1 (Very Happy), 2 (Happy), and 3 (Not Happy). There are specific statistics associated with ranks; PASW Statistics provides a number of them mostly within the Crosstabs, Nonparametric and Ordinal Regression procedures. The mode and median can be used as summary statistics. Interval In interval measurement, a unit increase in numeric value represents the same change in quantity regardless of where it occurs on the scale. For interval scale variables such summaries as means and standard deviations are appropriate. Statistical techniques such as regression and analysis of variance assume that the dependent (or outcome) variable is measured on an interval scale. Examples might be temperature in degrees Fahrenheit or SAT score. Ratio Ratio measures have interval scale properties with the addition of a meaningful zero point; that is, zero indicates complete absence of the characteristic measured. For statistics such as ANOVA and regression only interval scale properties are assumed, so ratio scales have stronger properties than necessary for most statistical analyses. Health care researchers often use ratio scale variables (number of deaths, admissions, discharges) to calculate rates. The ratio of two variables with ratio scale properties can thus be directly interpreted. Money is an example of a ratio scale, so someone with $10,000 has ten times the amount as someone with $1,000. The distinction between the four types is summarized below. Table 2.1 Level of Measurement Properties Property Level of Measurement Categories Nominal Ordinal Interval Ratio Ranks Equal Intervals True Zero Point These four levels of measurement are often combined into two main types, categorical consisting of nominal and ordinal measurement levels and scale (or continuous) consisting of interval and ratio measurement levels. 2-2 UNDERSTANDING DATA DISTRIBUTIONS - THEORY The measurement level variable attribute in PASW Statistics recognizes three measurement levels: Nominal, Ordinal and Scale. The icon indicating the measurement level is displayed preceding the variable name or label in the variable lists of all dialog boxes. The following table shows the most common icons used for the measurement levels. Special data types, such as Date and Time variables have distinct icons not shown in this table. Table 2.2 Variable List Icons Measurement Level Data Type Numeric String Nominal Ordinal Not Applicable Scale Not Applicable Rating Scales and Dichotomous Variables A common scale used in surveys and market research is an ordered rating scale usually consisting of five- or seven-point scales. Such ordered scales are also called Likert scales and might be coded 1 (Strongly Agree, or Very Satisfied), 2 (Agree, or Satisfied), 3 (Neither agree nor disagree, or Neutral), 4 (Disagree, or Dissatisfied), and 5 (Strongly Disagree, or Very Dissatisfied). There is an ongoing debate among researchers as to whether such scales should be considered ordinal or interval. PASW Statistics contains procedures capable of handling such variables under either assumption. When in doubt about the measurement scale, some researchers run their analyses using two separate methods, since each make different assumptions about the nature of the measurement. If the results agree, the researcher has greater confidence in the conclusion. Dichotomous (binary) variables containing two possible responses (often coded 0 and 1) are often considered to fall into all of the measurement levels except ratio (at least as independent variables). As we will see, this flexibility allows them to be used in a wide range of statistical procedures Implications of Measurement Level As we have discussed, the level of measurement of a variable is important because it determines the appropriate summary statistics, tables, and graphs to describe the data. The following table summarizes the most common summary measures and graphs for each of the measurement levels and PASW Statistics procedures that can produce them. 2-3 INTRODUCTION TO STATISTICAL ANALYSIS WITH IBM SPSS STATISTICS Table 2.3 Summary of Descriptive Statistics and Graphs NOMINAL ORDINAL SCALE Definition Unordered Categories Ordered Categories Metric/Numeric Values Examples Labor force status, gender, marital status Satisfaction ratings, degree of education Income, height, weight Measures of Central Tendency Mode Mode Median Mode Median Mean Min/Max/Range, InterQuartile Range (IQR) Min/Max/Range, IQR, Standard Deviation/Variance Measures of Dispersion N/A Graph Pie or Bar Pie or Bar Histogram, Box & Whisker, Stem & Leaf Procedures Frequencies Frequencies Frequencies, Descriptives, Explore Measurement Level and Statistical Methods Statistics are available for variables at all levels of measurement for more advanced analysis. In practice, your choice of method depends on the questions you are interested in asking of the data and the nature of the measurements you make. The table below suggests which statistical techniques are most appropriate, based on the measurement level of the dependent and independent variable. Much more extensive diagrams and discussion are found in Andrews et al. (1981), or other standard statistical texts. Table 2.4 Level of Measurement and Appropriate Statistical Methods Dependent Variable Independent Variables Nominal Ordinal Nominal Crosstabs Crosstabs Ordinal Nonparametric tests, Ordinal Regression Interval/ Ratio T Test, ANOVA Nonparametric correlation, Optimal Scaling Regression Nonparametric Correlation 2-4 Interval/Ratio Discriminant, Logistic Regression Ordinal Regression Correlation, Linear Regression UNDERSTANDING DATA DISTRIBUTIONS - THEORY Best Practice If in doubt about the measurement properties of your variables, you can apply a statistical technique that assumes weaker measurement properties and compare the results to methods making stronger assumptions. A consistent answer provides greater confidence in the conclusions. Apply Your Knowledge 1. PASW Statistics distinguishes three levels of measurement. Which of these is not one of those levels? a. Categorical b. Scale c. Nominal d. Ordinal 2. True or false? An ordinal variable has all properties of a nominal variable? 3. Consider the dataset depicted below. Which statements are correct? a. The variable region is an ordinal variable b. The variable age is a scale variable c. The variable agecategory is an ordinal variable d. The variable salarycategory is a scale variable 2.3 Measures of Central Tendency and Dispersion Measures of central tendency and dispersion are the most common measures used to summarize the distribution of variables. We give a brief description of each of these measures below. Measures of Central Tendency Statistical measures of central tendency give that one number that is often used to summarize the distribution of a variable. They may be referred to generically as the "average." There are three main 2-5 INTRODUCTION TO STATISTICAL ANALYSIS WITH IBM SPSS STATISTICS central tendency measures: mode, median, and mean. In addition, Tukey devised the 5% trimmed mean. Mode: The mode for any variable is merely the group or class that contains the most cases. If two or more groups contain the same highest number of cases, the distribution is said to be \"multimodal.\" This measure is more typically used on nominal or ordinal data and can easily be determined by examining a frequency table. Median - If all the cases for a variable are arranged in order according to their value, the median is that value that splits the cases into two equally sized groups. The median is the same as the 50th percentile. Medians are resistant to extreme scores, and so are considered robust measures of central tendency. Mean: - The mean is the simple arithmetic average of all the values in the distribution (i.e., the sum of the values of all cases divided by the total number of cases). It is the most commonly reported measure of central tendency. The mean along with the associated measures of dispersion are the basis for many statistical techniques. 5% trimmed mean - The 5% trimmed mean is the mean calculated after the extreme upper 5% and the extreme lower 5% of the data values are dropped. Such a measure is resistant to extreme values. The specific measure that you choose will depend on a number of factors, most importantly the level of measurement of the variable. The mean is considered the most "powerful" measure of the three classic measures of central tendency. However, it is good practice to compare the median, mean, and 5% trimmed mean to get a more complete understanding of a distribution. Measures of Dispersion Measures of dispersion or variability describe the degree of spread, dispersion, or variability around the central tendency measure. You might think of this as a measure of the extent to which observations cluster within the distribution. There are a number of measures of dispersion, including simple measures such as maximum, minimum, and range, common statistical measures, such as standard deviation and variance, as well as the interquartile range (IQR). 2-6 Maximum: Simply the highest value observed for a particular variable. By itself, it can tell us nothing about the shape of the distribution, merely how high the top value is. Minimum: The lowest value in the distribution and, like the maximum, is only useful when reported in conjunction with other statistics. Range: The difference between the maximum and minimum values gives a general impression of how broad the distribution is. It says nothing about the shape of a distribution and can give a distorted impression of the data if just one case has an extreme value. Variance: Both the variance and standard deviation provide information about the amount of spread around the mean value. They are overall measures of how clustered around the mean the data values are. The variance is calculated by summing the square of the difference between the value and the mean for each case and dividing this quantity by the number of cases minus 1. If all cases had the same value, the variance (and standard deviation) would be zero. The variance measure is expressed in the units of the variable squared. This can cause difficulty in interpretation, so more often the standard deviation is used. In general terms, the larger the variance, the more spread there is in the data, the smaller the variance, the more the data values are clustered around the mean. Standard Deviation: The standard deviation is the square root of the variance which restores the value of variability to the units of measurement of the original variable. It is therefore easier to interpret. Either the variance or standard deviation is often used in conjunction with the mean as a basis for a wide variety of statistical techniques. Interquartile Range (IQR) - This measure of variation is the range of values between the 25th and 75th percentile values. Thus, the IQR represents the range of the middle 50 percent of the sample and is more resistant to extreme values than the standard deviation. UNDERSTANDING DATA DISTRIBUTIONS - THEORY Like the measures of central tendency, these measures differ in their usefulness with variables of different measurement levels. The variability measures, variance and standard deviation, are used in conjunction with the mean for statistical evaluation of the distribution of a scale variable. The other measures of dispersion, although less useful statistically, can provide useful descriptive information about a variable. Apply Your Knowledge 1. True or false? The mode is that value that splits the cases into two equally sized groups. 2. True or false? Consider the table depicted below. The salaries of men are clustered tighter around their mean than the salaries of women around their mean? 2.4 Normal Distributions An important statistical concept is that of the normal distribution. This is a frequency (or probability) distribution which is symmetrical and is often referred to as the normal bell-shaped curve. The histogram below illustrates a normal distribution. The mean, median and mode exactly coincide in a perfectly normal distribution. And the proportion of cases contained within any portion of the normal curve can be exactly calculated mathematically. Its symmetry means that 50% of cases lie to either side of the central point as defined by the mean. Two of the other most frequently-used representations are the portions lying between plus and minus one standard deviation of the mean (containing approximatel

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock

Our experts were un... View full answer

blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Finance Questions!