Question
Various Income Groups in Edmonton and Calgary

The distribution of income groups is one of the important economic factors that a capitalist, an economist, and the government want to know. For example, a capitalist uses the distribution of income groups to determine what type of business is most suitable for investment in a certain city, province, and/or country. The economist and the government, in turn, want to know whether the distribution of income groups is at an appropriate level, to determine whether any policy needs to be put in place. Therefore, from time to time, the government conducts a census to gather this information from its citizens. Because a census is too expensive to conduct every year, Statistics Canada completes one every 5 years; the latest was conducted in 2021.

Since the data for this assignment was collected in January 2020 to verify the effect of the COVID pandemic, the information from the 2016 census is used in this assignment for comparisons. The information about households for the 2016 census can be obtained from this Statistics Canada website: https://www12.statcan.gc.ca/census-recensement/2016/dp-pd/dt-td/index-eng.cfm. The website offers census data for the year 2015 (since the census was conducted in 2016).

In the census data, Statistics Canada divides household income into 19 different income groups, but suppose a researcher is only interested in observing the distribution of 3 income groups (group 1: households with income under $50,000; group 2: households with income between $50,000 and $99,999; and group 3: households with income of $100,000 and over).
Based on census 2016, the distributions of household income for the 3 income groups in Edmonton and Calgary are given in the following table:

Household Income   Under $50,000     Between $50,000 and $99,999   $100,000 and over   Total
Edmonton           115100 (22.92%)   151565 (30.18%)               235485 (46.90%)     502150 (100%)
Calgary            109375 (21.05%)   151590 (29.17%)               258670 (49.78%)     519635 (100%)

The researcher randomly selects 150 households in Edmonton and another 150 households in Calgary, calling and asking each household whether it belongs to income group 1 (low: under $50,000 in total household income), income group 2 (medium: total household income from $50,000 to $99,999), or income group 3 (high: total household income of at least $100,000). The dataset (Lab3-Data.csv, also provided in a second file format) relates to this study. The dataset is available through the Data link located in the Lab 3 tab in the Labs section on eClass. The data are not to be printed in your submission.

The following is a description of the variables in the data file:

Column   Variable Name   Description of Variable
1        Household       The household number
2        City            Name of city where the household is located
3        Income          Income group to which household belongs
4        Level           Income level (Low, Medium, and High)

Answer the following questions using the data:

1. Looking at the study design of the dataset, can you generalize the results of the study to Edmonton and/or Calgary? Explain briefly. Identify the identifier variable and comment briefly. What is/are the categorical and numerical variable(s) in the data, if any?

2. Use data for Edmonton to answer the following questions.

(a) Create frequency tables (showing both frequencies and percentages) to summarize the distribution of income groups for households in Edmonton. Paste the table into your report. Compare this sample distribution with the distribution of the 3 income groups for Edmonton in the census. Specifically,
provide the exact differences in the distribution of each of the 3 income groups in Edmonton between the census 2016 and the sample data in 2021.

(b) Carry out an appropriate hypothesis test at α = 0.01 to see whether the distribution of household income in 2021 is different from the distribution of household income in 2016. State the null and alternative hypotheses in terms of parameters: the population proportions for the three household income groups. Report the value of the appropriate test statistic, the distribution of the test statistic under the null hypothesis, and the P-value of the test to answer the question. State your conclusion.

(c) Regardless of your results in part (b), carry out an appropriate hypothesis test at α = 0.01 to see whether the proportion of households in Edmonton with income under $50,000 is now higher than 22.92% (which is the rounded percentage of households with income under $50,000 in the 2016 census). State the null and alternative hypotheses in terms of parameters. Report the value of the appropriate z-test statistic, the distribution of the test statistic under the null hypothesis, and the P-value of the test to answer the question. State your conclusion.

(d) Find a 98% two-sided confidence interval for the proportion of households in Edmonton with income under $50,000. (Hint: Although a one-sided confidence interval can be obtained in R, this type of confidence interval is not discussed in STAT 151 classes. Therefore, students must use a two-sided confidence interval to answer this question.) Interpret the confidence interval. Use the confidence interval to answer the question in part (c). Compare the result from the confidence interval with the conclusion from part (c).
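
The calculations behind parts 2(b) to 2(d) can be sketched in Python as follows. This is only an illustration under stated assumptions, not the assignment's solution: the course itself works in R, the sample counts below are hypothetical placeholders (the real tallies come from the lab dataset), and the function names are invented for this sketch. The census shares are rebuilt from the Edmonton counts in the table, with the middle group taken as the total minus the other two groups.

```python
import math
from statistics import NormalDist

# Edmonton census 2016 counts from the question's table.
CENSUS_TOTAL = 502150
CENSUS_LOW = 115100
CENSUS_HIGH = 235485
CENSUS_MEDIUM = CENSUS_TOTAL - CENSUS_LOW - CENSUS_HIGH  # middle group by subtraction

census_props = {
    "Low": CENSUS_LOW / CENSUS_TOTAL,
    "Medium": CENSUS_MEDIUM / CENSUS_TOTAL,
    "High": CENSUS_HIGH / CENSUS_TOTAL,
}

# HYPOTHETICAL tallies for the 150 sampled Edmonton households.
# Replace these with the frequencies obtained in part (a).
sample_counts = {"Low": 40, "Medium": 48, "High": 62}


def chi_square_gof(observed, null_props):
    """Part (b): chi-square goodness-of-fit test for three categories."""
    n = sum(observed.values())
    stat = sum(
        (observed[k] - n * null_props[k]) ** 2 / (n * null_props[k])
        for k in null_props
    )
    df = len(null_props) - 1
    if df != 2:
        raise ValueError("closed-form P-value below is valid only for df = 2")
    # For a chi-square distribution with 2 degrees of freedom,
    # the upper-tail probability is exactly exp(-x / 2).
    p_value = math.exp(-stat / 2)
    return stat, df, p_value


def one_sided_z_test(successes, n, p0):
    """Part (c): H0: p = p0 versus Ha: p > p0, using the standard error under H0."""
    p_hat = successes / n
    z = (p_hat - p0) / math.sqrt(p0 * (1 - p0) / n)
    p_value = 1 - NormalDist().cdf(z)  # upper-tail area
    return z, p_value


def two_sided_ci(successes, n, level):
    """Part (d): large-sample (Wald) confidence interval for a proportion."""
    p_hat = successes / n
    z_star = NormalDist().inv_cdf(1 - (1 - level) / 2)
    margin = z_star * math.sqrt(p_hat * (1 - p_hat) / n)
    return p_hat - margin, p_hat + margin


if __name__ == "__main__":
    n = sum(sample_counts.values())
    stat, df, p_gof = chi_square_gof(sample_counts, census_props)
    print(f"(b) chi-square = {stat:.3f}, df = {df}, P-value = {p_gof:.4f}")
    z, p_z = one_sided_z_test(sample_counts["Low"], n, 0.2292)
    print(f"(c) z = {z:.3f}, P-value = {p_z:.4f}")
    lo, hi = two_sided_ci(sample_counts["Low"], n, 0.98)
    print(f"(d) 98% CI = ({lo:.4f}, {hi:.4f})")
```

A 98% two-sided interval leaves 1% in each tail, which is why it can be lined up against the one-sided test at α = 0.01 in part (c): if 0.2292 lies below the lower limit, the interval points to the same conclusion as rejecting the null hypothesis. The two procedures use slightly different standard errors (the null value in the test, the sample proportion in the interval), so borderline cases can disagree.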