Question
StatCrunch Assignment 2A Focus on a formulating and answering a question related to a categorical variable that can be answered using survey data and the
StatCrunch Assignment 2A
Focus on a formulating and answering a question related to a categorical variable that can be answered using survey data and the techniques have studied thus far in this course. The question formulate should be about the entire population of StatCrunch U studentsnot just about those in the sample. Assume that sample is representative of the population.
1.
Be focusing on one of the categorical variables in the sample made in assignment 1A. Indicate the variable on which will be focusing.
2.
Formulate a question related to the variable chose. Question should be simple and focus only on the variable chose. For example, my question might be 'What proportion of male students are in class 3 at StatCrunch University?' Are not allowed to use this example question. Be sure question is about the population (StatCrunch University), not just the students in sample.
3.
What type of graph/chart will use to describe data? Be sure to choose a graph/chart that is appropriate for a categorical variable.
4.
Describe the methodology that will use to answer the question.
5.
Answer the question formulated. Be sure to include the chart/graph made. Upload answer as a file.
Gender ClassHoursWorkLoansCC Debt
Female 414007286
Male1121730261525
Female 4180170564089
Female 41316.505283
Male212002109
Male1211045721717
Male3120127013787
Male41613.5144140
Female 41518.5156310
Female 114001318
Male31511114340
Female 2151487632961
Male315002970
Female 415005656
Female 315002684
Female 114039631182
Female 315005428
Female 11121.501036
Female 41028149452832
Female 492309654
Female 11200975
Male215082383066
Female 1929.537151301
Female 11517.544940
Female 11214.539771370
Female 317006127
Female 11810.550741007
Male114001191
Female 11517.529521186
Male313004079
Female 216067491923
Male413003730
Female 217003320
Female 115001367
Female 415003147
Male214070202842
Male112000
Female 215003257
Female 31217.5106672852
Female 116001178
Male1121946311902
Male3170153215166
Male315005128
Female 118001259
Male117001525
Male112001369
Female 1171332891261
Female 41417.5144107430
Female 26002682
Male2634.578900
Female 1161335451208
Male3160122462380
Female 115001313
Male118001325
Female 2171081653176
Female 4926.5149890
Female 114028031184
Female 11514.531431135
Female 117001230
Male111285441410
Female 2633.576052892
Male315004363
Female 217002914
Female 41512.5158970
Female 273472271842
Female 31515135562961
Female 4627.5151272152
Female 31415122132637
Female 220668842304
Female 21613109793067
Female 118001375
Female 415003521
Male2627.588013147
Male1113050031565
Male31519118644028
Female 414000
Female 3130126903500
Male115001678
Male316005479
Male4170017669
Female 417007535
Male212002060
Male3200110044863
Female 316004727
Male2627.578373105
Male216002784
Male390119943555
Male2151476963519
Male117000
Female 412004195
Female 21316.568222147
Female 2189.575062472
Female 415004957
Female 118024111267
Female 31119.5124932398
Female 21117.584833188
Male116001476
Female 11516.532571210
Male312006608
Male320004577
1.The female students are majority that are 61% based on the sample percentage than the male.
Female = 61
Male = 39
Total = 100
Proportion of female = 61/100 = 0.61
Proportion of male = 39/100 = 0.39
2.
Classes
Frequency
Freshman(1)
34
Sophomore(2)
23
Junior(3)
23
Senior(4)
20
The distribution is skewed to the right that the peak of the data is falling from left to right side of the graph.
3.
Classes
Freshman
Sophomore
Junior
Senior
Male
13
9
14
3
Female
21
14
9
17
From freshman to sophomore, female students are greater than male students. In junior class, female students are dropped down less than male students in junior year. In senior class, male students had dropped very low in senior year.
4.
Credit Hours
Frequency
5-9
11
10-14
32
15-19
53
20-23
4
Credit Hours: 6, 1, 0, 4, 6, 12, 5, 9, 35, 11, 7, 0, 4
Range = 35 - 0 = 35, there will be 4 credit hours. Class width = 35/4 = 8.75 9
Lowest Class Limit = 5
Class Width= 5
Number of Classes = 4
The distribution for credit hours are bell-shaped, which make a normal distribution.
5.1.) Mean =
=The sum of all terms = 457716
Sample size n = 100
Mean x = 457716/100 = 4577.16
2.) Median
Median= 2877.5
3.) Mode
Mode will be the most repeated number = 0
4.) Range
Range =
Range = 17056
5.) Variance
= 29142747.27
6.) Standard Deviation
Descriptive Statistics
Value
Mean
4577.16
Median
2877.5
Mode
0
Range
17056
Variance
29142747.27
Standard deviation
5398.40
6.
Female:
Five-number summary
minimum
0
Q1
1230
median
2398
Q3
3257
maximum
9654
Interquartile Range = 3257 - 1230 = 2027
IQR = 2027
Finding the outlier/s
Lower Fence = 1230 -1.5(2027) = -1810.5
Upper Fence = 3257 + 1.5(2027) = 6297.5
If the value is less than -1810.5 or greater than 6297.5, then is the outliers.
Outliers = 7286, 7430, 7535, 9654
Male:
Five-number summary
minimum
0
Q1
1476
median
2784
Q3
4028
maximum
17669
Interquartile Range = 4028 - 1476
IQR = 2552
Lower Fence = 2552 - 1.5(2552) = -1276
Upper Fence = 4028 + 1.5(2552) = 7856
The value less than -1276 or greater than 7856, then it is an outliers.
Outlier = 17669
Both of the boxplot are left skewed since the distance from Q1 has the larger median distance than the Q3 median distance. The female boxplot has 4 outliers while the male boxplot has only 1 outlier.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started