
Question


The purpose of this project is to familiarize you with basic concepts of statistics and data analysis. The exercise will help you develop an understanding of data analysis through data visualization and data summaries (descriptive statistics). You will conduct a basic analysis of a provided dataset to identify and understand the distributions of its variables and their effect on death events. The dataset includes a collection of variables that may affect or lead to the death of the patient. Using Microsoft Excel, you will analyze and visualize the data and present your results in a report, with appropriate explanation where applicable. Follow the steps below to conduct your analysis and compose your report. Be sure to include the charts you produce in Excel in your report.

Directions

1. Identify the variables according to their data types (numerical or categorical). (4 points)

2. Calculate the minimum, maximum, mean, mode, standard deviation and variance for all numerical variables. (4 points)
   a. Plot the frequency histogram to show the distribution of each variable. (4 points)
   b. Describe what conclusions you can draw from the histograms and statistics. (4 points)

3. Identify all categories of all categorical variables. (4 points)
   a. Plot (using a bar graph) the distribution of categories (the count of each category) for all categorical variables. Comment on the comparisons. (4 points)
   b. Identify the leading cause of death. (Hint: Plot and compare the number of deaths caused by diabetes, high blood pressure and smoking habit.) (4 points)

4. Does age or sex have any influence on the cause of death in this dataset? Explain. (4 points)

5. Compare the distributions of each numerical variable in the events of death. Use appropriate graphs. (Hint: For example, compare distribution of number of platelets when the patient either died or stayed alive). (4 points)
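The assignment asks for Excel, but the statistics in step 2 can be cross-checked outside the spreadsheet. The following is a minimal Python sketch, not part of the assignment: it uses only the first five ages from the sheet below as sample input, and the helper names `describe` and `frequency_bins` are made up for this illustration. It assumes the sample (n - 1) forms of standard deviation and variance, which are what Excel's STDEV.S and VAR.S compute.

```python
import statistics
from collections import Counter

# First five values of the "age (years)" column from the sheet below,
# used only as a small illustration; the report should use all 299 rows.
ages = [75, 55, 65, 50, 65]


def describe(values):
    """Return the summary statistics requested in step 2 for one column."""
    return {
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        "mode": statistics.mode(values),
        # Sample (n - 1) versions; use pstdev/pvariance instead if the
        # population versions are wanted.
        "stdev": statistics.stdev(values),
        "variance": statistics.variance(values),
    }


def frequency_bins(values, width):
    """Count values per bin; each key is the lower edge of a bin."""
    counts = Counter((v // width) * width for v in values)
    return dict(sorted(counts.items()))


summary = describe(ages)
bins = frequency_bins(ages, 10)
print(summary)
print(bins)
```

The bin counts returned by `frequency_bins` are the heights of the bars in a frequency histogram, so they can be compared directly against the chart produced in Excel.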

Thank you

The Excel sheet is below.

Patient id age (years) creatinine_phosphokinase diabetes high_blood_pressure platelets sex smoking DEATH_EVENT
1 75 582 0 1 265000 1 0 1
2 55 7861 0 0 263358 1 0 1
3 65 146 0 0 162000 1 1 1
4 50 111 0 0 210000 1 0 1
5 65 160 1 0 327000 0 0 1
6 90 47 0 1 204000 1 1 1
7 75 246 0 0 127000 1 0 1
8 60 315 1 0 454000 1 1 1
9 65 157 0 0 263358 0 0 1
10 80 123 0 1 388000 1 1 1
11 75 81 0 1 368000 1 1 1
12 62 231 0 1 253000 1 1 1
13 45 981 0 0 136000 1 0 1
14 50 168 0 1 276000 1 0 1
15 49 80 0 1 427000 0 0 0
16 82 379 0 0 47000 1 0 1
17 87 149 0 0 262000 1 0 1
18 45 582 0 0 166000 1 0 1
19 70 125 0 1 237000 0 0 1
20 48 582 1 0 87000 0 0 1
21 65 52 0 1 276000 0 0 0
22 65 128 1 1 297000 0 0 1
23 68 220 0 1 289000 1 1 1
24 53 63 1 0 368000 1 0 0
25 75 582 1 1 263358 0 0 1
26 80 148 1 0 149000 1 1 1
27 95 112 0 1 196000 0 0 1
28 70 122 1 1 284000 1 1 1
29 58 60 0 0 153000 1 0 1
30 82 70 1 0 200000 1 1 1
31 94 582 1 1 263358 1 0 1
32 85 23 0 0 360000 1 0 1
33 50 249 1 1 319000 0 0 1
34 50 159 1 0 302000 0 0 0
35 65 94 1 1 188000 1 0 1
36 69 582 1 0 228000 1 0 1
37 90 60 1 0 226000 1 0 1
38 82 855 1 1 321000 0 0 1
39 60 2656 1 0 305000 1 0 0
40 60 235 1 0 329000 0 0 1
41 70 582 0 1 263358 1 1 1
42 50 124 1 1 153000 0 1 1
43 70 571 1 1 185000 1 1 1
44 72 127 1 1 218000 1 0 0
45 60 588 1 0 194000 0 0 1
46 50 582 1 0 310000 1 1 1
47 51 1380 0 1 271000 1 0 1
48 60 582 1 1 451000 1 1 1
49 80 553 0 1 140000 1 0 1
50 57 129 0 0 395000 0 0 1
51 68 577 0 1 166000 1 0 1
52 53 91 0 1 418000 0 0 1
53 60 3964 1 0 263358 0 0 1
54 70 69 1 1 351000 0 0 1
55 60 260 1 0 255000 0 1 1
56 95 371 0 0 461000 1 0 1
57 70 75 0 0 223000 1 1 0
58 60 607 0 0 216000 1 1 0
59 49 789 0 1 319000 1 1 1
60 72 364 1 1 254000 1 1 1
61 45 7702 1 1 390000 1 0 1
62 50 318 0 1 216000 0 0 1
63 55 109 0 0 254000 1 1 0
64 45 582 0 0 385000 1 0 1
65 45 582 0 0 263358 0 0 0
66 60 68 0 0 119000 1 1 1
67 42 250 1 0 213000 0 0 1
68 72 110 0 0 274000 1 1 1
69 70 161 0 0 244000 0 0 1
70 65 113 1 0 497000 1 0 1
71 41 148 0 0 374000 1 1 0
72 58 582 1 0 122000 1 1 0
73 85 5882 0 0 243000 1 1 1
74 65 224 1 0 149000 1 1 0
75 69 582 0 0 266000 1 1 1
76 60 47 0 0 204000 1 1 1
77 70 92 0 1 317000 0 1 0
78 42 102 1 0 237000 1 0 0
79 75 203 1 1 283000 1 1 0
80 55 336 0 1 324000 0 0 0
81 70 69 0 0 293000 0 0 0
82 67 582 0 0 263358 1 1 0
83 60 76 1 0 196000 0 0 1
84 79 55 0 1 172000 1 0 0
85 59 280 1 1 302000 0 0 1
86 51 78 0 0 406000 1 0 0
87 55 47 0 1 173000 1 0 0
88 65 68 1 1 304000 1 0 0
89 44 84 1 1 235000 1 0 0
90 57 115 0 1 181000 1 0 0
91 70 66 1 0 249000 1 1 0
92 60 897 1 0 297000 1 0 0
93 42 582 0 0 263358 0 0 0
94 60 154 0 0 210000 1 0 1
95 58 144 1 1 327000 0 0 0
96 58 133 0 1 219000 1 0 0
97 63 514 1 1 254000 1 0 0
98 70 59 0 0 255000 0 0 0
99 60 156 1 1 318000 0 0 0
100 63 61 1 0 221000 0 0 0
101 65 305 0 0 298000 1 0 0
102 75 582 0 1 263358 1 0 0
103 80 898 0 0 149000 1 1 0
104 42 5209 0 0 226000 1 1 0
105 60 53 0 1 286000 0 0 0
106 72 328 0 1 621000 0 1 1
107 55 748 0 0 263000 1 0 0
108 45 1876 1 0 226000 1 0 0
109 63 936 0 0 304000 1 1 0
110 45 292 1 0 850000 1 1 0
111 85 129 0 0 306000 1 1 1
112 55 60 0 0 228000 1 1 0
113 50 369 1 0 252000 1 0 0
114 70 143 0 0 351000 0 0 1
115 60 754 1 1 328000 1 0 0
116 58 400 0 0 164000 0 0 0
117 60 96 1 1 271000 0 0 0
118 85 102 0 0 507000 0 0 0
119 65 113 1 1 203000 0 0 0
120 86 582 0 0 263358 0 0 1
121 60 737 0 1 210000 1 1 0
122 66 68 1 1 162000 0 0 0
123 60 96 1 0 228000 0 0 0
124 60 582 0 1 127000 0 0 0
125 60 582 0 0 217000 1 0 1
126 43 358 0 0 237000 0 0 0
127 46 168 1 1 271000 0 0 1
128 58 200 1 0 300000 0 0 0
129 61 248 0 1 267000 1 1 0
130 53 270 1 0 227000 1 0 0
131 53 1808 0 1 249000 1 1 0
132 60 1082 1 0 250000 1 0 0
133 46 719 0 1 263358 0 0 0
134 63 193 0 1 295000 1 1 0
135 81 4540 0 0 231000 1 1 0
136 75 582 0 0 263358 1 0 0
137 65 59 1 0 172000 0 0 0
138 68 646 0 0 305000 1 0 0
139 62 281 1 0 221000 0 0 0
140 50 1548 0 1 211000 1 0 0
141 80 805 0 0 263358 1 0 1
142 46 291 0 0 348000 0 0 0
143 50 482 1 0 329000 0 0 0
144 61 84 0 1 229000 0 0 0
145 72 943 0 1 338000 1 1 1
146 50 185 0 0 266000 1 1 0
147 52 132 0 0 218000 1 1 0
148 64 1610 0 0 242000 1 0 0
149 75 582 0 0 225000 1 0 1
150 60 2261 0 1 228000 1 0 0
151 72 233 0 1 235000 0 0 1
152 62 30 1 1 244000 1 0 0
153 50 115 0 1 184000 1 1 0
154 50 1846 1 0 263358 1 1 0
155 65 335 0 1 235000 0 0 0
156 60 231 1 0 194000 1 0 0
157 52 58 0 0 277000 0 0 0
158 50 250 0 0 262000 1 1 0
159 85 910 0 0 235000 1 0 0
160 59 129 0 1 362000 1 1 0
161 66 72 0 1 242000 1 0 0
162 45 130 0 0 174000 1 1 0
163 63 582 0 0 448000 1 1 0
164 50 2334 1 0 75000 0 0 1
165 45 2442 1 0 334000 1 0 1
166 80 776 1 1 192000 0 0 1
167 53 196 0 0 220000 1 1 0
168 59 66 1 0 70000 1 0 1
169 65 582 1 0 270000 0 0 0
170 70 835 0 1 305000 0 0 0
171 51 582 1 0 263358 1 1 0
172 52 3966 0 0 325000 1 1 0
173 70 171 0 1 176000 1 1 0
174 50 115 0 0 189000 1 0 0
175 65 198 1 1 281000 1 1 0
176 60 95 0 0 337000 1 1 0
177 69 1419 0 0 105000 1 1 0
178 49 69 0 0 132000 0 0 0
179 63 122 1 0 267000 1 0 0
180 55 835 0 0 279000 1 1 0
181 40 478 1 0 303000 1 0 0
182 59 176 1 0 221000 1 1 1
183 65 395 1 0 265000 1 1 1
184 75 99 0 1 224000 1 0 1
185 58 145 0 0 219000 1 1 1
186 60 104 1 0 389000 1 0 1
187 50 582 0 0 153000 0 0 1
188 60 1896 1 0 365000 0 0 1
189 60 151 1 1 201000 0 0 0
190 40 244 0 1 275000 0 0 0
191 80 582 1 0 350000 1 0 0
192 64 62 0 0 309000 0 0 0
193 50 121 1 0 260000 1 0 0
194 73 231 1 0 160000 1 1 0
195 45 582 0 1 126000 1 0 1
196 77 418 0 0 223000 1 0 1
197 45 582 1 1 263358 0 0 0
198 65 167 0 0 259000 0 0 0
199 50 582 1 1 279000 0 0 0
200 60 1211 1 0 263358 1 1 0
201 63 1767 0 0 73000 1 0 0
202 45 308 1 1 377000 1 0 0
203 70 97 0 1 220000 1 0 0
204 60 59 0 1 212000 1 1 0
205 78 64 0 0 277000 1 1 0
206 50 167 1 0 362000 0 0 0
207 40 101 0 0 226000 0 0 0
208 85 212 0 0 186000 1 0 0
209 60 2281 1 0 283000 0 0 0
210 49 972 1 1 268000 0 0 0
211 70 212 1 1 389000 1 1 0
212 50 582 0 1 147000 1 1 0
213 78 224 0 0 481000 1 1 0
214 48 131 1 1 244000 0 0 1
215 65 135 0 1 290000 1 0 0
216 73 582 0 1 203000 1 0 0
217 70 1202 0 1 358000 0 0 0
218 54 427 0 1 151000 0 0 1
219 68 1021 1 0 271000 1 0 0
220 55 582 1 1 371000 0 0 0
221 73 582 0 0 263358 1 0 1
222 65 118 0 0 194000 1 1 0
223 42 86 0 0 365000 1 1 0
224 47 582 0 0 130000 1 0 0
225 58 582 1 0 504000 1 0 0
226 75 675 1 0 265000 0 0 0
227 58 57 0 0 189000 1 1 0
228 55 2794 0 1 141000 1 0 0
229 65 56 0 0 237000 0 0 0
230 72 211 0 0 274000 0 0 0
231 60 166 0 0 62000 0 0 1
232 70 93 0 0 185000 1 1 0
233 40 129 0 0 255000 1 0 0
234 53 707 0 0 330000 1 1 0
235 53 582 0 0 305000 1 1 0
236 77 109 0 1 406000 1 0 0
237 75 119 0 1 248000 1 0 0
238 70 232 0 0 173000 1 0 0
239 65 720 1 0 257000 0 0 0
240 55 180 0 0 263358 1 1 0
241 70 81 1 1 533000 0 0 0
242 65 582 1 0 249000 1 1 0
243 40 90 0 0 255000 1 1 0
244 73 1185 0 1 220000 0 0 0
245 54 582 1 0 264000 1 0 0
246 61 80 1 0 282000 1 0 0
247 55 2017 0 0 314000 1 0 1
248 64 143 0 0 246000 1 0 0
249 40 624 0 0 301000 1 1 0
250 53 207 1 0 223000 0 0 0
251 50 2522 0 1 404000 0 0 0
252 55 572 1 0 231000 0 0 0
253 50 245 0 1 274000 1 0 0
254 70 88 1 1 236000 0 0 0
255 53 446 0 1 263358 1 0 0
256 52 191 1 1 334000 1 1 0
257 65 326 0 0 294000 0 0 0
258 58 132 1 1 253000 1 0 0
259 45 66 1 0 233000 1 0 0
260 53 56 0 0 308000 1 1 0
261 55 66 0 0 203000 1 0 0
262 62 655 0 0 283000 0 0 0
263 65 258 1 0 198000 1 0 1
264 68 157 1 0 208000 0 0 0
265 61 582 1 0 147000 1 0 0
266 50 298 0 0 362000 1 1 0
267 55 1199 0 0 263358 1 1 1
268 56 135 1 0 133000 1 0 0
269 45 582 1 0 302000 0 0 0
270 40 582 1 0 222000 1 0 0
271 44 582 1 1 263358 1 1 0
272 51 582 1 0 221000 0 0 0
273 67 213 0 0 215000 0 0 0
274 42 64 0 0 189000 1 0 0
275 60 257 1 0 150000 1 1 0
276 45 582 0 1 422000 0 0 0
277 70 618 0 0 327000 0 0 0
278 70 582 1 0 25100 1 0 0
279 50 1051 1 0 232000 0 0 0
280 55 84 1 0 451000 0 0 0
281 70 2695 1 0 241000 1 0 0
282 70 582 0 0 51000 1 1 0
283 42 64 0 0 215000 1 1 0
284 65 1688 0 0 263358 1 1 0
285 50 54 0 0 279000 1 0 0
286 55 170 1 0 336000 1 0 0
287 60 253 0 0 279000 1 0 0
288 45 582 1 0 543000 0 0 0
289 65 892 1 0 263358 0 0 0
290 90 337 0 0 390000 0 0 0
291 45 615 1 0 222000 0 0 0
292 60 320 0 0 133000 1 0 0
293 52 190 1 0 382000 1 1 0
294 63 103 1 0 179000 1 1 0
295 62 61 1 1 155000 1 1 0
296 55 1820 0 0 270000 0 0 0
297 45 2060 1 0 742000 0 0 0
298 45 2413 0 0 140000 1 1 0
299 50 196 0 0 395000 1 1 0
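If the rows above are pasted as plain text rather than opened in Excel, they can be read into records with a few lines of Python. This is a rough sketch under stated assumptions: the field names `patient_id` and `age` are stand-ins chosen here for the sheet's "Patient id" and "age (years)" headers (which contain spaces), every value is assumed to be a whole number as in the rows shown, and `SAMPLE` holds only three of the rows.

```python
# Column names chosen for this sketch; "patient_id" and "age" stand in for
# the sheet's "Patient id" and "age (years)" headers, which contain spaces.
FIELDS = [
    "patient_id", "age", "creatinine_phosphokinase", "diabetes",
    "high_blood_pressure", "platelets", "sex", "smoking", "DEATH_EVENT",
]

# Rows for patients 1, 2 and 15, copied from the sheet above.
SAMPLE = """\
1 75 582 0 1 265000 1 0 1
2 55 7861 0 0 263358 1 0 1
15 49 80 0 1 427000 0 0 0
"""


def parse_rows(text):
    """Turn whitespace-delimited rows into a list of dicts keyed by FIELDS."""
    records = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != len(FIELDS):
            continue  # skip blank or malformed lines
        records.append(dict(zip(FIELDS, map(int, parts))))
    return records


records = parse_rows(SAMPLE)
print(records[0]["age"], records[0]["platelets"])
```

Once the rows are in this form, a single column can be pulled out with a list comprehension such as `[r["platelets"] for r in records]`.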

For the variables diabetes, smoking, death event and high blood pressure: 0=YES, 1=NO

For the variable sex: 0=Male, 1=Female
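The comparison hinted at in step 3b amounts to counting, among the patients who died, how many had each condition. Below is a small Python sketch of that count, assuming the coding exactly as stated in the legend above (0 = YES, 1 = NO); the `yes` parameter makes the convention easy to flip if the sheet turns out to use the opposite one. The function name `deaths_by_factor` and the seven-row subset are illustrative only.

```python
from collections import Counter

YES = 0  # coding as stated in the legend above: 0 = YES, 1 = NO

# (diabetes, high_blood_pressure, smoking, DEATH_EVENT) for patients
# 1, 2, 3, 5, 15, 21 and 24 of the sheet; a small illustrative subset.
rows = [
    (0, 1, 0, 1),
    (0, 0, 0, 1),
    (0, 0, 1, 1),
    (1, 0, 0, 1),
    (0, 1, 0, 0),
    (0, 1, 0, 0),
    (1, 0, 0, 0),
]
FACTORS = ("diabetes", "high_blood_pressure", "smoking")


def deaths_by_factor(rows, yes=YES):
    """Among rows coded as a death, count how many have each factor present."""
    counts = Counter({name: 0 for name in FACTORS})
    for *flags, death in rows:
        if death != yes:
            continue  # patient is not coded as a death event
        for name, flag in zip(FACTORS, flags):
            if flag == yes:
                counts[name] += 1
    return counts


counts = deaths_by_factor(rows)
print(counts)
```

These counts are the bar heights for the chart in step 3b. They show how often each condition co-occurs with a death event, which is what the hint asks to compare; they do not by themselves establish cause.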
