Question
1. Classify each variable by data type (numerical or categorical).
2. Calculate the minimum, maximum, mean, mode, standard deviation and variance for all numerical variables.
a. Plot a frequency histogram for each numerical variable to show its distribution.
b. Describe what conclusions you can draw from the histograms and statistics.
3. Identify all categories of all categorical variables.
a. Plot the distribution of categories (the count of each category) as a bar graph for every categorical variable. Comment on how the categories compare.
b. Identify the leading cause of death. (Hint: plot and compare the number of deaths associated with diabetes, high blood pressure, and smoking.)
4. Do age and sex have any influence on the cause of death in this dataset? Explain.
5. Compare the distribution of each numerical variable between patients who died and patients who survived. Use appropriate graphs. (Hint: for example, compare the distribution of platelet counts for patients who died versus those who stayed alive.)
https://acrobat.adobe.com/id/urn:aaid:sc:VA6C2:9d4a2a98-4625-4954-95ca-2603511c7620
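The tasks above can be sketched in a few lines of Python. This is a minimal illustration only, using the standard library and a handful of made-up records. The column names (age, platelets, sex, smoking, death) and every value are assumptions for demonstration, not the real dataset. The helper names classify_columns, describe, category_counts, and split_by are likewise hypothetical.

```python
import statistics
from collections import Counter

# Hypothetical toy records; the real dataset's columns and values may differ.
records = [
    {"age": 60, "platelets": 250000, "sex": "M", "smoking": "yes", "death": "yes"},
    {"age": 75, "platelets": 180000, "sex": "F", "smoking": "no", "death": "yes"},
    {"age": 50, "platelets": 300000, "sex": "M", "smoking": "no", "death": "no"},
    {"age": 60, "platelets": 210000, "sex": "F", "smoking": "yes", "death": "no"},
    {"age": 45, "platelets": 270000, "sex": "M", "smoking": "no", "death": "no"},
]


def classify_columns(rows):
    """Task 1: split column names into numerical and categorical by value type."""
    numerical, categorical = [], []
    for name in rows[0]:
        values = [row[name] for row in rows]
        is_number = all(
            isinstance(v, (int, float)) and not isinstance(v, bool) for v in values
        )
        (numerical if is_number else categorical).append(name)
    return numerical, categorical


def describe(values):
    """Task 2: summary statistics (sample standard deviation and variance)."""
    return {
        "min": min(values),
        "max": max(values),
        "mean": statistics.mean(values),
        # multimode returns every most-frequent value, so ties are not hidden.
        "mode": statistics.multimode(values),
        "std": statistics.stdev(values),
        "variance": statistics.variance(values),
    }


def category_counts(rows, column):
    """Task 3: count how often each category occurs in a column."""
    return Counter(row[column] for row in rows)


def split_by(rows, column, group):
    """Task 5: collect the values of a numerical column per category of group."""
    groups = {}
    for row in rows:
        groups.setdefault(row[group], []).append(row[column])
    return groups


if __name__ == "__main__":
    numerical, categorical = classify_columns(records)
    print("numerical:", numerical)
    print("categorical:", categorical)
    for name in numerical:
        print(name, describe([row[name] for row in records]))
    for name in categorical:
        print(name, dict(category_counts(records, name)))
    print("age by death:", split_by(records, "age", "death"))
```

Two caveats apply. Classifying by value type alone would label a 0/1-coded categorical variable as numerical, so coded columns need to be checked against the dataset's documentation. For the plots, the lists returned by split_by and the counts from category_counts can be passed to a plotting library, for example matplotlib's hist and bar functions.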