Question
Problem: Explore the dataset with Simple Linear Regression. The data is found in the file hospital.csv, with the following columns: Column 1: Days: The number
Problem: Explore the dataset with Simple Linear Regression. The data is found in the file hospital.csv, with the following columns: Column 1: Days: The number of days a patient has been in the hospital. Column 2: Infect: The estimated probability of an infection for the patient. This data was collected as part of the SENIC project, and the overall goal is to build a model to predict a patient's risk of infection based on how many days they are in the hospital. It is requested that you attempt the following predictions: Predict the infection risk for a patient who stayed 9 days. Predict the average infection risk for a patients who stay 15 days. Predict the infection risk for a patient who stayed 40 days.
Format Submit a short report. This means you should write in full sentences, and have the following sections for each question, while being as specific as you can about your results. There should not be any "copy and pasted" R code in this report. Create an R Markdown document. I. Introduction. State the question you are trying to answer, why it is a question of interest (why might we be interested in the answer), and what approach you are going to take (just the name of the approach). II. Summary of your data. This should include things like plots (histograms, boxplots) including the interpretation of the plots, and summary values such as sample means and standard deviations. You should have an idea about the trend of the data from this section. Display your plots, summary statistics, and scatterplot in this section. III. Diagnostics. You should discuss the assumptions here. Remove outliers if necessary. IV. Analysis. Report back the model fit, confidence intervals, test-statistic/s, and p-value/s, nulls and alternatives. Include your predictions here. You may use tables here, but be sure that you organize your work. Remember to write your results in full sentences where possible. V. Interpretation. State your conclusion, and what inference you may draw from your corresponding tests or confidence intervals. These should all be in terms of your problem. VI. Conclusion. Summarize briefly your findings. Here you do not have to re-iterate your numeric values, but summarize all relevant conclusions.
Days | Infect |
7.13 | 4.1 |
8.82 | 1.6 |
8.34 | 2.7 |
8.95 | 5.6 |
11.2 | 5.7 |
9.76 | 5.1 |
9.68 | 4.6 |
11.18 | 5.4 |
8.67 | 4.3 |
8.84 | 6.3 |
11.07 | 4.9 |
8.3 | 4.3 |
12.78 | 7.7 |
7.58 | 3.7 |
9 | 4.2 |
11.08 | 5.5 |
8.28 | 4.5 |
11.62 | 6.4 |
9.06 | 4.2 |
9.35 | 4.1 |
7.53 | 4.2 |
10.24 | 4.8 |
9.78 | 5 |
9.84 | 4.8 |
9.2 | 4 |
8.28 | 3.9 |
9.31 | 4.5 |
8.19 | 3.2 |
11.65 | 4.4 |
9.89 | 4.9 |
11.03 | 5 |
9.84 | 5.2 |
11.77 | 5.3 |
13.59 | 6.1 |
9.74 | 6.3 |
10.33 | 5 |
9.97 | 2.8 |
7.84 | 4.6 |
10.47 | 4.1 |
8.16 | 1.3 |
8.48 | 3.7 |
10.72 | 4.7 |
11.2 | 3 |
10.12 | 5.6 |
8.37 | 5.5 |
10.16 | 4.6 |
19.56 | 6.5 |
10.9 | 5.5 |
7.67 | 1.8 |
8.88 | 4.2 |
11.48 | 5.6 |
9.23 | 4.3 |
11.41 | 7.6 |
12.07 | 7.8 |
8.63 | 3.1 |
11.15 | 3.9 |
7.14 | 3.7 |
7.65 | 4.3 |
10.73 | 3.9 |
11.46 | 4.5 |
10.42 | 3.4 |
11.18 | 5.7 |
7.93 | 5.4 |
9.66 | 4.4 |
7.78 | 5 |
9.42 | 4.3 |
10.02 | 4.4 |
8.58 | 3.7 |
9.61 | 4.5 |
8.03 | 3.5 |
7.39 | 4.2 |
7.08 | 2 |
9.53 | 5.2 |
10.05 | 4.5 |
8.45 | 3.4 |
6.7 | 4.5 |
8.9 | 2.9 |
10.23 | 4.9 |
8.88 | 4.4 |
10.3 | 5.1 |
10.79 | 2.9 |
7.94 | 3.5 |
7.63 | 5.5 |
8.77 | 4.7 |
8.09 | 1.7 |
9.05 | 4.1 |
7.91 | 2.9 |
10.39 | 4.3 |
9.36 | 4.8 |
11.41 | 5.8 |
8.86 | 2.9 |
8.93 | 2 |
8.92 | 1.3 |
8.15 | 5.3 |
9.77 | 5.3 |
8.54 | 2.5 |
8.66 | 3.8 |
12.01 | 4.8 |
7.95 | 2.3 |
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started