Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Use R instead of Python (R is a language used in the field of Data Science) to do these questions: The SUPPORT Study (Study to

Use R instead of Python (R is a language used in the field of Data Science) to do these questions:

The SUPPORT Study (Study to Understand Prognoses Preferences Outcomes and Risks of Treatments) was a five-hospital study of 10000 critically ill hospitalized adults. Patients were followed for in-hospital outcomes and for long-term survival. We analyze 35 variables and a random sample of 1000 patients from the study. To get the data, load the Hmisc package and run the command getHdata(support). (Use R Studio)

Explore the variables and patterns of missing data in the SUPPORT dataset.

(a) Print univariable summaries of all variables.

(b) Make a plot showing the extent of missing data and tendencies for some variables to be missing on the same patients. (Weve done this before, but alternate techniques from functions in the Hmisc package could be useful.)

(c) Total hospital costs (variable totcst) were estimated from hospital-specific Medicare cost- to-charge ratios. Characterize what kind of patients have missing totcst. For this charac- terization use the following patient descriptors: age,sex, dzgroup, num.co, edu, income, scoma, meanbp, hrt, resp, temp.

(d) Use predictive mean matching to multiply impute cost 10 times per missing observation. De- scribe graphically the distributions of imputed values and briefly compare these to distributions of non-imputed values.

(e) Using the multiple imputed values, develop an overall least squares model for total cost (using the log transformation). The model should use the predictors in Problem 1 and should not assume linearity in any predictor but should assume additivity.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Decisions Based On Data Analytics For Business Excellence

Authors: Bastian Weber

1st Edition

9358681683, 978-9358681680

More Books

Students also viewed these Databases questions

Question

Explain the importance of community and user dialogue exchange.

Answered: 1 week ago

Question

3. Describe the communicative power of group affiliations

Answered: 1 week ago