Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 11, 2024

BMED 4 7 0 2 Medical Information systems Midterm Assignment The data set we'll be using in the midterm assignment, ClaimsData.csv , is structured to

BMED

4702

Medical Information systems

Midterm Assignment

The data set we'll be using in the midterm assignment, ClaimsData.csv

,

is structured to

represent a sample of patients in the Medicare program, which provides health insurance to

Americans aged

65

and older, as well as some younger people with certain medical conditions.

The observations represent a

1 %

random sample of Medicare beneficiaries, limited to those

still alive at the end of

2008 .

Our independent variables are from

2008,

and we will be predicting cost in

2009 .

Our independent variables are the patient's age in years at the end of

2008,

and then several

binary variables indicating whether or not the patient had diagnosis codes for a particular

disease or related disorder in

2008

: alzheimers, arthritis, cancer, chronic obstructive pulmonary

disease, or copd, depression, diabetes, heart.failure, ischemic heart disease, or ihd, kidney

disease, osteoporosis, and stroke.

Each of these variables will take value

1

if the patient had a diagnosis code for the particular

disease and value

0

otherwise.

Reimbursement

2008

is the total amount of Medicare reimbursements for this patient in

2008 .

And reimbursement

2009

is the total value of all Medicare reimbursements for the patient in

2009 .

Bucket

2008

is the cost bucket the patient fell into in

2008,

and bucket

2009

is the cost bucket

the patient fell into in

2009 .

These cost buckets are defined using the thresholds determined by data supplier.

So the first cost bucket contains patients with costs less than $

3, 000,

the second cost bucket

contains patients with costs between $

3, 000

and $

8, 000,

the third cost bucket contains

patients with costs between $

8, 000

and $

19, 000,

and the fourth cost bucket contains patients

with costs between $

19, 000

and $

55, 000,

and fifth cost bucket contains patients greater than

55, 000 .

1

: Calculate the patient number percentages of each bucket by creating a table of the

variable bucket

2009

and divide by the number of rows in Claims.

Our goal will be to predict the cost bucket the patient fell into in

2009

using a CART model.

But before we build our model, we need to split our data into a training set

(

ClaimsTrain

)

and a

testing set

(

ClaimsTest

) .

Therefore, load the package caTools, and then set our random seed to

88

so that we all get the same split. And set SplitRatio to be

0.6 .

2

: What is the average age of patients in the training set, ClaimsTrain?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

101 Database Exercises Text Workbook

Authors: McGraw-Hill

2nd Edition

0028007484, 978-0028007489

More Books

Students also viewed these Databases questions

Question

★★★★★

Show how to compute the length of an LCS using only 2 min (m, n) entries in the c table plus O (1) additional space. Then show how to do this using min (m, n) entries plus O (1) additional space.

Answered: 1 week ago

Question

★★★★★

Compute the Jacobian J(u, v) of the following transformations. T: x= 3u, y = 2v + 2

Answered: 1 week ago

Question

★★★★★

38. An engineering system consisting of n components is said to be a k-out-ofn system (k n) if the system functions if and only if at least k of the n components function. Suppose that all...

Answered: 1 week ago

Question

★★★★★

Refer to the previous question. Suppose that each of the audit activities can be crashed by the amounts indicated in the following table. a. What is the earliest the audit could be completed and what...

Answered: 1 week ago

Question

★★★★★

BMED 4 7 0 2 Medical Information systems Midterm Assignment The data set we'll be using in the midterm assignment, ClaimsData.csv , is structured to represent a sample of patients in the Medicare...

Answered: 1 week ago

Question

★★★★★

The Pandemic supply chain disruption and war in Ukraine have disrupted international trade. How great has this disruption been, i.e., has it affected commodities, industrial goods, etc.? How much has...

Answered: 1 week ago

Question

★★★★★

Kohler Corporation reports the following components of stockholders' equity at December 31 of the prior year. Common stock-$20 par value, 100,000 shares authorized, 50,000 shares issued and...

Answered: 1 week ago

Question

★★★★★

Assessment Instructions 1. Answer the following four questions: i. What are your career goals? ii. What are you really good at professionally? iii. What are you not good at or not interested in doing...

Answered: 1 week ago

Question

★★★★★

How does your personality align with your organisation's culture? If you are not part of a specific organisation, reflect on the outcomes of the personality and spend some time reflecting on your...

Answered: 1 week ago

Question

★★★★★

Go tohttps://paulhammant.com/2016/09/26/visualizing-the-theory-of-constraints/ Find TOC for a manufacturing production line - Click "Click to Play" 1 What is the constraint (which station) and what...

Answered: 1 week ago

Question

★★★★★

Cybercriminals and ransomware attacks often target organizations with insecure systems. As a database designer, you need to understand the secure database design. Read the topic Resource "Security in...

Answered: 1 week ago

Question

★★★★★

f. Did they change their names? For what reasons?

Answered: 1 week ago

Question

★★★★★

2. How do these communication technologies change intercultural communication interaction?

Answered: 1 week ago

Question

★★★★★

1. How do electronic means of communication (e-mail, the Internet, fax, and so on) differ from face-to-face interactions?

Answered: 1 week ago

Previous Question Next Question