Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Data cleaning To have meaningful test results, we need to clean the datasetnhanes.nh1516 first. In the dataset, we saw the records with refused to answer

Data cleaning

To have meaningful test results, we need to clean the datasetnhanes.nh1516 first. In the dataset, we saw the records with "refused to answer" or "do not know" or else.

Usually we need to recode those records as missing because in the dataset they are treated as a category, but they are no real meaning.

To do that, runprocfreq to check which variables have those codes and need to recode .

For example ,

/***DUQ200: Ever used marijuana or hashish***/

procfreqdata = nhanes.nh1516;table DUQ200;run;

You can see three people answered 'refused' (7) and three people answered 'Do not know ' (9 ) once you check the dictionary from the CDC website. We will recode those people as missing.

image text in transcribedimage text in transcribed
Ever used marijuana or hashish Cumulative Cumulative DUQ200 Frequency Percent Frequency Percent 1715 50.03 1715 50.03 2 1707 49.80 3422 99.82 3 0.09 3425 99.91 9 3 0.09 3428 100.00 Frequency Missing = 6543

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Calculus Early Transcendentals

Authors: Jon Rogawski, Colin Adams, Robert Franzosa

4th Edition

1319055907, 9781319055905

More Books

Students also viewed these Mathematics questions

Question

Do I have evidence for this statement?

Answered: 1 week ago