Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

You are allowed at most zero DATA steps and seven PROC steps to complete this assignment. Create a library that points to the provided data

You are allowed at most zero DATA steps and seven PROC steps to complete this assignment.
Create a library that points to the provided data sets on your class shared drive. Additionally, create a library named
HW1 that points to the location where you will store any data sets you create. Also save any non-SAS results you
are required to produce (e.g., PDF) to the same directory as your HW1 data sets.
Recall, any path outside of an X statement must be relative!
Lecture 2 gave you the name and paths for all the relevant information I provide to you and your assignments
specifications provide any specific names you should use for that assignment.
In this assignment, youll start with the RTPtall data set from our shared library. You should recognize the data values
from HW0 a kind person has come along and rearranged the RaleighTempPrecip data set for you so that it has a
more useful structure! (Guess what youll be learning to do in a few weeks?!) Unfortunately, the kind person isnt
perfect, so the data set is not correctly sorted. Nevertheless, you have all the skills necessary to produce a report that
matches mine (HW1 Duggins Weather Analysis.pdf ). As always, there are a few elements youll need that cannot be
inferred from the provided report or some critical information I want you to have easy access to, for those Ive provided
these pointers!
The PDF uses the Festival style.
In PROC MEANS, the keyword for standard deviation is STDDEV and for interquartile range the keyword is
QRANGE. (For a full list of keywords, go to the SAS Documentation for the PROC MEANS statement here.)
My smaller titles and all footnotes use an 8 pt font. All other font sizes are the default and all the fonts and colors
are defaults as well.
In the formats for temperature, Ive used notation like [75,90) which would indicate that 75 is included in that
range and that 90 is not included. Pay attention to that when building your formats.
You now know how to ensure your date, page numbers, and proc titles are each included/excluded as needed.
In the frequency analysis table, note that missing values are included in the table (which requires you modify
the frequency analysis code to ensure that happens) and have a custom label (which requires you think about
this when building your custom format - you want to make sure that missing values would be included with the
appropriate label whether they appear as a row, a column, or both).
When producing output objects that require a subset of the data, I want you to practice creating subsets in a few
different ways so that you are comfortable with different approaches:
For the numeric summary table, select year using a numeric variable and select month by using a character
variable and specifying the values you do want.
For the weighted frequency analysis, select days of the week by using a character variable and specifying the
values you do not want.
For the listing (printout of records), use only numeric variables when specifying your conditions.
For the output objects that require BY-group analysis, keep in mind that your BY statements do not have to
match your PROC SORT, they just cannot conflict. We discussed this in class, but this is your chance to really
think about what that means in a practical application. (Related question: why would we not just re-sort the
data whenever we wanted to?!)
As discussed on the first day of class, we dont focus on analysis in this course because you have other classes
for that. However, to simulate the inclusion of statistical results in your work and to practice wonderful things
like ODS! youre being provided with some code to include in your program that will fit some statistical models
and place part of the results in your report. While you are not required to understand the code, you will need to
modify it using a concept that we did learn.
Insert the following into your program at the appropriate location to add the statistical model results to your
report.
ods select 'Type III Model ANOVA';
proc glm data = HW1.;
class DayC;
model Prcp = tMax tMin DayC;
run;
Programming HW #1
In the PROC GLM statement, replace the pseudocode with your actual data set name.
Inside the GLM step, replace the pseudocode statement with the an actual statement that will produce
separate statistical models for each distinct 15-year group in the data set.
Note that the provided ODS SELECT statement looks a little different than usual it has quotes! You dont
need to modify this at all and it will work fine if your GLM code is correct. However, to make sure you
understand ODS Id encourage you to figure out (a) how I found out what to put in those quotes and (b)
why I chose to use this particular ODS SELECT statement instead of the syntax youve seen before was it
necessary?
When creating the report, pl

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Concepts

Authors: David Kroenke

4th Edition

0136086535, 9780136086536

More Books

Students also viewed these Databases questions