Answered step by step
Verified Expert Solution
Question
1 Approved Answer
You are allowed at most zero DATA steps and seven PROC steps to complete this assignment. Create a library that points to the provided data
You are allowed at most zero DATA steps and seven PROC steps to complete this assignment.
Create a library that points to the provided data sets on your class shared drive. Additionally, create a library named
HW that points to the location where you will store any data sets you create. Also save any nonSAS results you
are required to produce eg PDF to the same directory as your HW data sets.
Recall, any path outside of an X statement must be relative!
Lecture gave you the name and paths for all the relevant information I provide to you and your assignments
specifications provide any specific names you should use for that assignment.
In this assignment, youll start with the RTPtall data set from our shared library. You should recognize the data values
from HW a kind person has come along and rearranged the RaleighTempPrecip data set for you so that it has a
more useful structure! Guess what youll be learning to do in a few weeks?! Unfortunately, the kind person isnt
perfect, so the data set is not correctly sorted. Nevertheless, you have all the skills necessary to produce a report that
matches mine HW Duggins Weather Analysis.pdf As always, there are a few elements youll need that cannot be
inferred from the provided report or some critical information I want you to have easy access to for those Ive provided
these pointers!
The PDF uses the Festival style.
In PROC MEANS, the keyword for standard deviation is STDDEV and for interquartile range the keyword is
QRANGE. For a full list of keywords, go to the SAS Documentation for the PROC MEANS statement here.
My smaller titles and all footnotes use an pt font. All other font sizes are the default and all the fonts and colors
are defaults as well.
In the formats for temperature, Ive used notation like which would indicate that is included in that
range and that is not included. Pay attention to that when building your formats.
You now know how to ensure your date, page numbers, and proc titles are each includedexcluded as needed.
In the frequency analysis table, note that missing values are included in the table which requires you modify
the frequency analysis code to ensure that happens and have a custom label which requires you think about
this when building your custom format you want to make sure that missing values would be included with the
appropriate label whether they appear as a row, a column, or both
When producing output objects that require a subset of the data, I want you to practice creating subsets in a few
different ways so that you are comfortable with different approaches:
For the numeric summary table, select year using a numeric variable and select month by using a character
variable and specifying the values you do want.
For the weighted frequency analysis, select days of the week by using a character variable and specifying the
values you do not want.
For the listing printout of records use only numeric variables when specifying your conditions.
For the output objects that require BYgroup analysis, keep in mind that your BY statements do not have to
match your PROC SORT, they just cannot conflict. We discussed this in class, but this is your chance to really
think about what that means in a practical application. Related question: why would we not just resort the
data whenever we wanted to
As discussed on the first day of class, we dont focus on analysis in this course because you have other classes
for that. However, to simulate the inclusion of statistical results in your work and to practice wonderful things
like ODS! youre being provided with some code to include in your program that will fit some statistical models
and place part of the results in your report. While you are not required to understand the code, you will need to
modify it using a concept that we did learn.
Insert the following into your program at the appropriate location to add the statistical model results to your
report.
ods select 'Type III Model ANOVA';
proc glm data HW;
class DayC;
model Prcp tMax tMin DayC;
run;
Programming HW #
In the PROC GLM statement, replace the pseudocode with your actual data set name.
Inside the GLM step, replace the pseudocode statement with the an actual statement that will produce
separate statistical models for each distinct year group in the data set.
Note that the provided ODS SELECT statement looks a little different than usual it has quotes! You dont
need to modify this at all and it will work fine if your GLM code is correct. However, to make sure you
understand ODS Id encourage you to figure out a how I found out what to put in those quotes and b
why I chose to use this particular ODS SELECT statement instead of the syntax youve seen before was it
necessary?
When creating the report, pl
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started