Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

HWZ #1. {a} [4 points} Define the data set ORGANIC5 as a data source for the project. The data is in our EM class server

image text in transcribed
HWZ #1. {a} [4 points} Define the data set ORGANIC5 as a data source for the project. The data is in our EM class server already {where you find the PVGQTNK data set}. Please checlt the model role and measurement level for the variables. {Use the data explanation table and compare itwith the properties you have in Enterprise Miner} How.I many observations are there in the ORGANICS data set? {b} {3 points} Set the model role for the target variable and examine the distribution ofthe variable using a random sample from the data table. Please note that although two target variables are listed, this exercise concentrates on the binary variable ORGY". So the other original target variable should be rejected. What is the proportion of individuals who purchased organic products? {c} [4 points} Please check the distribution [bar chart! for variable GENDER. Is there any transformation you want to suggest after checking the distribution of GENDER? {There is no need to do the transformation now, just give the suggestion} {d} {E points} Several variables are in the rejected role. [to you find any other variable better in the rejected role? If yes, why do you think it should be rejected? {You can use the data information table given in the beginning to see whether there are redundant, repeated1 irrelevant, inappropriate variables. Use exploration to double-check your thoughts. There's no need to use other nodes currently} Complete the definition of the data source. [I.e.. modify the modeling roles of the variables you find better in the rejected role in End} {e} [not grade} The same data set will be used in the later homework. So please malte sure you correct the settings after the solutions are discussed in class. {5D points In total} Enterprise Miner Exercise The ORG-ARIES exercise continued You are welcome to use any other software. Please download the data {from SAS Studio} to your own computer and transform it into the appropriate format {.csv, etc.) by yourself. For each question below. I am expecting some graph and associated comments. {a} {3'} Please create a diagram and pull the ORGANIC-3 data setinslde the diagram. {b} {3'} Please check whether there is any duplicated BUSTED. {c} {3'} Please double check EDATE Is a unary variable. {d} {ZU'j For all the variables except IEHJBTIIII1 ORGANIOS. GEMIEIIER1 and the rejected variables DOB. El:l.iI.TE1 LODATE. BILL please check the unlvarlate distribution and make comments. {e} {5'} Please double check the replication between DOB and AGE. {f} {5'1 Please double check the replication between AGE. AOEGROUPL and AGEGROUP. {g} {5'} Please double check the replication between TU_REOION and REGION. {h} {5'} Please double check the replication between NEIGHBORHOOD and "GROUP

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Real Analysis On Intervals

Authors: A D R Choudary, Constantin P Niculescu

1st Edition

8132221486, 9788132221487

More Books

Students also viewed these Mathematics questions

Question

How is a standardized residual different from a residual?

Answered: 1 week ago

Question

What is a key public for this product/service/concept?

Answered: 1 week ago