Question
DATA MINING QUESTION: Can someone please provide me with an example workflow (knime) in which I can use to develop a random forest classifier to
DATA MINING QUESTION:
Can someone please provide me with an example workflow (knime) in which I can use to develop a random forest classifier to predict whether or not the row entry (house sale) will be qualified or not (1=qualified, 0=not qualified). I have posted pictures and example entries for each of the attributes in the dataset.
If the example workflow is not possible, the main questions I have are:
- What pre-processing methods should I apply to the data set ? (e.g. column filter out row ID, SSL and attributes with string values maybe?)
- Do any transformations need to be performed?
- Are there any connections between specific combinations of attributes that I should be exploring as a starting point?
I am not really sure where to begin, so any advice would be greatly appreciated, thanks!
BATHRM HF BATHFHEAT HEAT D AC NUM UNI ROOMS BEDRM AYE YR RMDL EYB STORIES SALEDATEPRICE UALIFIEC SALE NUMGBA 28915 1605 005 11913 0938 00 41326 2588 00 88621 5154 090 31340 1957 00 4747 2751 00 2 2012-06-1 810000 2 1993-06-2350000 2 1900-01-01T00:00:00 2 2000-10-1 371000 2 2014-02-0 74900 2 2011-10-2 828000 2 1900-01 01T00:00:00 2 1900-01-01T00:00:00 13 Hot Water N 13 Hot WaterY 2 46015 2774 00 2 2018-04-0 625000 13128 0908 00 57883 3165 08 10749 0946 00 13 Hot Water N 3 Wall Furn N 13 Hot WaterY 2 2017-01-3 250000 2 2003-05-0635000 BLDG NUISTYLE STYLE D STRUCT STRUCT CGRADE GRADE D CNDTN CNDTN D EXTWALL EXTWALL ROOF ROOF_D INTWALL INTWALL KITCHENS FIREPLACE USECODE LANDAREAGIS_LAST_M 3750 2018-07-221T 12 7480 2018-07-22T 111603 2018 07-22T 1760 2018-07-22T 5000 2018-07-22T 2789 2018-07-22T 8969 2018-07-22T 7 Excellent BATHRM HF BATHFHEAT HEAT D AC NUM UNI ROOMS BEDRM AYE YR RMDL EYB STORIES SALEDATEPRICE UALIFIEC SALE NUMGBA 28915 1605 005 11913 0938 00 41326 2588 00 88621 5154 090 31340 1957 00 4747 2751 00 2 2012-06-1 810000 2 1993-06-2350000 2 1900-01-01T00:00:00 2 2000-10-1 371000 2 2014-02-0 74900 2 2011-10-2 828000 2 1900-01 01T00:00:00 2 1900-01-01T00:00:00 13 Hot Water N 13 Hot WaterY 2 46015 2774 00 2 2018-04-0 625000 13128 0908 00 57883 3165 08 10749 0946 00 13 Hot Water N 3 Wall Furn N 13 Hot WaterY 2 2017-01-3 250000 2 2003-05-0635000 BLDG NUISTYLE STYLE D STRUCT STRUCT CGRADE GRADE D CNDTN CNDTN D EXTWALL EXTWALL ROOF ROOF_D INTWALL INTWALL KITCHENS FIREPLACE USECODE LANDAREAGIS_LAST_M 3750 2018-07-221T 12 7480 2018-07-22T 111603 2018 07-22T 1760 2018-07-22T 5000 2018-07-22T 2789 2018-07-22T 8969 2018-07-22T 7 ExcellentStep by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started