Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

DATA MINING QUESTION: Can someone please provide me with an example workflow (knime) in which I can use to develop a random forest classifier to

DATA MINING QUESTION:

Can someone please provide me with an example workflow (knime) in which I can use to develop a random forest classifier to predict whether or not the row entry (house sale) will be qualified or not (1=qualified, 0=not qualified). I have posted pictures and example entries for each of the attributes in the dataset.

If the example workflow is not possible, the main questions I have are:

- What pre-processing methods should I apply to the data set ? (e.g. column filter out row ID, SSL and attributes with string values maybe?)

- Do any transformations need to be performed?

- Are there any connections between specific combinations of attributes that I should be exploring as a starting point?

I am not really sure where to begin, so any advice would be greatly appreciated, thanks!

image text in transcribedimage text in transcribed

BATHRM HF BATHFHEAT HEAT D AC NUM UNI ROOMS BEDRM AYE YR RMDL EYB STORIES SALEDATEPRICE UALIFIEC SALE NUMGBA 28915 1605 005 11913 0938 00 41326 2588 00 88621 5154 090 31340 1957 00 4747 2751 00 2 2012-06-1 810000 2 1993-06-2350000 2 1900-01-01T00:00:00 2 2000-10-1 371000 2 2014-02-0 74900 2 2011-10-2 828000 2 1900-01 01T00:00:00 2 1900-01-01T00:00:00 13 Hot Water N 13 Hot WaterY 2 46015 2774 00 2 2018-04-0 625000 13128 0908 00 57883 3165 08 10749 0946 00 13 Hot Water N 3 Wall Furn N 13 Hot WaterY 2 2017-01-3 250000 2 2003-05-0635000 BLDG NUISTYLE STYLE D STRUCT STRUCT CGRADE GRADE D CNDTN CNDTN D EXTWALL EXTWALL ROOF ROOF_D INTWALL INTWALL KITCHENS FIREPLACE USECODE LANDAREAGIS_LAST_M 3750 2018-07-221T 12 7480 2018-07-22T 111603 2018 07-22T 1760 2018-07-22T 5000 2018-07-22T 2789 2018-07-22T 8969 2018-07-22T 7 Excellent BATHRM HF BATHFHEAT HEAT D AC NUM UNI ROOMS BEDRM AYE YR RMDL EYB STORIES SALEDATEPRICE UALIFIEC SALE NUMGBA 28915 1605 005 11913 0938 00 41326 2588 00 88621 5154 090 31340 1957 00 4747 2751 00 2 2012-06-1 810000 2 1993-06-2350000 2 1900-01-01T00:00:00 2 2000-10-1 371000 2 2014-02-0 74900 2 2011-10-2 828000 2 1900-01 01T00:00:00 2 1900-01-01T00:00:00 13 Hot Water N 13 Hot WaterY 2 46015 2774 00 2 2018-04-0 625000 13128 0908 00 57883 3165 08 10749 0946 00 13 Hot Water N 3 Wall Furn N 13 Hot WaterY 2 2017-01-3 250000 2 2003-05-0635000 BLDG NUISTYLE STYLE D STRUCT STRUCT CGRADE GRADE D CNDTN CNDTN D EXTWALL EXTWALL ROOF ROOF_D INTWALL INTWALL KITCHENS FIREPLACE USECODE LANDAREAGIS_LAST_M 3750 2018-07-221T 12 7480 2018-07-22T 111603 2018 07-22T 1760 2018-07-22T 5000 2018-07-22T 2789 2018-07-22T 8969 2018-07-22T 7 Excellent

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Concepts

Authors: David Kroenke, David Auer, Scott Vandenberg, Robert Yoder

10th Edition

0137916787, 978-0137916788

More Books

Students also viewed these Databases questions

Question

What are the stages of project management? Write it in items.

Answered: 1 week ago

Question

1. Who is responsible for resolving this dilemma?

Answered: 1 week ago

Question

7. How might you go about testing these assumptions?

Answered: 1 week ago