Answered step by step
Verified Expert Solution
Question
1 Approved Answer
Use the python - based popular industry data mining software packages and development / analysis tools - Panda, Scikit - Learn, Seaborn, etc. Study the
Use the pythonbased popular industry data mining software
packages and developmentanalysis tools Panda,
ScikitLearn, Seaborn, etc. Study the related documentation
from the software user guide.
On the given dataset, execute any predictive data mining
technique such as decision trees, Bayesian classifiers, neural
networks, knearest neighbors, ensemble learning, linear
regression etc.
You can implement multiple techniques, one followed by another
on the same dataset if you prefer to use this for your analysis.
If needed, you should convert the format of your dataset as
needed by the respective techniques If you have already
performed conversion in the assignment on data preprocessing,
you can use the converted data here.
If you would like to use the results of your descriptive data
mining techniques and conduct further analysis with predictive
data mining, that is fine as well.
You should aim to achieve robustness and generalization in the
mining, eg by altering seeds in the algorithm.
You must modify the concerned parameters, eg learning rate
and error threshold for neural networks, the value of k for
knearest neighbors etc. to get good results. Execute at least
different combinations of parameters and present the
experimental results accordingly for at least one technique.
Observe the experimental results and draw useful conclusions
from the data.
Based on the hypothesis obtained by the learning in the
predictive data mining techniques write a simple program that
uses the learned hypothesis to classify new data, and thus
serves as a prototype mini classification tool.
If you have used multiple techniques, you can select the one
that gives greatest accuracy or the one that is most suitable to
your data and domain etc.
The program should communicate with the user to give
outputs based on new unseen data. For example, consider that
the learned hypothesis is based on a dataset that uses weather
data to predict if it is okay to play tennis. This can include various
attributes such as temperaturechanceofrain and so forth to
estimate the target playing tennis as being yesno or
maybe It can be used to manually derive rules such as if
temperature medium and chanceofrain low then playtennis
yes which can be coded into the program. Thereafter, when a
user inputs new values for parameters such as temperature
the program can use these rules to output the classification
target and thereby suggest to the user Yes you can surely play
tennis today or No you should not play tennis today or
Maybe it seems okay to play tennis today This example can
be modified based on your dataset and domain.
Show at least different runs of such inputs and outputs
based on user interaction. The final goal is to have simple
communication with the user based on the learning done via
predictive data mining for classifying the target attribute. Hence,
please work accordingly based on your respective techniques
and application. You can tune this as per the needs of your MS
project, research topics or any other area of interest.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started