Question
You are expected to visit data repositories such as Kaggle, the UCI
Machine Learning Repository, PROMISE, etc. and extract three
different datasets in the areas of classification, clustering and
regression respectively. Note that the dataset for classification
should be unbalanced. ([...] marks)
Preprocess the datasets by addressing the following
issues: ([...] marks)
o Missing data values
o Duplicate instances
o Outlier detection
o Influential datapoint detection
o Checking normality of the set of features
o Data transformation
o Feature selection
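
Three of the preprocessing items above (duplicate instances, missing data values and outlier detection) can be sketched in plain Python as follows. This is a minimal illustration, not a prescribed solution: the function names, the choice of median imputation and the 1.5 × IQR rule are assumptions, and rows are assumed to be lists with `None` marking a missing value.

```python
from statistics import median, quantiles


def drop_duplicates(rows):
    """Remove exact duplicate rows, keeping the first occurrence."""
    seen, unique = set(), []
    for row in rows:
        key = tuple(row)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def impute_median(rows, col):
    """Replace None in one column with the median of the observed values.

    Median imputation is an assumed choice; other strategies may suit
    a given dataset better.
    """
    observed = [r[col] for r in rows if r[col] is not None]
    fill = median(observed)
    return [
        r[:col] + [fill if r[col] is None else r[col]] + r[col + 1:]
        for r in rows
    ]


def iqr_outliers(values, k=1.5):
    """Return indices of values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [i for i, v in enumerate(values) if v < low or v > high]
```

The remaining items (influential datapoints, normality checks, transformation, feature selection) are not covered by this sketch.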
In the case of the unbalanced dataset, consider an appropriate
approach to balance the dataset, e.g. you can use an
oversampling technique such as SMOTE, ADASYN, MAHAKIL,
etc. to balance the dataset. ([...] marks)
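
To illustrate the idea behind SMOTE-style oversampling, the sketch below creates synthetic minority samples by interpolating between a minority point and one of its nearest minority-class neighbours. It is a simplified illustration, not the reference SMOTE algorithm: the function name `smote_like`, its parameters and the random choice of base points are assumptions, and at least two minority samples with numeric features are assumed.

```python
import math
import random


def smote_like(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points (simplified, SMOTE-style).

    Each synthetic point lies on the line segment between a randomly
    chosen minority sample and one of its k nearest minority neighbours.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest neighbours of `base` within the minority class,
        # excluding `base` itself.
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> other
        synthetic.append([b + gap * (o - b) for b, o in zip(base, other)])
    return synthetic
```

Oversampling is normally applied to the training portion only, so that synthetic points do not leak into the validation or holdout data.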
Select appropriate learners (at least [...] learners for each dataset)
for the training and validation needs, and justify from the literature
why those learners are relevant for such datasets. ([...] marks)
Consider the following approaches for training and validating
the models:
o K-fold cross-validation ([...] marks)
o Leave-one-out cross-validation ([...] marks)
o Percentage split, e.g. [...] for training and [...] for
validation ([...] marks)
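
The three validation schemes above differ only in how sample indices are divided between training and validation. The sketch below generates those index splits in plain Python. The function names and the `train_fraction` parameter are hypothetical, and the split percentages intended by the assignment are not shown in the question, so the caller must supply them.

```python
import random


def k_fold_indices(n, k, seed=0):
    """Yield (train, validation) index lists for k-fold cross-validation."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    # Deal the shuffled indices into k folds of near-equal size.
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        validation = folds[i]
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, validation


def leave_one_out_indices(n):
    """Leave-one-out: each sample is the validation set exactly once."""
    for i in range(n):
        yield [j for j in range(n) if j != i], [i]


def percentage_split(n, train_fraction, seed=0):
    """Single shuffled split; train_fraction is chosen by the caller."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    cut = int(n * train_fraction)
    return idx[:cut], idx[cut:]
```

Leave-one-out is the special case of k-fold with k equal to the number of samples, which makes it costly on large datasets.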
Consider appropriate evaluation measures across the
approaches for training and validation to assess the
performance of the models. Select the best model/learner based
on a good justification. ([...] marks)
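
As an illustration of evaluation measures that could be compared across the validation approaches, the sketch below computes precision, recall and F1 for a classifier and RMSE for a regressor. The choice of these particular measures is an assumption rather than a requirement of the question. On an unbalanced dataset, plain accuracy can be misleading, which is why per-class measures are shown.

```python
import math


def precision_recall_f1(y_true, y_pred, positive=1):
    """Precision, recall and F1 for the class labelled `positive`."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def rmse(y_true, y_pred):
    """Root mean squared error for regression predictions."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))
```

Averaging such scores over the folds of each validation scheme gives one comparable number per learner.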
Based on the results obtained from the prediction, classification and
clustering across the three datasets, provide the necessary plots to
aid in visualizing the results obtained. ([...] marks)
Use a set of holdout data to predict, classify or cluster into the
right bins and provide good visuals/plots of the results.
([...] marks)
Discuss your visualized results. ([...] marks)