Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

PYTHON : Create functions and loops when needed to avoid manual work process and analyse one of the datasets available (heart disease, cancer dataset or

PYTHON :

Create functions and loops when needed to avoid manual work

process and analyse one of the datasets available (heart disease, cancer dataset or any dataset available in libraries as sklearn ot tensorflow).

perform data preprocessing (change text to numbers, remove rows with no data, scaling

create at least 3 different feature sets based on correlation analysis or other EDA you have generated

use cross validation for spliting test and training data. generate stats about the samples in both test and training set.

create 4 supervised learning models using KNN, , SVM (there are 2 modesl), NB (there are 3 models) and Random Forest

run the models on each of the three features and save accuracy, precision, f measure and recall of each model in a colum in a dataframe

use the datarafme above to compare the perfromance of models using roc_arc and precions/accuracy

For each classifier compare the result obained by each feature set

For each feature set, compare the result obtained by each classifier

revisit your models, change parameters value and observe the changes in performance

analyse the result and explain why the model performed in a certain way and what are possible option to improve it. (e.g one could have performed better with True Positive and the other with True Negative.

use Ensembling classification algorithm to combine the result of top 5 model.

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access with AI-Powered Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Students also viewed these Databases questions