Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Project 1 Iris Dataset Context: This is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic

Project 1 Iris Dataset
Context: This is perhaps the best known database to be found in the pattern recognition
literature. Fisher's paper is a classic in the field and is referenced frequently to this day. The
data set contains 3 classes of 50 instances each, where each class refers to a type of iris plant.
One class is linearly separable from the other 2; the latter are NOT linearly separable from each
other.
Dataset Source: click here
Task: Predict the class of iris plant.
Questions to Answer:
Perform EDA and Explore the features using histograms
Encode the target variable ie convert it to numeric type
Experiment using two different ratios of training, validation and test data ie 60-20-20,
80-10-10. On the two different split ratios do the following
o Implement KFold Cross Validation
o Implement Grid Search to find optimal hyperparameters for any 3 algorithms
(out of LR, SVM, MLP, RF, Boosting)
o Analyze the results on Validation set and test set and mention which model
performed the best and why?
o Compare the performance of models(using precision, recall, accuracy, latency).
What was the best proportion or split ratio of data from the set of experiments
you conducted?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Students also viewed these Databases questions

Question

politeness and modesty, as well as indirectness;

Answered: 1 week ago