Numpy, pandas, and SKLearn packages allowed Select the pima diabetes dataset with binary target values from https machinelearningmastery com standard machine learning datasets Use pandas to read CSV file as dataframe (1pt) e g The following code helps import pima diabetes dataset col names 'pregnant', 'glucose', 'bp', 'skin', 'insulin', 'bmi', 'pedigree', 'age', 'label' load dataset pima pd read csv( pima indians diabetes database csv , header None, names col names) Select 5 (if not possible then select 4) features from the chosen dataset (1pt) List all features you selected in your report For example, the following code will select two features feature cols 'pregnant', 'age' X pima feature cols Use train test split from sklearn model selection to split test and training data by 40 testing 60 training (1pt) Fit your model with training data and test your model after fitting Calculate and print out the confusion matrix (1pt) precision score, recall score, F score (3pts)

The Answer is in the image, click to view ...

Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 27, 2024

Numpy, pandas, and SKLearn packages allowed Select the pima diabetes dataset with binary target values from https://machinelearningmastery.com/standard-machine-learning-datasets/ Use pandas to read CSV file as dataframe.

**Numpy, pandas, and SKLearn packages allowed**

Select the pima diabetes dataset with binary target values from

https://machinelearningmastery.com/standard-machine-learning-datasets/

Use pandas to read CSV file as dataframe. (1pt)

e.g. The following code helps import pima diabetes dataset

col_names = ['pregnant', 'glucose', 'bp', 'skin', 'insulin', 'bmi', 'pedigree', 'age', 'label']

# load dataset

pima = pd.read_csv("pima-indians-diabetes-database.csv", header=None, names=col_names)

Select 5 (if not possible then select 4) features from the chosen dataset. (1pt)

List all features you selected in your report.

For example, the following code will select two features

feature_cols = ['pregnant', 'age']

X = pima[feature_cols]

Use train _test_split from sklearn.model_selection to split test and training data by 40% testing + 60% training. (1pt)

Fit your model with training data and test your model after fitting.

Calculate and print out

the confusion matrix (1pt)

precision score, recall score, F score (3pts)

Step by Step Solution

There are 3 Steps involved in it

Step: 1

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

Step: 3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Informix Database Administrators Survival Guide

Authors: Joe Lumbley

1st Edition

0131243144, 978-0131243149

More Books

Students also viewed these Databases questions

Question

★★★★★

What is the meaning of the terms present value and future value? How can you determine whether to calculate the present value or the future value of an amount?

Answered: 1 week ago

Question

★★★★★

Explain in words how much better the three predictor variables combined predict PERCENT RAISE than the best single predictor by itself.

Answered: 1 week ago

Question

★★★★★

=+place to look for data on mutual funds is http:// www.morningstar.com.) What do you learn from this comparison?

Answered: 1 week ago

Question

★★★★★

Dan Jacobs, production manager for GreenLife, invested in computer-controlled production machinery last year. He purchased the machinery from Superior Design at a cost of $3,000,000. A representative...

Answered: 1 week ago

Question

★★★★★

**Numpy, pandas, and SKLearn packages allowed** Select the pima diabetes dataset with binary target values from https://machinelearningmastery.com/standard-machine-learning-datasets/ Use pandas to...

Answered: 1 week ago

Question

★★★★★

Identify an advantage of organizational conflict from the scenarios below. A team develops and participates in healthy competition to encourage innovation and creativity. Individuals avoid making eye...

Answered: 1 week ago

Question

★★★★★

Question: In Java, which of the following statements correctly describes the behavior of the final keyword when applied to a method? A ) A final method cannot be overridden by subclasses, ensuring...

Answered: 1 week ago

Question

★★★★★

Question: In Java, which of the following statements correctly describes the behavior of the final keyword when applied to a method? A ) A final method cannot be overridden by subclasses, ensuring...

Answered: 1 week ago

Question

★★★★★

Question : Which of the following statements accurately describes the outcome of a final variable in a Java class? A ) Once initialized, the value of a final variable can be changed in the same...

Answered: 1 week ago

Question

★★★★★

Howard Saari was employed by Smith Barney, Harris Upham & Co., Inc., as an account executive beginning in July 1988. He alleges that his work was satisfactory at all times. According to Saari\'s...

Answered: 1 week ago

Question

★★★★★

WHAT IS AUTOMATION TESTING?

Answered: 1 week ago

Question

★★★★★

=+ (a) affect the demand for loanable funds in world fi nancial markets?

Answered: 1 week ago

Question

★★★★★

=+ b. How would the change you describe in part

Answered: 1 week ago

Question

★★★★★

=+ a. If the worlds poor nations offer better production effi ciency and legal protection, what would happen to the investment demand function in those countries?

Answered: 1 week ago

Previous Question Next Question

Question

**Numpy, pandas, and SKLearn packages allowed** Select the pima diabetes dataset with binary target values from https://machinelearningmastery.com/standard-machine-learning-datasets/ Use pandas to read CSV file as dataframe.

Step by Step Solution

Step: 1

Get Instant Access to Expert-Tailored Solutions

Step: 2

Step: 3

Ace Your Homework with AI

Recommended Textbook for

Informix Database Administrators Survival Guide

Students also viewed these Databases questions

Question

Question

Question

Question

Question

Question

Question

Question

Question

Question

Question

Question

Question

Question

Numpy, pandas, and SKLearn packages allowed Select the pima diabetes dataset with binary target values from https://machinelearningmastery.com/standard-machine-learning-datasets/ Use pandas to read CSV file as dataframe.