Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Problem: Logistics Regression For this problem, you have a file containing 768 records from the National Institute of Diabetes and Digestive and Kidney Diseases. This

Problem: Logistics Regression

For this problem, you have a file containing 768 records from the National Institute of Diabetes and Digestive and Kidney Diseases. This is a subset from a larger database. Your task is to use Logistics Regression to predict an Outcome of diabetes. Some of the records are missing data. Part of this task is to clean up the data. You must use your Jupyter Notebook with python3, to change any of the records, so that the process can be run against my dataset. Develop the best Logistics Regression classifier that you can after cleaning up the data and analyze your results. Follow the process outlined in class. Explain your choices! Turn in your Jupiter notebook file and your write-up. Do Not Turn In The Data! If you change the data in anyway, it must be through your Jupyter Notebook file. Dont forget to include your visualizations.

image text in transcribed

Rent / Buy AutoSave Of Search File Home Inser Review Sell View Help X Calibri General In: XD Paste $ % 899 Conditional Format as Cell Formatting Table Styles Styles FC Clipboard Font Alignment Number IN C A1 X fx Pregnancies Outcome 50 1 1 31 0 32 1 21 0 33 1 30 0 40 A B C D E F G H 1 Pregnancies Glucose Blood Pressure Skin Thickness Insulin BMI Diabetes Pedigree Function Age 2 6 148 72 35 0 33.6 0.627 3 1 85 66 29 0 26.6 0.351 4 8 183 64 0 0 23.3 0.672 5 5 1 89 66 23 94 28.1 0.167 6 0 137 35 168 43.1 2.288 7 5 116 74 0 0 25.6 0.201 8 3 78 50 32 88 31 0.248 9 10 115 0 0 0 35.3 0.134 10 2 197 70 45 543 30.5 0.158 11 8 125 96 0 0 0 0.232 12 4 110 92 0 0 37.6 0.191 13 10 168 0 0 38 0.537 14 10 139 80 0 0 27.1 1.441 15 1 189 60 23 846 30.1 0.398 16 5 166 72 19 175 25.8 0.587 17 7 100 0 0 0 30 0.484 18 0 118 84 47 230 45.8 0.551 19 7 107 74 0 0 29.6 0.254 20 1 103 30 38 83 43.3 0.183 21 1 115 70 30 96 34.6 0.529 1 0 1 1 0 26 29 53 54 30 34 57 59 51 32 74 1 0 1 1 1 31 31 1 1 33 0 32 1 diabetes2a Rent / Buy AutoSave Of Search File Home Inser Review Sell View Help X Calibri General In: XD Paste $ % 899 Conditional Format as Cell Formatting Table Styles Styles FC Clipboard Font Alignment Number IN C A1 X fx Pregnancies Outcome 50 1 1 31 0 32 1 21 0 33 1 30 0 40 A B C D E F G H 1 Pregnancies Glucose Blood Pressure Skin Thickness Insulin BMI Diabetes Pedigree Function Age 2 6 148 72 35 0 33.6 0.627 3 1 85 66 29 0 26.6 0.351 4 8 183 64 0 0 23.3 0.672 5 5 1 89 66 23 94 28.1 0.167 6 0 137 35 168 43.1 2.288 7 5 116 74 0 0 25.6 0.201 8 3 78 50 32 88 31 0.248 9 10 115 0 0 0 35.3 0.134 10 2 197 70 45 543 30.5 0.158 11 8 125 96 0 0 0 0.232 12 4 110 92 0 0 37.6 0.191 13 10 168 0 0 38 0.537 14 10 139 80 0 0 27.1 1.441 15 1 189 60 23 846 30.1 0.398 16 5 166 72 19 175 25.8 0.587 17 7 100 0 0 0 30 0.484 18 0 118 84 47 230 45.8 0.551 19 7 107 74 0 0 29.6 0.254 20 1 103 30 38 83 43.3 0.183 21 1 115 70 30 96 34.6 0.529 1 0 1 1 0 26 29 53 54 30 34 57 59 51 32 74 1 0 1 1 1 31 31 1 1 33 0 32 1 diabetes2a

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Data Mining Concepts And Techniques

Authors: Jiawei Han, Micheline Kamber, Jian Pei

3rd Edition

0123814790, 9780123814791

More Books

Students also viewed these Databases questions

Question

6. How does knowledge management attain its primary objective?

Answered: 1 week ago