Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Aug 23, 2024

1.In this part you will use Python to analyze the heart disease data set (the link and explanation is included here) by training and building

image text in transcribed

1.In this part you will use Python to analyze the heart disease data set (the link and explanation is included here) by training and building a model with regression analysis. Test your model and discuss the result of your test with performance metrics. Make sure you separate training set and testing data properly. Then analyse the input data and explain which of them have more effects on output and modify your models by eliminating non significant variables. (5 marks) Heart Disease Dataset: Here, is the link for heart disease dataset of patients. http://archive.ics.uci.edu/ml/datasets/Heart+Disease After going to this link you will find two folders: One: Data Folder and two: Dataset description. Data folder that has the dataset. It is better to use processed cleveland data. In the dataset description folder, you will find the description about the columns' names referring to the14 column of the dataset as the following: The last one attribute (number 14) is the result. Include your R source code of regression analysis, training and generating results. Here are the example of attributes and their Information (please see data set documents for more details) 1. \#3 (age) 2. \#4 (sex) 3. \#9 (cp) 4. \#10 (trestbps) 5. \#12 (chol) 6. \#16 (fbs) 7. \#19 (restecg) 8. \#32 (thalach) 9.\#38 (exang) 10. \#40 (oldpeak) 13. \#51 (thal) 14. \#58 (num) -------------->result For more information related to this assignment you can read Chapter 2 and Linear Regression section of Chapter 3 of "Doing Data Science" book. 2.Nonlinear Models In this part you will use the heart disease data set to analyse the data with logistic regression analysis ( or any other nonlinear classifier on your choice) and compare with linear regression analysis then answering which method is better. First use two models as the estimator (with CPS521 Lab4 numerical result). Here you need to compare both methods by calculating Errors such as Mean Square Error (MSE) and other performance metrics (R-squared) to find which method can do prediction more accurately. Make sure you separate training set and testing data and there is no overfitting. Exercise-6 (1 Mark) Download the Excel file "Sample-probability-distributions-graph.xlsx" of the sample distributions from the lecture notes. By visual observation and running the regression analysis (for example by Excel regression analysis) find out which probability distribution is linear. You can examine fitting the distribution data by using linear regression model or by explaining the equation of each distribution

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Database Reliability Engineering Designing And Operating Resilient Database Systems

Database Reliability Engineering Designing And Operating Resilient Database Systems

Authors: Laine Campbell, Charity Majors

1st Edition

978-1491925942

More Books

Students also viewed these Databases questions

Question

★★★★★

Both of the following questions are essentially the same. Does the difference in wording seem as though it could affect the way that people respond? Are you in favor of the "Defense of Marriage...

Answered: 1 week ago

Question

★★★★★

A 2.00-L container at 463 K contains 0.500 mol phosgene. The equilibrium constant, K c , is 4.93 10 -3 for Calculate the concentrations of all species after the system reaches equilibrium. COC1(g) =...

Answered: 1 week ago

Question

★★★★★

9.58 Acidity in Rainfall Refer to Exercise 8.34 and the collection of water samples to estimate the mean acidity (in pH) of rainfalls in Eastern Canada. As noted, the pH for pure rain falling through...

Answered: 1 week ago

Question

★★★★★

The International Bank of Commerce (IBC) is an audit client of your public accounting firm. IBC is a multinational financial institution that operates in 23 countries. During the current years audit,...

Answered: 1 week ago

Question

★★★★★

The Alpine House, Incorporated, Is a large retaller of snow skIs. The company assembled the information shown below for the quarter ended March 3 1 : \ table [ [ , Amount ] , [ Sales , $ 1 5 0 , 0 0 0

Answered: 1 week ago

Question

★★★★★

The following data has been taken from the 2nd department of Albari Company: Cost from proceeding department: $9,675 Cost added by the department (Materials + Conversion): $6,500 Units received from...

Answered: 1 week ago

Question

★★★★★

You have a conference room that contains a Teams Rooms on Windows device. You need to update the background image of the device with new company branding. What should you do? Select only one answer....

Answered: 1 week ago

Question

★★★★★

How many moving trucks, professional movers, and moving cost would it take to support supporting a Sensitive Compartmented Information Facility (SCIF) move to another building? Use the following...

Answered: 1 week ago

Question

★★★★★

What is the speedup of the basic pipeline (without bypassing) shown in the Introduction slides (Tsequential / Tpipelined) if 1/5 of all instructions contain a data dependence? Can you give a general...

Answered: 1 week ago

Question

★★★★★

Write a one hundred and fifty word response about which resource(s) you found to be most useful in terms of not only researching the different types of networks that exist, but which might be best...

Answered: 1 week ago

Question

★★★★★

Person is a named tuple with fields: first_name, last_name, and age. Variable person_data is a Person object created with two strings and one integer read from input as attributes. Output the...

Answered: 1 week ago

Question

★★★★★

What are some creative, and low cost/no cost, ways that an organization could make a new employee feel welcome?

Answered: 1 week ago

Question

★★★★★

What specific types of future training should BC Ferries focus on to ensure ongoing success?

Answered: 1 week ago

Question

★★★★★

BC Ferries involves front-line employees in the development of SEA material and STC curricula. What could be a potential downfall with this approach?

Answered: 1 week ago

Previous Question Next Question