
Question


Posted on Sep 22, 2024


Question 1. Using the multivariate data in the file fld1.xlsx:
(a) Determine the discriminant line found by Fisher's Linear Discriminant.
(b) Plot both the data and the discriminant line on a scatter plot.
(c) Using this line, determine the class of each data point in the dataset, assuming that the threshold is 0 (i.e. positive values are in one class and negative values in the other).
(d) Determine what percentage of data points are incorrectly classified.
NOTE: The first 2 columns in fld1.xlsx are data columns. The third column is the class to which each data point belongs.

Question 2. Using the multivariate data in the file spam.xlsx, determine the discriminant line found by Fisher's Linear Discriminant. Using this line, determine the class of each data point in the dataset, assuming that the threshold is 0.
NOTE: The first 57 columns in spam.xlsx are data columns. The 58th column is the class to which each data point belongs.
Determine what percentage of the data points are incorrectly classified. You will notice that most of the data in the first class is classified correctly while the data in the second class is not. Therefore, it makes sense to adjust the threshold. Repeat the classification a few times with the threshold set to a small negative number to see whether you can improve the overall classification error rate (the percentage of errors in BOTH classes).

FYI, information about the dataset is given in the folder with the spam dataset. This is a real dataset from the Machine Learning Repository at UCI (University of California, Irvine). I simply made it a bit smaller by using only the first 500 data points from the first class and the first 500 data points from the second. The original database has over 4000 data points, which was awkward to work with in Excel. I have not, however, reduced the number of attributes. For those who are interested in a real-life example, this is one: a dataset of attributes used to try to detect spam email.
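The steps the question asks for (find the Fisher direction, score each point, threshold the score, count the errors) can be sketched in Python with NumPy. This is only a minimal sketch under stated assumptions, not the course's intended solution: the function names are hypothetical, the score is taken to be the raw projection `X @ w` with no bias term (which is one reading of "threshold is 0"), and the demo uses made-up synthetic points because fld1.xlsx and spam.xlsx are not available here. The real data could be loaded with something like `pandas.read_excel`, though whether the files have a header row is not stated in the question.

```python
import numpy as np


def fisher_direction(X, y):
    """Return (w, labels) with w = Sw^+ (m1 - m0) for a two-class problem.

    Assumption: y holds exactly two distinct labels; labels[0] is treated
    as the "negative" class and labels[1] as the "positive" class.
    """
    labels = np.unique(y)
    if labels.size != 2:
        raise ValueError("Fisher's discriminant here expects exactly two classes")
    X0, X1 = X[y == labels[0]], X[y == labels[1]]
    m0, m1 = X0.mean(axis=0), X1.mean(axis=0)
    # Within-class scatter matrix: sum of the two per-class scatter matrices.
    Sw = (X0 - m0).T @ (X0 - m0) + (X1 - m1).T @ (X1 - m1)
    # The pseudo-inverse keeps this usable if Sw happens to be singular.
    w = np.linalg.pinv(Sw) @ (m1 - m0)
    return w, labels


def classify(X, w, labels, threshold=0.0):
    """Assign labels[1] where the projection X @ w exceeds the threshold."""
    scores = X @ w
    return np.where(scores > threshold, labels[1], labels[0])


def error_rate(y_true, y_pred):
    """Percentage of points whose predicted class differs from the true class."""
    return 100.0 * np.mean(y_true != y_pred)


# Synthetic stand-in for the spreadsheet: two 2-D clusters, 50 points each.
rng = np.random.default_rng(0)
X_demo = np.vstack([
    rng.normal(loc=-2.0, scale=0.5, size=(50, 2)),
    rng.normal(loc=2.0, scale=0.5, size=(50, 2)),
])
y_demo = np.array([0] * 50 + [1] * 50)

w_demo, labels_demo = fisher_direction(X_demo, y_demo)
for t in (0.0, -0.001, -0.01):  # hypothetical thresholds to try, as in Question 2
    pred = classify(X_demo, w_demo, labels_demo, threshold=t)
    print(f"threshold={t:+.3f}  error={error_rate(y_demo, pred):.1f}%")
```

For Question 1(b), the direction `w_demo` is what a scatter plot would draw the line from; for Question 2, the loop over thresholds mirrors the suggested experiment of trying a few small negative values and comparing the overall error rate.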


