Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Dec 02, 2019

(2) The following table consists of training data from an employee database. (2) The following table consists of training data from an employee database. The

(2) The following table consists of training data from an employee database.

(2) The following table consists of training data from an employee database. The data have been generalized. For example, "31... 35" for age represents the age range of 31 to 35. For a given row entry, count represents the number of data tuples having the values for department, status, age, and salary given in that row. salary department status sales sales sales . age 31... 35 senior junior 26. 30 35 junior 31. junior 21. 25 systems 66K... 70K systems senior 31... 35 systems junior 26. 30 46K... 50K systems senior 41. 45 66K... 70K 46K... 50K 10 41K... 45K 4 marketing senior 36. 40 marketing junior 31. 35 secretary senior 46... 50 secretary junior 26... 30 4 36K... 40K 26K... 30K 6 count 30 40 40 46K... 50K 26K... 30K 31K... 35K 46K... 50K 20 5 3 3 Let status be the class label attribute. i. [5 points] How would you modify the basic decision tree algorithm to take into consideration the count of each generalized data tuple (i.e., of each row entry)? ii. [10 points] Use your algorithm to construct a decision tree from the given data. iii. [5 points] Given a data tuple having the values "systems", "26... 30", and "46-50K" for the attributes department, age, and salary, respectively, what would a naive Bayesian classification of the status for the tuple be?

Step by Step Solution

★★★★★

3.42 Rating (158 Votes )

There are 3 Steps involved in it

Step: 1

Run info Assessor wekaattributeSelectionInfoGainAttributeEval Search wekaattributeSelectionRanker T ... blur-text-image

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Practical Business Statistics

Practical Business Statistics

Authors: Andrew Siegel

6th Edition

0123852080, 978-0123852083

More Books

Students explore these related Algorithms questions

Question

Identify ethical dilemmas that are most challenging for leaders and managers. Q. Describe specific ethical dilemmas that exist within organizations. Q. Identify and recommend ethical solutions to...

Answered: 3 weeks ago

Question

The following table consists of training data from an employee database. The following table consists of training data from an employee database. The data have been generalized. For example, "31......

Answered: 3 weeks ago

Question

Table 2.6.4 is an excerpt from a salespersons database of customers. a. What is an elementary unit for this data set? b. What kind of data set is this: univariate, bivariate, or multivariate? c....

Answered: 3 weeks ago

Question

characterize the duplicate constructor utilized in c++ alongside its overall capacity model explaon the different situations which it is called what is the distinction between CSMA/CD/CSMA/CA what...

Answered: 3 weeks ago

Question

At December 31, 2013, Niki Company reviewed the following situations to consider their impact on its 2013 financial statements: 1. In December 2013, Niki became aware of a safety hazard related to...

Answered: 3 weeks ago

Question

Cigarette Nicotine Refer to Data Set 5 in Appendix B and consider the nicotine content of the 29 different cigarette brands. The average (mean) of those amounts is 0.94 mg. Is this result likely to...

Answered: 3 weeks ago

Question

If X has a mixed distribution, then its range contains infinitely many points.

Answered: 3 weeks ago

Question

The following data have been collected for a British health care IT project for two-week reporting periods 2 through 12. Compute the SV, CV, SPI, and CPI for each period. Plot the EV and the AC on...

Answered: 3 weeks ago

Question

2. Activities included (and not included) in the calculation ofGDP The gross domestic product (GDP) of the United States is defined as the all in a given period of time. Based on this definition,...

Answered: 3 weeks ago

Question

Use truth tables to determine whether the following pairs of symbolized statements are logically equivalent, contradictory, consistent, or inconsistent. First, determine whether the pairs of...

Answered: 3 weeks ago

Question

Question 5 Joel Williams follows Sonoco Products Company (NYSE: SON), a manufacturer of paper and plastic packaging for both consumer and industrial use. SON appears to have a dividend policy of...

Answered: 3 weeks ago

Question

Fit the various channels that are used in an IMC strategy into the proper quadrants of the chart below. Group of answer choices A billboard touts the clean bathrooms at the Pilot truck stop. [ Choose

Answered: 3 weeks ago

Question

The customer accounts at a certain departmental store have an average balance of Rs. 480 and a standard deviation of Rs. 160. Assuming that the account balances are normally distributed. (a) What...

Answered: 3 weeks ago

Question

Suppose a Dover store in Madison, Missouri, ended September 2021 with 600,000 units of merchandise that cost $8 each. Suppose the store then sold 110,000 units for $990,000 during October. Further,...

Answered: 3 weeks ago

Question

We have made the system of a production line that works full 24 hours for 7 days a week. We wanted to estimate the average queuing time of parts before a machine and for this reason we built a...

Answered: 3 weeks ago

Question

Margaret Dairy is a CPA and the managing partner of Dairy and Cheese, a regional CPA firm located in northwest Wisconsin. She just left a meeting with a well-respected regional credit union...

Answered: 3 weeks ago

Question

Suppose you are shopping for a new computer system for yourself. a. List the top 5 technical/performance requirements that you want or need and briefly explain why each one is important for you. b....

Answered: 3 weeks ago

Question

a. Why does the Wi-Fi Alliance release compatibility testing profiles in waves instead of combining the entire standards features initially? 27a1.) An 802.11ac Wi-Fi compatibility testing profile...

Answered: 3 weeks ago

Question

Many people do not realize how much a funeral costs and how much these costs can vary from one provider to another. Consider the price of a traditional funeral service with visitation (excluding...

Answered: 3 weeks ago

Question

a. What salary would you expect for a 50-year-old individual? b. Find the 95% confidence interval for a new individual (from the same population from which the data were drawn) who is 50 years old....

Answered: 3 weeks ago

Question

For each of the following, say whether it is stationary or nonstationary: a. Autoregressive process. b. Random walk. c. Moving-average process. d. ARMA process.

Answered: 3 weeks ago

Question

The pie chart summarizes the genres of 193 movies shown in a suburban multiplex theatre in 2012. a) Is this an appropriate display for the genres? Why or why not? b) Which genre was least common?

Answered: 3 weeks ago

Question

The table gives the numbers of passenger car occupants killed in accidents in 2011 by car type. Subcompact and Mini . 1351 Compact . 3789 Intermediate .. 4050 Full . 2627 Unknown . 164 Convert this...

Answered: 3 weeks ago

Question

Twenty-six countries won medals in the 2010 Winter Olympics. The table lists them, along with the total number of medals each won: a) Try to make a display of these data. What problems do you...

Answered: 3 weeks ago

Previous Question Next Question