The following table consists of training data from an employee database. The following table consists of training
Fantastic news! We've Found the answer you've been seeking!
Question:
The following table consists of training data from an employee database.
Transcribed Image Text:
The following table consists of training data from an employee database. The data have been generalized. For example, "31... 35" for age represents the age category with range of 31 to 35. For a given row entry, count represents the number of data tuples having the values for department, status, age, and salary given in that row. department sales sales sales status age senior 31...35 junior 26...30 junior junior senior 31...35 salary 46K...50K count 30 26K... 30K 40 31K...35K 40 20 31...35 21...25 46K... 50K systems systems systems junior 26...30 systems senior 41...45 marketing senior 36...40 marketing junior 31...35 secretary secretary Let status be the class label attribute. 66K... 70K 5 46K...50K 3 66K... 70K 3 46K...50K 10 41K...45K 4 4 senior 46...50 36K... 40K junior 26...30 26K... 30K 6 (a) Construct a decision tree from the given data using information gain. Use R to verify your result and show your code. (b) Given a data tuple having the values "systems", "31...35", and "46K-50K" for the attributes department, age, and salary, respectively, what would a naive Bayesian classification of the status for the tuple be? The following table consists of training data from an employee database. The data have been generalized. For example, "31... 35" for age represents the age category with range of 31 to 35. For a given row entry, count represents the number of data tuples having the values for department, status, age, and salary given in that row. department sales sales sales status age senior 31...35 junior 26...30 junior junior senior 31...35 salary 46K...50K count 30 26K... 30K 40 31K...35K 40 20 31...35 21...25 46K... 50K systems systems systems junior 26...30 systems senior 41...45 marketing senior 36...40 marketing junior 31...35 secretary secretary Let status be the class label attribute. 66K... 70K 5 46K...50K 3 66K... 70K 3 46K...50K 10 41K...45K 4 4 senior 46...50 36K... 40K junior 26...30 26K... 30K 6 (a) Construct a decision tree from the given data using information gain. Use R to verify your result and show your code. (b) Given a data tuple having the values "systems", "31...35", and "46K-50K" for the attributes department, age, and salary, respectively, what would a naive Bayesian classification of the status for the tuple be?
Expert Answer:
Related Book For
Posted Date:
Students also viewed these mathematics questions
-
Identify ethical dilemmas that are most challenging for leaders and managers. Q. Describe specific ethical dilemmas that exist within organizations. Q. Identify and recommend ethical solutions to...
-
(2) The following table consists of training data from an employee database. (2) The following table consists of training data from an employee database. The data have been generalized. For example,...
-
Table 2.6.4 is an excerpt from a salespersons database of customers. a. What is an elementary unit for this data set? b. What kind of data set is this: univariate, bivariate, or multivariate? c....
-
In Exercise solve the given equations and check the results. F-3 12 2 3 || 1 - 3F 2
-
The board of directors of Circuits Plus authorizes the issue of $9,000,000 of 8%, 25-year bonds payable. The semiannual interest dates are May 31 and November 30. The bonds are issued on May 31,...
-
As an example of impure serial correlation caused by an incorrect functional form, lets return to the equation for the percentage of putts made (P i ) as a function of the length of the putt in feet...
-
Show that after nearly all of the positrons were annihilated and the electron number density had nearly leveled off at the proton density, the ratio of the positron number density to the photon...
-
The comparative condensed income statements of Emley Corporation are shown below. Instructions (a) Prepare a horizontal analysis of the income statement data for Emley Corporation using 2014 as a...
-
What sql commands are used in RDBMS? What are the 5 basic SQL queries?
-
Shake Shack Incorporated, which began as a hot dog stand in 2001, now has more than 200 locations worldwide. The following is adapted from Shake Shack's financial statements for the quarter ended...
-
A flask holds a mixture of the 3 main gases in air. The masses of the nitrogen and oxygen in the mixture are given below, what is the partial pressure (in bar) of Ar if the total pressure of the gas...
-
How does working capital impact a firms value?
-
Company A recorded a profit before tax of $2,500,000 for the year ended 31 December 20x3. The tax rate for 20x3 was 24% while that of 20x2 was 22%. Deferred tax liability as at 31 December 20x2 was...
-
What is the difference between direct paper and dealer paper?
-
What is the difference between temporary and permanent working capital?
-
What mechanisms allow hostile acquirers to get around the free rider problem in takeovers?
-
A company recognizes an Advertising payable liability balance of $ 1 , 2 9 0 on its balance sheet dated December 3 1 , 2 0 2 4 . During the 2 0 2 5 fiscal year, the company recognizes Advertising...
-
The purpose of this case is to come up with a contingency plan[s] in order to sustain the program Move With Me, a program that serves thousands of community members throughout Lower Manhattan. The...
-
Continuing with the sample from the preceding exercise: a. Find the population mean for salary. b. Compare this population mean to the sample average for salary. In particular, how many standard...
-
View each column as a collection of independent observations of a random variable. a. In each case, what kind of variable is represented, continuous or discrete? Why? b.* Consider the event annual...
-
Which summary measure(s) may be used on a. Nominal data? b. Ordinal data? c. Quantitative data?
-
The Lorenz curve for Bangladesh looks like this: How much income do individuals in the top income quintile in Bangladesh receive? Cumulative percentage of income 58.7% 37.4 21.3 8.9 20 40 60 80 100%...
-
What would the Lorenz curve for lawyers represent?
-
The accompanying table shows income distribution data for three countries: a. Using this information, draw a Lorenz curve for each country. b. Which country has the most equal distribution of income?...
Study smarter with the SolutionInn App