Question

Posted on Jun 24, 2024

1) What is the type of the following kinds of attributes: (a) age (in years), (b) salary, (c) ZIP code, (e) height, and (f) intensity of rain? Classify them as continuous or discrete, and as qualitative (nominal or ordinal) or quantitative (interval or ratio).

2) An analyst sets up a sensor network in order to measure the temperature of different locations over a time period. What is the type of attributes collected (temperature)? What is the type of the dataset?

3) It is desired to partition customers into similar groups on the basis of their demographic profile.

a. What features could we use? Provide 3 examples. Would you describe such data as heterogeneous?

b. Which data mining problem is best suited to this task?

4) Suppose that you had a set of arbitrary objects, each representing different characteristics of gadgets. A domain expert gave you the similarity value between every pair of objects. How would you convert these objects into a multidimensional data set for clustering the gadgets?

5) Suppose that you had a data set, such that each data point corresponds to sea-surface temperatures over a square mile of resolution 10 × 10. In other words, each data record contains a 10 × 10 grid of temperature values with spatial locations. You also have some text associated with each 10 × 10 grid. How would you convert this data into a multidimensional data set? How many features will each data point have?
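One possible reading of question 5 can be sketched in code. This is a minimal sketch, not a definitive answer: it assumes the grid is 10 × 10, flattens it row by row so that each cell becomes one feature, and appends one term-count feature per word of a vocabulary. The names `record_to_features`, `GRID_SIDE`, and the three-word `vocabulary` are hypothetical and chosen only for illustration.

```python
from collections import Counter

GRID_SIDE = 10  # assumed grid resolution (10 x 10)


def record_to_features(grid, text, vocabulary):
    """Flatten the grid row by row, then append one term count per vocabulary word."""
    if len(grid) != GRID_SIDE or any(len(row) != GRID_SIDE for row in grid):
        raise ValueError("expected a 10 x 10 grid")
    # Each grid cell becomes its own feature; its position in the list
    # encodes its spatial location.
    temperatures = [value for row in grid for value in row]
    # Bag-of-words counts for the attached text (lowercased, whitespace-split).
    counts = Counter(text.lower().split())
    term_counts = [counts[word] for word in vocabulary]
    return temperatures + term_counts


# Hypothetical vocabulary and record, for illustration only.
vocabulary = ["warm", "current", "storm"]
grid = [[15.0 + 0.1 * (r + c) for c in range(GRID_SIDE)] for r in range(GRID_SIDE)]
features = record_to_features(grid, "Warm current near the coast", vocabulary)
print(len(features))  # 100 temperature features + 3 term features = 103
```

Under these assumptions each data point has 100 + d features, where d is the size of the vocabulary used for the text.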

6) Compute the cosine similarity, Jaccard coefficient (if possible, for binary vectors), Euclidean distance, and correlation coefficient for the following vectors x, y:

a. x = (0, -1, 1, 2, -2), y = (0, -2, 2, 4, -4)

b. x = (0, 1, 0, 0, 0), y = (0, 1, 0, 0, 1)

c. x = (-1, -1, -1, -1, -1), y = (1, 1, 1, 1, 1)
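The four measures named in question 6 could be computed along the following lines. This is a sketch using the standard textbook definitions; the function names are illustrative, and returning `None` when a measure is undefined (non-binary input for Jaccard, a zero vector for cosine, zero variance for correlation) is a convention assumed here, not something the question specifies.

```python
import math


def dot(x, y):
    return sum(a * b for a, b in zip(x, y))


def cosine_similarity(x, y):
    """Cosine of the angle between x and y; None if either is the zero vector."""
    nx, ny = math.sqrt(dot(x, x)), math.sqrt(dot(y, y))
    if nx == 0 or ny == 0:
        return None
    return dot(x, y) / (nx * ny)


def euclidean_distance(x, y):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def correlation(x, y):
    """Pearson correlation; None if either vector has zero variance."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cx = [a - mx for a in x]
    cy = [b - my for b in y]
    sx, sy = math.sqrt(dot(cx, cx)), math.sqrt(dot(cy, cy))
    if sx == 0 or sy == 0:
        return None
    return dot(cx, cy) / (sx * sy)


def jaccard(x, y):
    """Jaccard coefficient for 0/1 vectors; None if not binary or no 1s at all."""
    if any(v not in (0, 1) for v in list(x) + list(y)):
        return None
    both = sum(1 for a, b in zip(x, y) if a == 1 and b == 1)
    either = sum(1 for a, b in zip(x, y) if a == 1 or b == 1)
    return both / either if either else None


pairs = {
    "a": ((0, -1, 1, 2, -2), (0, -2, 2, 4, -4)),
    "b": ((0, 1, 0, 0, 0), (0, 1, 0, 0, 1)),
    "c": ((-1, -1, -1, -1, -1), (1, 1, 1, 1, 1)),
}
for label, (x, y) in pairs.items():
    print(label, cosine_similarity(x, y), jaccard(x, y),
          euclidean_distance(x, y), correlation(x, y))
```

With these conventions, only pair b is binary, so it is the only pair with a Jaccard coefficient, and pair c has no defined correlation because both vectors are constant.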

7) Compute the cosine similarity and the Jaccard coefficient between the two sets {A, B, C} and {A, C, D, E}. Hint: how will you represent each set?
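Following the hint in question 7, one way to represent each set is as a binary indicator vector over the union of the elements. The sketch below assumes that representation; the variable names are illustrative only.

```python
import math

s1 = {"A", "B", "C"}
s2 = {"A", "C", "D", "E"}

# One coordinate per element of the union, in a fixed (sorted) order.
universe = sorted(s1 | s2)
v1 = [1 if e in s1 else 0 for e in universe]
v2 = [1 if e in s2 else 0 for e in universe]

# Jaccard coefficient: size of the intersection over size of the union.
jaccard = len(s1 & s2) / len(s1 | s2)

# Cosine similarity of the two indicator vectors.
dot = sum(a * b for a, b in zip(v1, v2))
norm1 = math.sqrt(sum(a * a for a in v1))
norm2 = math.sqrt(sum(b * b for b in v2))
cosine = dot / (norm1 * norm2)

print(universe, v1, v2)
print(jaccard, cosine)
```

For indicator vectors the dot product equals the size of the intersection, so the cosine works out to |S1 ∩ S2| / sqrt(|S1| · |S2|).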

8) Create three documents, A, B, and C such that the Euclidean distance between A and B is smaller than the Euclidean distance between A and C, even though documents A and B have no common words whereas documents A and C have some common words.
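One construction that satisfies question 8 relies on document length: a long document sharing words with A can still be far from A in raw term-count space. The sketch below checks one such triple. The three documents are made up for illustration, and raw term counts are assumed as the document representation.

```python
import math
from collections import Counter

# Hypothetical documents: A and B share no words; C repeats A's words four times.
docs = {
    "A": "data mining",
    "B": "sea surface",
    "C": "data mining data mining data mining data mining",
}

vocabulary = sorted({word for text in docs.values() for word in text.split()})


def term_vector(text):
    """Raw term counts over the shared vocabulary."""
    counts = Counter(text.split())
    return [counts[word] for word in vocabulary]


def euclidean(u, v):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


vectors = {name: term_vector(text) for name, text in docs.items()}
d_ab = euclidean(vectors["A"], vectors["B"])
d_ac = euclidean(vectors["A"], vectors["C"])
print(d_ab, d_ac)  # d_ab is smaller even though A and B share no words
```

The same idea underlies question 9: Euclidean distance on raw counts is sensitive to document length, whereas a length-normalized measure such as cosine is not.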

9) Are the following similarity measures good or bad for finding similarity in document-term data? Provide a one-line justification for each answer you provide.

a. correlation

b. cosine

c. Euclidean
