Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

4. (20 points) Consider the following sample document collection: D1 = (2, 4, 1, 9, 2, 0) D2 = (1, 1, 2, 1, 0,4) D3

image text in transcribed

4. (20 points) Consider the following sample document collection: D1 = (2, 4, 1, 9, 2, 0) D2 = (1, 1, 2, 1, 0,4) D3 = (7, 2, 5, 0, 1, 0) D4 = (0, 1, 2, 6, 1, 2) D5 =(3, 0, 1, 4, 2, 1) D6 = (1,6, 0, 2, 6, 2) D7 = (2, 6, 3, 2, 8, 1) 1.) Using the following similarity calculation expression to calculate the similarities between documents. SIM(DOCK, DOCH) = { TERMik x TERMih i=1 2). Set the threshold to 45 to group the documents into clusters. 3). Calculate the centroid for each group by the following expression: CTERME = 1/m [TERMik i=1 4). Match the given a query Q = (1, 0, 5, 7, 4, 4) to find the documents that is most similar to the query

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

More Books

Students also viewed these Databases questions

Question

Comment should this MNE have a global LGBT policy? Why/ why not?

Answered: 1 week ago