Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Example 14: Assume we have a text collection D of 900 documents from three topics (or three classes), Science, Sports, and Politics. Each class has

image text in transcribed

Example 14: Assume we have a text collection D of 900 documents from three topics (or three classes), Science, Sports, and Politics. Each class has 300 documents. Each document in D is labeled with one of the topics (classes). We use this collection to perform clustering to find three clus- ters. Note that class/topic labels are not used in clustering. After clustering. we want to measure the effectiveness of the clustering algorithm. Cluster Science Sports | Politics 1 250 20 10 Entropy Purity 0.589 0.893 2 20 180 80 1.198 0.643 3 30 100 210 1.257 0.617 Total 300 300 1.031 0.711

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Handbook Of Relational Database Design

Authors: Candace C. Fleming, Barbara Von Halle

1st Edition

0201114348, 978-0201114348

More Books

Students also viewed these Databases questions

Question

Do Question 1 if the death benefit is 1000 instead of 1.

Answered: 1 week ago

Question

Define Management by exception

Answered: 1 week ago

Question

Explain the importance of staffing in business organisations

Answered: 1 week ago

Question

What are the types of forms of communication ?

Answered: 1 week ago

Question

identify current issues relating to equal pay in organisations

Answered: 1 week ago