Question
Assume we have a text collection D of 900 documents from three topics (or three classes), Science, Sports, and Politics. Each class has 300
Assume we have a text collection D of 900 documents from three topics (or three classes), Science, Sports, and Politics. Each class has 300 documents. Each document in D is labeled with one of the topics (classes). We use this collection to perform clustering to find three clus- ters. Note that class/topic labels are not used in clustering. After clustering. we want to measure the effectiveness of the clustering algorithm. Cluster Science Sports Politics Entropy Purity 250 20 10 0.589 0.893 2. 20 180 80 1.198 0.643 3. 30 100 210 1.257 0.617 Total 300 300 300 1.031 0.711
Step by Step Solution
3.57 Rating (164 Votes )
There are 3 Steps involved in it
Step: 1
The entropy of the clusters is measured by comparing them to the cl...Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get StartedRecommended Textbook for
Computer Networking A Top-Down Approach
Authors: James F. Kurose, Keith W. Ross
5th edition
136079679, 978-0136079675
Students also viewed these Computer Network questions
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
Question
Answered: 1 week ago
View Answer in SolutionInn App