Question
Part A: Consider these documents: Doc 1 breakthrough technology for communication Doc 2 new communication technology Doc 3 new approach for dealing with communication Doc
Part A: Consider these documents: Doc 1 breakthrough technology for communication Doc 2 new communication technology Doc 3 new approach for dealing with communication Doc 4 new hopes for persons who deal with communication a- Draw the term-document incidence matrix for this document collection b- Draw the inverted index representation for this collection c- What are the returned results for these queries: 1. communication AND technology 2. for AND NOT(technology OR approach) d- Write a nave algorithm that handles the following query: x AND NOT y. Why such nave algorithm will be expensive? Would a posting merge algorithm be more efficient? Write such merge algorithm Part B: a- Write a java method that constructs the incident matrix of the above four documents. b- Write a java method that uses the incident matrix to answer the queries listed in Part A -c. c- Write a java method that implements the positing merge algorithm that you have developed in part A -d.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started