Question
Consider a collection of 500,000 documents, each containing on average 800 tokens and 6 characters per token.
The number of different terms is estimated to and there are nonpositional posting lists.
a. If fixed bytes are used for terms, bytes for term frequency and bytes for storing the pointer to the postings list, compute the memory usage for the dictionary.
b. What is the minimum number of bits required for document identifiers, and what is the total size of the collection?
c. What is the size of the uncompressed postings list?
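Part b can be worked from the figures stated in the question alone. The following is a minimal sketch, not an official solution: it assumes one byte per character and ignores whitespace and punctuation between tokens, neither of which the question specifies. The function names are hypothetical.

```python
# Values stated in the question.
NUM_DOCS = 500_000
TOKENS_PER_DOC = 800
CHARS_PER_TOKEN = 6

# Assumption (not stated in the question): each character takes one byte.
BYTES_PER_CHAR = 1


def min_doc_id_bits(num_docs: int) -> int:
    """Smallest bit width that gives every document a distinct ID.

    IDs run from 0 to num_docs - 1, so the width is the bit length of
    the largest ID, which equals ceil(log2(num_docs)) for num_docs > 1.
    """
    return max(1, (num_docs - 1).bit_length())


def collection_size_bytes(num_docs: int, tokens_per_doc: int,
                          chars_per_token: int,
                          bytes_per_char: int = BYTES_PER_CHAR) -> int:
    """Approximate raw text size: documents x tokens x characters x bytes."""
    return num_docs * tokens_per_doc * chars_per_token * bytes_per_char


bits = min_doc_id_bits(NUM_DOCS)
size = collection_size_bytes(NUM_DOCS, TOKENS_PER_DOC, CHARS_PER_TOKEN)

print(bits)          # 19, since 2**18 = 262,144 < 500,000 <= 2**19 = 524,288
print(f"{size:,}")   # 2,400,000,000 bytes, roughly 2.4 GB
```

Under these assumptions, document identifiers need 19 bits and the collection holds about 2.4 × 10^9 bytes of text. Parts a and c follow the same pattern (number of terms times bytes per dictionary entry, and number of postings times bytes per document identifier) once the term count, posting count and field widths are known.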