Task 2: Design a Jelinek-Mercer based Language Model (JM_LM) that ranks the documents in
each data collection using the corresponding topic query, for all data collections.
Inputs: the long queries (topics) in theQueries.txt and the corresponding data collections
(one collection DataCx per topic Rx).
Output: ranked document files (e.g., for Query R the output file name is
JMLMRRanking.dat) for all data collections, saved in the folder
RankingOutputs.
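The output convention above could be sketched as follows. This is only an illustration: the helper name write_ranking, the one-line-per-document "doc_id score" layout, and the way the topic id is spliced into the file name are assumptions, since the task statement does not specify the file format.

```python
from pathlib import Path


def write_ranking(topic_id, scores, out_dir="RankingOutputs"):
    """Write one ranked file for a topic and return the path written.

    topic_id: topic label including its "R" prefix (assumed convention).
    scores:   dict mapping a document id to its score (higher is better).
    """
    folder = Path(out_dir)
    folder.mkdir(parents=True, exist_ok=True)
    # Assumed naming pattern: "JMLM" + topic id + "Ranking.dat".
    path = folder / f"JMLM{topic_id}Ranking.dat"
    # Sort documents from highest to lowest score.
    ranked = sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
    with path.open("w", encoding="utf-8") as fh:
        for doc_id, score in ranked:
            # Assumed line layout: "<doc_id> <score>".
            fh.write(f"{doc_id} {score}\n")
    return path
```

Calling write_ranking once per topic leaves one .dat file per data collection in the output folder.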
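For each long query (topic) Rx, use the following equation (the standard Jelinek-Mercer smoothed
query likelihood) to calculate a conditional probability for each document D in the corresponding
data collection (dataset):

$$p(R_x \mid D) = \prod_{i=1}^{n} \left( (1-\lambda)\,\frac{f_{q_i,D}}{|D|} + \lambda\,\frac{c_{q_i}}{|DataC_x|} \right)$$

where $f_{q_i,D}$ is the number of times query word $q_i$ occurs in document D, $|D|$ is the number of
word occurrences in D, $c_{q_i}$ is the number of times query word $q_i$ occurs in the data collection
DataCx, $|DataC_x|$ is the total number of word occurrences in data collection DataCx, and
$\lambda$ is the smoothing parameter.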
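
A minimal sketch of how this scoring could be implemented is shown below. It assumes the usual Jelinek-Mercer form, in which each query word contributes (1 - lambda) * (count in D / length of D) + lambda * (count in collection / length of collection), and it sums logarithms instead of multiplying probabilities to avoid numeric underflow on long queries. The function names jm_score and rank_collection, the pre-tokenised input format, and the choice to put lambda on the collection term are assumptions, not part of the task statement; lambda is passed in as lam because its value is not fixed here.

```python
import math
from collections import Counter


def jm_score(query_terms, doc_tokens, coll_counts, coll_len, lam):
    """Return the log of the Jelinek-Mercer query likelihood for one document."""
    doc_counts = Counter(doc_tokens)
    doc_len = len(doc_tokens)
    log_p = 0.0
    for q in query_terms:
        # Document model: relative frequency of q in D (0 for an empty document).
        p_doc = doc_counts[q] / doc_len if doc_len else 0.0
        # Collection model: relative frequency of q in the whole collection.
        p_coll = coll_counts.get(q, 0) / coll_len if coll_len else 0.0
        p = (1 - lam) * p_doc + lam * p_coll
        if p == 0.0:
            # A factor of zero makes the whole product zero.
            return float("-inf")
        log_p += math.log(p)
    return log_p


def rank_collection(query_terms, docs, lam):
    """Rank documents (dict: doc_id -> list of tokens) by descending score."""
    coll_counts = Counter()
    for tokens in docs.values():
        coll_counts.update(tokens)
    coll_len = sum(coll_counts.values())
    scores = {
        doc_id: jm_score(query_terms, tokens, coll_counts, coll_len, lam)
        for doc_id, tokens in docs.items()
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)
```

Because the logarithm is monotonic, ranking by the log score gives the same order as ranking by the probability itself. Note that a query word that never occurs in the collection drives every document's score to negative infinity; a real solution might drop such words from the query before scoring.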