Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Theme: Understanding Basic Information Retrieval System Task: Index the document collection provided for this assignment, and compare the retrieval performance using different weighting methods. Weighting

image text in transcribedimage text in transcribedimage text in transcribed

Theme: Understanding Basic Information Retrieval System Task: Index the document collection provided for this assignment, and compare the retrieval performance using different weighting methods. Weighting Method: (i) TFIDF, (ii) BM25 You may choose to use any variant of the above two weighting methods. Evaluation Matrix: Precision@k, AveragePrecision@k, MAP@k, Recall@k, AverageRecall@k, MAR@k. You also compare the weighting methods by plotting MAP@k vs MAR@k. The values of k can be considered k=10,20,30, and so on as appropriate. You would need to report the performance of the systems by considering the above two scenarios. (i) evaluate considering only the respective body of the title query. You will have only one relevant document for this setup. Do for all titles. (ii) evaluate considering top 30 retrieved documents with manual annotation. You may do this for at least 5 queries of your choice. Retrieval System: There are many open source retrieval system. You may choose any retrieval system of your choice. Some of such systems are (i)Whoosh, (ii) solr, (iii) elasticsearch. Considering the followup assignments, you may choose Whoosh, but not necessary

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Beginning Apache Cassandra Development

Authors: Vivek Mishra

1st Edition

1484201426, 9781484201428

More Books

Students also viewed these Databases questions