The main difference between retrieval functions of this form comes from their choice of smoothing method applied to the unigram language model $\theta_D$. In general, when we smooth with a collection background language model, we can write this probability as

$$
p(w \mid \theta_D) =
\begin{cases}
p_s(w \mid \theta_D) & \text{if } w \in D \\
\alpha_D \, p(w \mid C) & \text{otherwise}
\end{cases}
$$

where $p_s(w \mid \theta_D)$ is the discounted maximum likelihood estimate of observing word $w$ in document $D$, and $\alpha_D$ is a document-specific coefficient that controls the amount of probability mass assigned to unseen words, ensuring that all of the probabilities sum to one.

Noting that $\log$ is a monotonic transform (thus leading to equivalent results under ranking), and using the above smoothing formulation, we can show the following:

$$
\begin{aligned}
\log p(Q \mid D)
&= \sum_{i=1}^{|Q|} \log p(q_i \mid \theta_D) \\
&= \sum_{w \in V} c(w, Q) \log p(w \mid \theta_D) \\
&= \sum_{w \in D} c(w, Q) \log p_s(w \mid \theta_D) + \sum_{w \notin D} c(w, Q) \log\big(\alpha_D \, p(w \mid C)\big) \\
&= \sum_{w \in D} c(w, Q) \log p_s(w \mid \theta_D) + \sum_{w \in V} c(w, Q) \log\big(\alpha_D \, p(w \mid C)\big) - \sum_{w \in D} c(w, Q) \log\big(\alpha_D \, p(w \mid C)\big) \\
&= \sum_{w \in D} c(w, Q) \log \frac{p_s(w \mid \theta_D)}{\alpha_D \, p(w \mid C)} + |Q| \log \alpha_D + \sum_{w \in V} c(w, Q) \log p(w \mid C) \\
&\overset{\text{rank}}{=} \sum_{w \in D} c(w, Q) \log \frac{p_s(w \mid \theta_D)}{\alpha_D \, p(w \mid C)} + |Q| \log \alpha_D
\end{aligned}
$$

The last step holds under ranking because $\sum_{w \in V} c(w, Q) \log p(w \mid C)$ depends only on the query and the collection, not on the document, so dropping it does not change the document ranking.

a. [5 pts] Show that if we use the query-likelihood scoring method (i.e., $p(Q \mid D)$) and the Jelinek-Mercer smoothing method (i.e., fixed-coefficient interpolation with smoothing parameter $\lambda$) for retrieval, we can rank documents based on the following scoring function:

$$
\mathrm{score}(Q, D) = \sum_{w \in Q \cap D} c(w, Q) \log\left(1 + \frac{(1 - \lambda) \times c(w, D)}{\lambda \times p(w \mid \mathrm{REF}) \times |D|}\right)
$$

where the sum is taken over all the matched query terms in $D$, $|D|$ is the document length, $c(w, D)$ is the count of word $w$ in document $D$ (i.e., how many times $w$ occurs in $D$), $c(w, Q)$ is the count of word $w$ in $Q$, $\lambda$ is the smoothing parameter, and $p(w \mid \mathrm{REF})$ is the probability of word $w$ given by the reference language model estimated using the whole collection.

b. [5 pts] The scoring function above can also be interpreted as a vector space model. If we make this interpretation, what would be the query vector? What would be the document vector? What would be the similarity function? Does the term weight in the document vector capture the TF-IDF weighting and document length normalization heuristics? Why or why not?
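To make the Jelinek-Mercer scoring function in part (a) concrete, here is a minimal Python sketch (not part of the original exercise). The function name `jm_score`, the dict-based reference model `p_ref`, and the toy corpus in the usage lines are illustrative assumptions rather than a prescribed implementation.

```python
import math
from collections import Counter

def jm_score(query_tokens, doc_tokens, p_ref, lam=0.5):
    """Rank-equivalent query-likelihood score with Jelinek-Mercer smoothing:

        score(Q, D) = sum over w in Q ∩ D of
            c(w, Q) * log(1 + (1 - λ) * c(w, D) / (λ * p(w | REF) * |D|))

    p_ref maps each word w to p(w | REF), the reference (collection)
    language model; lam is the smoothing parameter λ, assumed in (0, 1).
    """
    c_q = Counter(query_tokens)   # c(w, Q)
    c_d = Counter(doc_tokens)     # c(w, D)
    doc_len = len(doc_tokens)     # |D|
    score = 0.0
    for w, cwq in c_q.items():
        cwd = c_d.get(w, 0)
        if cwd == 0:              # only matched query terms contribute
            continue
        score += cwq * math.log(1.0 + (1.0 - lam) * cwd / (lam * p_ref[w] * doc_len))
    return score

# Toy usage: estimate p(w | REF) from the whole collection, then rank documents.
docs = [["the", "cat", "sat"], ["the", "dog", "chased", "the", "cat"]]
total = sum(len(d) for d in docs)
p_ref = {w: c / total for w, c in Counter(w for d in docs for w in d).items()}
ranked = sorted(docs, key=lambda d: jm_score(["cat", "dog"], d, p_ref), reverse=True)
```

In a real system, $p(w \mid \mathrm{REF})$ would be estimated once from the full collection's term statistics, and candidate documents would be gathered through an inverted index rather than scored exhaustively as in this toy example.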