Question


You were tasked with the construction of a language model from a corpus of bigrams. The following table shows all occurring bigrams and how often they appear in the corpus. The rows list the first word of the bigram and the columns list the second word of the bigram. The values list the counts of the bigrams (left) and the conditional probabilities (right).

Bigram counts (left side), one row per first word:

Columns: 9 he in least men of short the was
he: 0 0 0 0 0 0 0 127 21
in: 0 0 0 1 1 7 299 0 1
least: 0 2 0 0 0 0 1 0 11
men: 0 0 0 0 0 0 0 0 10
of: 0 0 0 50 1 623 0 1
short: 0 0 0 0 0 0 0 0 9
the: 0 0 19 0 0 0 0 0 0
was: 12 200 0 0 0 36 0 13
9: 148 111 0 2 26 1 247 75 0

Conditional probabilities (right side), one row per first word:

Columns: he in least men of short the was
he: 0 0 0 0 0 0 0 0.858 0.142
in: 0 0 0 0.003 0.003 0.023 0.967 0 0.003
Rows still to be completed: least, men, of, short, the, was, 9

(a) Complete the right side of the table by calculating the conditional probabilities P(w_i | w_{i-1}) of observing a word w_i after observing w_{i-1} for each bigram.
(b) Estimate the probabilities of the unseen bigrams by applying Laplace smoothing. Update the left side of the table with the new bigram counts and recalculate the conditional probabilities based on the updated counts.
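
Both parts reduce to normalizing one row of counts at a time, so a short sketch may help. For part (a), the maximum-likelihood estimate is P(w_i | w_{i-1}) = c(w_{i-1}, w_i) / sum of the counts in row w_{i-1}. For part (b), add-one (Laplace) smoothing gives P(w_i | w_{i-1}) = (c(w_{i-1}, w_i) + 1) / (row sum + V). The function names mle_row and laplace_row are hypothetical, and V is assumed to be 9, the number of columns in the table; only the "he" and "in" rows are used, since those are the rows the question itself fills in.

```python
def mle_row(counts):
    """Part (a): divide each bigram count by the row total."""
    total = sum(counts)
    if total == 0:
        # No observations for this first word: nothing to normalize.
        return [0.0] * len(counts)
    return [c / total for c in counts]


def laplace_row(counts, alpha=1):
    """Part (b): add alpha to every count, then renormalize.

    Assumption: the vocabulary size V equals the number of columns.
    """
    vocab_size = len(counts)
    total = sum(counts) + alpha * vocab_size
    return [(c + alpha) / total for c in counts]


# Count rows copied from the question's table.
he_counts = [0, 0, 0, 0, 0, 0, 0, 127, 21]
in_counts = [0, 0, 0, 1, 1, 7, 299, 0, 1]

for name, row in (("he", he_counts), ("in", in_counts)):
    print(name, "MLE:    ", [round(p, 3) for p in mle_row(row)])
    print(name, "Laplace:", [round(p, 4) for p in laplace_row(row)])
```

For the "he" row this reproduces the given values 127/148 and 21/148 (0.858 and 0.142). After smoothing, every previously unseen bigram in that row gets 1/157 and the row still sums to 1.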
