
Question


Please show all work and explain, as I am trying to learn. (NLP)

A group of monkeys in Nevada learned to talk. A group of scientists studies them and develops a training corpus. Their language has only 3 letters: E, R, T.

This is the corpus the scientists collected: EEERETRERRRTTETRTT

1. Consider training a simple unigram language model using maximum likelihood estimation.

Find all unigrams and state their probabilities as fractions. (Do not add a start or end symbol.)
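One way to check the unigram counts by hand is a short script. This is a minimal sketch, assuming the maximum likelihood estimate is simply count(symbol) / total symbols; the function name `unigram_mle` is made up for illustration.

```python
from collections import Counter
from fractions import Fraction

corpus = "EEERETRERRRTTETRTT"

def unigram_mle(text):
    """Return P(symbol) = count(symbol) / len(text) as exact fractions."""
    counts = Counter(text)
    total = len(text)
    return {sym: Fraction(c, total) for sym, c in counts.items()}

unigram = unigram_mle(corpus)
for sym in sorted(unigram):
    print(sym, unigram[sym])
```

Each of E, R, and T occurs 6 times among the 18 symbols, so every unigram probability reduces to 1/3.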

2. Train a bigram language model using maximum likelihood estimation.

Find all bigrams and state their probabilities as fractions. (Add the end symbol #, but do not add start tokens.)
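The bigram estimates can be sketched the same way. This assumes the usual MLE definition P(b | a) = count(a b) / count(a as a context), with # appended once at the end of the corpus and no start token; `bigram_mle` is a hypothetical helper name.

```python
from collections import Counter
from fractions import Fraction

corpus = "EEERETRERRRTTETRTT"

def bigram_mle(text, end="#"):
    """Return P(b | a) = count(a, b) / count(a as a left context)."""
    seq = text + end                           # append the end symbol only
    pair_counts = Counter(zip(seq, seq[1:]))   # adjacent symbol pairs
    context_counts = Counter(seq[:-1])         # every symbol that has a successor
    return {
        (a, b): Fraction(c, context_counts[a])
        for (a, b), c in pair_counts.items()
    }

bigram = bigram_mle(corpus)
for (a, b), p in sorted(bigram.items()):
    print(f"P({b}|{a}) = {p}")
```

Under these assumptions there are 10 distinct bigrams. All transitions out of E and R have probability 1/3. Out of T, P(R|T) = P(T|T) = 1/3 and P(E|T) = P(#|T) = 1/6. The pair (R, #) never occurs in the training corpus, so it gets no probability mass.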

3. The scientists send their boss to test the data and examine their language model. Here is the boss's data: ERETERR

A) State the perplexity of the unigram language model. (Do not add any tokens to the test data, for the sake of consistency.)

B) State the perplexity of the bigram language model. (Do add an end symbol #, for the sake of consistency.)
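Both perplexities can be checked with a small script. This is a sketch under stated assumptions: perplexity is taken as exp(-(1/N) * sum of log p) over the N scored events, and for the bigram model only the 7 transitions in ERETERR# are scored, because without a start token the first symbol has no context. The helper `perplexity` is a name chosen for illustration.

```python
import math
from collections import Counter

train = "EEERETRERRRTTETRTT"
test = "ERETERR"

def perplexity(probs):
    """exp of the negative mean log-probability; infinite if any event has p = 0."""
    probs = list(probs)
    if any(p == 0 for p in probs):
        return math.inf
    return math.exp(-sum(math.log(p) for p in probs) / len(probs))

# Unigram model: score each of the 7 test symbols, no extra tokens.
uni_counts = Counter(train)
uni_pp = perplexity(uni_counts[s] / len(train) for s in test)

# Bigram model: append # to both training and test data, score each transition.
train_seq, test_seq = train + "#", test + "#"
pair_counts = Counter(zip(train_seq, train_seq[1:]))
ctx_counts = Counter(train_seq[:-1])
bi_pp = perplexity(
    pair_counts[(a, b)] / ctx_counts[a] for a, b in zip(test_seq, test_seq[1:])
)

print("unigram perplexity:", uni_pp)
print("bigram perplexity:", bi_pp)
```

With every unigram probability equal to 1/3, the unigram perplexity comes out to 3 (up to floating-point rounding). The bigram perplexity is infinite under plain MLE: the test data ends in R followed by #, and that pair was never seen in training, so its probability is 0. Scoring the first symbol with a unigram probability instead would not change this result.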

