Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Jul 30, 2024

Q 1 : The following sentence is partially tagged with POS Tags, where bear can be either a VB ( verb ) or a NN

Q

1

: The following sentence is partially tagged with POS Tags, where "bear" can be either a VB

(

verb

)

or a NN

(

noun

)

:

Your

/

PPR$ efforts

/

NN will

/

MD bear

/ ?

fruit

/

NN

.

Which formulas can calculate the most probable tag sequence for "bear"?

1)

P

(

bear

|

NN

) *

P

(

NN

|

VB

) *

P

(

VB

|

NN

)

2)

P

(

bear

|

VB

) *

P

(

VB

|

MD

) *

P

(

NN

|

VB

)

3)

P

(

will

|

MD

) *

P

(

bear

|

VB

) *

P

(

fruit

|

NN

)

4)

P

(

NN

|

NN

) *

P

(

NN

|

MD

) *

P

(

bear

|

NN

)

5)

P

(

bear

|

VB

) *

P

(

VB

|

NN

) *

P

(

VB

|

MD

)

A

. 2, 4

B

. 1, 5

C

. 3, 5

D

. 1, 2

E

. 4, 5

Q

2

: The following sentence is partially tagged with POS Tags, where "race" can be either a VB

(

verb

)

or a NN

(

noun

)

:

Secretariat

/

NNP is

/

VBZ expected

/

VBN to

/

TO race

/ ?

tomorrow

/

NR

.

Which formulas can calculate the most probable tag sequence for "race"?

Q

3

: What is cosine similarity? How to calculate it

?

Q

4

: TF

-

IDF helps to establish how important a particular word is in the context of the document corpus. TF

-

IDF takes into account the number of times the word appears in the document and is offset by the number of documents that appear in the corpus.

TF is the frequency of terms divided by the total number of terms in the document.

IDF is obtained by dividing the total number of documents by the number of documents containing the term and then taking the logarithm of that quotient.

TF

-

IDF is then the multiplication of two values TF and IDF.

Suppose that we have term count tables of a corpus consisting of only two documents, as in the picture

(

table

)

:

Calcuate TF

-

IDFs for the term

example

for Document

1

and Document

2,

respectively.

Q

5

: Describe Yarowsky's

(1995)

technique for word sense disambiguation and illustrate how it would disambiguate the following two senses of "sake":

Sense

1

: sake, interest

(

a reason for wanting something done: "for your sake", "died for the sake of his country"

)

Sense

2

: sake, saki, rice beer

(

Japanese alcoholic beverage made from fermented rice, usually served hot

)

Q

6

: Suppose you want to develop a new approach to summarization that extracts phrases rather than full sentences and puts together the phrases to form a sentence for the summary. Many summarization systems use language models. Please clearly explain your algorithm. You can draw a diagram of the summarization system architecture to help you answer this question.

image text in transcribed

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

OpenStack Trove

OpenStack Trove

Authors: Amrith Kumar, Douglas Shelley

1st Edition

1484212215, 9781484212219

More Books

Students also viewed these Databases questions

Question

★★★★★

a. What is a sampling frame? b. What did Mr. Clar use for a sampling frame in Example 1.10, page 20? c. Where did the number 1288 come from, and how was it used?

Answered: 1 week ago

Question

★★★★★

10 Rickys Record Store now employs five full-time employees and three part-time employees. The normal workload is 40 hours per week for full-time and 20 hours per week for part-time employees. Each...

Answered: 1 week ago

Question

★★★★★

why you want to attend graduate school in general;

Answered: 1 week ago

Question

★★★★★

Let's investigate the possibility of heteroskedasticity in time-series data by looking at a model of the black market for U.S. dollars in Brazil that was studied by R. Dornbusch and C. Pechman. In...

Answered: 1 week ago

Question

★★★★★

Chataqua Can Company manufactures metal cans used in the food-processing industry. A case of cans sells for $25. The variable costs of production for one case of cans are as follows: Direct material...

Answered: 1 week ago

Question

★★★★★

Selected comparative financial statements of Korbin Company follow. 1.Complete the below table to calculate each year's current ratio. 2.Complete the below table to calculate income statement data in...

Answered: 1 week ago

Question

★★★★★

Define digital transformation. How does digital transformation differ between the public and private sectors ? A

Answered: 1 week ago

Question

★★★★★

You plan to implement Teams Rooms systems for 40 meeting spaces with the core features of a Microsoft Teams Meeting Room. How many Microsoft Team Room Pro licenses do you need for this implementation?

Answered: 1 week ago

Question

★★★★★

Indicate how you intend to capture and sustain a healthcare service nicheand justify your preferred competitive service strategy.

Answered: 1 week ago

Question

★★★★★

Outline TWO ( 2 ) Business Decision Models that Build Resilience in the hospitality industry

Answered: 1 week ago

Question

★★★★★

Analytically determine the linear displacement of the piston in the compressor linkage shown in Figure P4.5 as the 45-mm crank is rotated from its current position 90 counterclockwise. 45 mm 30 100...

Answered: 1 week ago

Question

★★★★★

5. Working individually or in groups, develop several specific examples to illustrate how a professor teaching human resource management could use at least four of the techniques described in this...

Answered: 1 week ago

Question

★★★★★

7. What is organizational development and how does it differ from traditional approaches to organizational change?

Answered: 1 week ago

Question

★★★★★

1. Youre the supervisor of a group of employees whose task is to assemble disk drives that go into computers. You find that quality is not what it should be and that many of your groups devices have...

Answered: 1 week ago

Previous Question Next Question