
Question


In this task you need to:
Use the pretrained model 'bert-base-uncased' for BERT encoding.
Ignore the requirement 'A few transformer decoder layers, hidden dimension 768. You need to determine how many layers to use between 1 and 3.' (i.e., do not implement decoder layers).
The task is given below:
Transformer
Implement a simple Transformer neural network that is composed of the following layers:
Use BERT as a feature extractor for each token.
A few transformer encoder layers with hidden dimension 768. You need to determine how many layers to use, between 1 and 3.
A few transformer decoder layers with hidden dimension 768. You need to determine how many layers to use, between 1 and 3.
One hidden layer of size 512.
A final output layer with one cell for binary classification, predicting whether the two inputs are related or not.
Note that each input to this model should be the concatenation of a positive pair (i.e., question + one correct answer) or a negative pair (i.e., question + an unrelated sentence). The format is usually [CLS] + question + [SEP] + a positive/negative sentence.
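For concreteness, the following is a minimal sketch of such a model in PyTorch with the Hugging Face transformers library, assuming the decoder requirement is dropped as instructed above. The class name QARelevanceModel, the choice of 8 attention heads, the ReLU activation, and pooling at the [CLS] position are illustrative assumptions, not fixed by the task; the returned logits are intended to be trained with nn.BCEWithLogitsLoss.

```python
import torch
import torch.nn as nn
from transformers import BertModel


class QARelevanceModel(nn.Module):
    """BERT token features -> transformer encoder layers (dim 768) ->
    hidden layer (512) -> single output cell for binary classification."""

    def __init__(self, num_encoder_layers: int = 1, hidden_dim: int = 768):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        # nhead=8 is an illustrative choice; 768 is divisible by 8.
        layer = nn.TransformerEncoderLayer(d_model=hidden_dim, nhead=8,
                                           batch_first=True)
        self.encoder = nn.TransformerEncoder(layer,
                                             num_layers=num_encoder_layers)
        self.hidden = nn.Linear(hidden_dim, 512)
        self.activation = nn.ReLU()
        self.output = nn.Linear(512, 1)  # one cell: related / not related

    def forward(self, input_ids, attention_mask):
        # Per-token features for the [CLS] question [SEP] sentence pair.
        features = self.bert(input_ids=input_ids,
                             attention_mask=attention_mask).last_hidden_state
        # True where a position is padding, so the encoder ignores it.
        padding_mask = attention_mask == 0
        encoded = self.encoder(features, src_key_padding_mask=padding_mask)
        pooled = encoded[:, 0, :]  # [CLS] position as a summary of the pair
        logits = self.output(self.activation(self.hidden(pooled)))
        return logits.squeeze(-1)  # pair with nn.BCEWithLogitsLoss
```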
Train the model on the training data, use the dev_test set to determine a good number of transformer layers, and report the final results on the test set. Again, remember to use the test set only after you have determined the optimal number of transformer layers.
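One way to organize this selection, sketched below, is to compare dev_test F1 for 1, 2 and 3 encoder layers and only then evaluate the winning configuration on the test set. The helpers train_fn(num_layers) and eval_fn(model, split) are hypothetical and not part of the task; scikit-learn's f1_score is used for the metric.

```python
from sklearn.metrics import f1_score


def select_num_layers(train_fn, eval_fn, candidate_layers=(1, 2, 3)):
    """train_fn(num_layers) -> trained model; eval_fn(model, split) ->
    (y_true, y_pred). Both are hypothetical helpers, not defined here."""
    best_f1, best_layers, best_model = -1.0, None, None
    for num_layers in candidate_layers:
        model = train_fn(num_layers)
        y_true, y_pred = eval_fn(model, "dev_test")
        dev_f1 = f1_score(y_true, y_pred)
        print(f"{num_layers} encoder layer(s): dev F1 = {dev_f1:.4f}")
        if dev_f1 > best_f1:
            best_f1, best_layers, best_model = dev_f1, num_layers, model
    # The test split is touched only once, after the layer count is fixed.
    y_true, y_pred = eval_fn(best_model, "test")
    return best_layers, f1_score(y_true, y_pred)
```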
Based on your experiments, comment on whether this system is better than the systems developed in the previous tasks.
NECESSARY STEPS:
The model has the correct layers, the correct activation functions, and the correct loss function.
The code passes the sentence text to the model correctly. The documentation needs to explain how to handle length differences for a batch of data (see the batching sketch after this list).
The code returns the IDs of the n sentences that have the highest prediction score for the given question (see the ranking helper in the same sketch).
The notebook reports the F1 score on the test set and comments on the results.
Good coding and documentation are required in this task. In particular, the code and results must include evidence that supports your choice of the best number of transformer layers. The explanations must be clear and concise. To make this task less time-consuming, use n = 1.
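As referenced in the list above, here is a minimal sketch of the batching and ranking code, assuming the QARelevanceModel class from the earlier sketch. encode_pairs handles length differences within a batch by letting the tokenizer pad to the longest sequence and return an attention mask that marks the real tokens; top_n_sentence_ids scores every (question, candidate) pair and returns the IDs of the n best candidates (n = 1 in this task). The (sentence_id, sentence_text) candidate format is an assumption.

```python
import torch
from transformers import BertTokenizerFast

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")


def encode_pairs(questions, sentences, max_length=256):
    # Builds [CLS] question [SEP] sentence [SEP] for each pair; shorter
    # sequences in the batch are padded to the longest one, and the
    # returned attention_mask records which positions are real tokens.
    # This is how length differences within a batch are handled.
    return tokenizer(questions, sentences, padding=True, truncation=True,
                     max_length=max_length, return_tensors="pt")


@torch.no_grad()
def top_n_sentence_ids(model, question, candidates, n=1, device="cpu"):
    """Return the IDs of the n candidate sentences with the highest
    predicted relevance to the question. `candidates` is assumed to be
    a list of (sentence_id, sentence_text) tuples."""
    model.eval()
    batch = encode_pairs([question] * len(candidates),
                         [text for _, text in candidates])
    batch = {key: value.to(device) for key, value in batch.items()}
    scores = torch.sigmoid(model(batch["input_ids"],
                                 batch["attention_mask"]))
    best = torch.topk(scores, k=min(n, len(candidates))).indices.tolist()
    return [candidates[i][0] for i in best]
```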

