Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 11, 2024

In this task you need to: . Use the pretrained model 'bert - base - uncased' for BERT encoding. . Ignore the requirement A few

In this task you need to:

.

Use the pretrained model 'bert

-

base

-

uncased' for BERT encoding.

.

Ignore the requirement

A few of transformer decoder layers, hidden dimension

768 .

You need to determine how many layers to use between

1

~

3 .

The task is given below:

Transformer

Implement a simple Transformer neural network that is composed of the following layers:

Use BERT as feature extractor for each token.

A few of transformer encoder layers, hidden dimension

768 .

You need to determine how many layers to use between

1

~

3 .

A few of transformer decoder layers, hidden dimension

768 .

You need to determine how many layers to use between

1

~

3 .

1

hidden layer with size

512 .

The final output layer with one cell for binary classification to predict whether two inputs are related or not.

Note that each input for this model should be a concatenation of a positive pair

(

i

.

e

.

question

+

one answer

)

or a negative pair

(

i

.

e

.

question

+

not related sentence

) .

The format is usually like

[

CLS

] +

question

+ [

SEP

] +

a positive

/

negative sentence.

Train the model with the training data, use the dev

_

test set to determine a good size of the transformer layers, and report the final results using the test set. Again, remember to use the test set only after you have determined the optimal parameters of the transformer layers.

Based on your experiments, comment on whether this system is better than the systems developed in the previous tasks.

NECESSARY STEPS:

The model has the correct layers, the correct activation functions, and the correct loss function.

The code passes the sentence text to the model correctly. The documentation needs to explain how to handle length difference for a batch of data

The code returns the IDs of the n sentences that have the highest prediction score in the given question.

The notebook reports the F

1

scores of the test sets and comments on the results.

For good coding and documentation in this task. In particular, the code and results must include evidence that shows your choice of best size of the transformer layers. The explanations must be clear and concise. To make this task less time

-

consuming, use n

= 1 .

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image_2

Step: 3

blur-text-image_3

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Professional IPhone And IPad Database Application Programming

Professional IPhone And IPad Database Application Programming

Authors: Patrick Alessi

1st Edition

0470636173, 978-0470636176

More Books

Students also viewed these Databases questions

Question

★★★★★

Continue with the facts of Question 10. Another large shipment was made in May to a customer in South Dakota, a state that does not impose any corporate income tax. Is this sale to be included in...

Answered: 1 week ago

Question

★★★★★

6. Consider the accompanying observations on stream flow (1000s of acre-feet) recorded at a station in Colorado for the period April 1August 31 over a 31-year span (from an article in the 1974 volume...

Answered: 1 week ago

Question

★★★★★

3. What steps can you take now to enhance your life by applying some of this new knowledge to your current situation?

Answered: 1 week ago

Question

★★★★★

Financial ratio analysis is conducted by four groups of analysts: managers, equity investors, long-term creditors, and short-term creditors. What is the primary emphasis of each of these groups in...

Answered: 1 week ago

Question

★★★★★

In this task you need to: . Use the pretrained model 'bert - base - uncased' for BERT encoding. . Ignore the requirement A few of transformer decoder layers, hidden dimension 7 6 8 . You need to...

Answered: 1 week ago

Question

★★★★★

What did Lewis Wolpert mean when he stated that reliable scientific knowledge is value-free and has no moral or ethical value (p. 1254)? Why is the conflation of science and technology a serious...

Answered: 1 week ago

Question

★★★★★

Icebreaker Company (a U.S.-based company) purchases materials from a foreign supplier on December 1, 2020, with payment of 22,000 dinars to be made on March 1, 2021. The materials are consumed...

Answered: 1 week ago

Question

★★★★★

solve all parts. C1 Anne and Bill like strawberries x and chocolate y. Anne starts with an endowment of X kilos outof strawberries and Bill starts with an endowment of Y kilos of chocolate. Anne and...

Answered: 1 week ago

Question

★★★★★

Smith Enterprises recently was profiled on a financial information website and touted as a "hot" growth stock. You acquired the stock quote shown here from that website. Smith Enterprises (DAX: SME)...

Answered: 1 week ago

Question

★★★★★

A knife found buried 7 inches deep in the ground would be marked on the evidence collection site map as _____. a. -7 b. 7 c. 7 BG d. UG 7

Answered: 1 week ago

Question

★★★★★

Week 14 Homework i 5 points 1 Saved Andretti Company has a single product called a Dak. The company normally produces and sells 83,000 Daks each year at a selling price of $60 per unit. The company's...

Answered: 1 week ago

Question

★★★★★

Technology. The owner of your company recently purchased two pairs of season tickets for the local symphony orchestra concerts. He will retain one pair of tickets but make the other available to...

Answered: 1 week ago

Question

★★★★★

On July 1, you relocated your business from one Detroit suburb to another. A month prior to your move, you phoned your security system provider and notified its representative to cancel your service...

Answered: 1 week ago

Question

★★★★★

Technology. Brenda Durwood, one of your employees and president of the Woodland Ski and Snowboard Club,has requested permission to hold the clubs annual Gear Swap fund-raiser in the company parking...

Answered: 1 week ago

Previous Question Next Question