Answered step by step
Verified Expert Solution
Link Copied!

Question

1 Approved Answer

Assume max seq length = 1 0 , there are multiple samples, such as [ 1 , 1 , 1 , 1 ] length of

Assume max seq length =10, there are multiple samples, such as
[1,1,1,1] length of 4
[2,2,2,2,2]length of 5
During actual pretraining, the two will be spliced together as one sample for training, for example:
[1,1,1,1,EOS_token,2,2,2,2,2,EOS_token] In this way, for sample 2, it can actually see the token of sample 1. Is there any reason for this?

Step by Step Solution

There are 3 Steps involved in it

Step: 1

blur-text-image

Get Instant Access to Expert-Tailored Solutions

See step-by-step solutions with expert insights and AI powered tools for academic success

Step: 2

blur-text-image

Step: 3

blur-text-image

Ace Your Homework with AI

Get the answers you need in no time with our AI-driven, step-by-step assistance

Get Started

Recommended Textbook for

Machine Learning And Knowledge Discovery In Databases European Conference Ecml Pkdd 2015 Porto Portugal September 7 11 2015 Proceedings Part 1 Lnai 9284

Authors: Annalisa Appice ,Pedro Pereira Rodrigues ,Vitor Santos Costa ,Carlos Soares ,Joao Gama ,Alipio Jorge

1st Edition

3319235273, 978-3319235271

More Books

Students also viewed these Databases questions