Answered step by step
Verified Expert Solution
Question
1 Approved Answer
1 2 . In ISTM, the two gates that are responsible to update the cell state are and _ gates. ( A input, outputB )
In ISTM, the two gates that are responsible to update the cell state are and gates.A input, outputB input, forget C forget, outputDNone of theseIn LSTM the gates isare used to update the shortterm memoryA inputB forgetC outputD All of theseWhich of the following are the advantage ofTransformers over Recurrent sequence models?Faster to train and run on modern hardwareB Better at learning shortrange dependenciesC Require many fewer parameters to achievesimilar resultsAll of theseThe attention mechanism is a way ofAdetermining the similarity between two sentencesB identifying the topic of a sentence c predicting the next word in a sentenceDgiving the importance of each word in a sentence compared to othersTo prevent the decoder in transformer from looking at future tokens, we add A I lookahead mask B context vectorsC softmax layerD All of theseIn transformer encoder, the attention weights are he softmax output of scaled dot product of nd value, query key, value query, keyIn ViT, if the patch size is xx the vectorization of each patch has the dimensionAxB xcXx Xavier Initialization isA only used in fully connected neural networks.B a scaling factor to the mean of the randomweights.C designed to work well with ReLU. used to make the variance of the activationsthe same across every layerWhat is the purpose of dropout regularization in deep learning?A To reduce overfittingB To increase the model's capacityC To improve the training speedD To handle imbalanced datasetsWhich optimizer is based on both momentum and adaptive learning?A RMSPropB AdamC SGD momentumD AdaGradWhich optimizer has a problem of continually decaying of adaptive learning rates?A AdaGradB RMSPropG AdamD SGD momentumConsider a GAN model which successfully produ images of apples. Which of the following statements is false?The generator aims to learn the distributic apple images.BThe discriminator can be used to classify images as apple vs nonapple.C After training the GAN, the discriminator eventually reaches a constant value.D The generator can produce unseen imagapples.
Step by Step Solution
There are 3 Steps involved in it
Step: 1
Get Instant Access to Expert-Tailored Solutions
See step-by-step solutions with expert insights and AI powered tools for academic success
Step: 2
Step: 3
Ace Your Homework with AI
Get the answers you need in no time with our AI-driven, step-by-step assistance
Get Started