[Solved] Q1) Which two of the accompanying portray

Answered step by step

Verified Expert Solution

Link Copied!

Question

1 Approved Answer

Posted on Sep 21, 2024

Q1) Which two of the accompanying portray predisposition difference compromise among MC and TD? A) The MC calculation decreases fluctuation by testing until the terminal

Q1) Which two of the accompanying portray predisposition difference compromise among MC and TD?

A) The MC calculation decreases fluctuation by testing until the terminal state, prompting higher predisposition.

B) The MC calculation diminishes predisposition by testing until the terminal state, prompting higher fluctuation.

C) The TD calculation diminishes change by testing few time steps, prompting higher predisposition.

D) The TD calculation decreases predisposition by testing few a period steps, prompting higher difference.

Question 2) What is the contrast between on-arrangement and off-strategy learning?

A)On-strategy learning learns by assessing the consequences of a conduct strategy to perform strategy enhancement for an objective approach, though off-arrangement gains as a matter of fact by assessing an objective approach and performing strategy enhancement for the objective strategy.

b) On-strategy taking in gains for a fact by assessing an objective approach and performing strategy enhancement for the objective arrangement, though off-arrangement learning learns by assessing the aftereffects of a conduct strategy to perform strategy enhancement for an objective arrangement.

C) On-approach taking in gains for a fact by assessing an objective arrangement and performing strategy enhancement for the objective approach, though off-approach learning learns by assessing the objective strategy to perform strategy enhancement for a conduct strategy.

D) On-strategy taking in gains for a fact by assessing a conduct strategy and performing strategy enhancement for the objective arrangement, though off-approach learning learns by assessing the consequences of a conduct strategy to perform strategy enhancement for the conduct strategy.

Question 3) Which two proclamations depict qualification follows?

A) Eligibility follows down weight the commitment of states that are infrequently visited to registering normal Vs) or Q(s,a).

B) Eligibility follows empower further investigation of the state space.

C) Eligibility follows dole out credit to activity.

D) Eligibility follows dole out credit to both the most every now and again visited and last visited states.

Problem I. Let X and Y be random variables having a bivariate normal distri- bution with E(X) = E(Y) = 0, Var(X) = Var(Y ) = 1 and Cov(X, Y ) = p. Derive E[max (X, Y )] and derive Elmin (X, Y)]. n2. Let bivariate continuous random variables ) and X, have the following joint probability density function. fx x, (x , x2 ) = 3x, , for OS x, Six, $1 (a) Find the marginal probability density functions of X, and X, (b) Find P X, S-, X2 S (c) Find P X, SIX, S NI- (d) Find the conditional probability density function of X, given X, = X2 .\fAnswer the following questions: a. Let X1, X2, X3 be i.i.d. random variables N(0, 1). Show that Y1 = X1 + 8X, and Y2 = X2 + 8X3 have bivariate normal distribution. Find the value of d so that the correlation coefficient between Y1 and Y2 is p = 2. b. Let X and Y follow the bivariate normal distribution with parameters /1, /2, 01, 02, p. Show that W = X - #1 and Q = (Y - M2) - po? (X - 1) are independent normal random variables