Question: Suppose we choose our feature vector to be = [ , ] x = [ n white , n black ] , the number of

Suppose we choose our feature vector to be
=
[
,
]
x=[n
white
,n
black
], the number of white and black pieces on the board, respectively and we design our 1-layer neural network
(
;
)
=
(
)
f(x;\theta )=\sigma (z), where
=
0
+
1
+
z=w
0
n
white
+w
1
n
black
+b,
=
[
0
,
1
,
]
\theta =[w
0
,w
1
,b] and
\sigma is the sigmoid function.
Screenshot_2023-04-17_at_7.09.47_PM.png
We preprocess the data and organize it by features. The organized data comes in the form of
k tuples:
(
,
,
)
(x
i
,n
i
,m
i
) for
=
1
,
...
,
i=1,...,k, where
x
i
is the feature vector,
n
i
is the number of games played starting from states with those features, and
m
i
is the number of games won of those
n
i
games.
We decide that a reasonable utility function should be the probability of success and the output of our neural network will be the the probability
p of winning each game starting from the given state.
Our goal is to learn the parameters
\theta of
=
(
;
)
p=f(x;\theta ) which outputs the most likely
p for a given feature vector
x, given our training data.
What is the log likelihood function that we are trying to maximize? Keep your answer in terms of
,
,
,
(
;
)
n
i
,m
i
,x
i
,f(x
i
;\theta ). Hint: You may want to use rules for logs to expand out the log-likelihood to make the next part easier.

Step by Step Solution

There are 3 Steps involved in it

1 Expert Approved Answer
Step: 1 Unlock blur-text-image
Question Has Been Solved by an Expert!

Get step-by-step solutions from verified subject matter experts

Step: 2 Unlock
Step: 3 Unlock

Students Have Also Explored These Related Accounting Questions!