Consider the application of EM to learn the parameters for the network in Figure 20.1 O(a), given
Question:
Consider the application of EM to learn the parameters for the network in Figure 20.1 O(a), given the true parameters in Equation (20.7).
a. Explain why the EM algorithm would not work if there were just two attributes in the model rather than three.
b. Show the calculations for the first iteration of EM starting.from Equation (20.8).
c. What happens if we start with all the parameters set lo the same value p? (Hint: you may find it helpful to investigate this empirically before deriving the general result.)
d. Write out an expression for the log likelihood of the tabulated candy data on page 729 in terms of the parameters, calculate the partial derivatives with respect to each parameter, and investigate the nature of the fixed point reached in part (c).
Step by Step Answer:
Artificial Intelligence: A Modern Approach
ISBN: 9780137903955
2nd Edition
Authors: Stuart Russell, Peter Norvig