Given a set of training samples DN = x1, x2, , xN , the so-called empirical distribution
Question:
Given a set of training samples DN =
x1, x2, , xN
, the so-called empirical distribution corresponding to DN is defined as follows:
where ¹º denotes Dirac’s delta function. Show that the MLE is equivalent to minimizing the Kullback–
Leibler (KL) divergence between the empirical distribution and the data distribution described by a generative model pˆ¹xº:
Fantastic news! We've Found the answer you've been seeking!
Step by Step Answer:
Related Book For
Machine Learning Fundamentals A Concise Introduction
ISBN: 9781108940023
1st Edition
Authors: Hui Jiang
Question Posted: