Next: Derivation of the Baum-Welch
Up: Bayesian Learning 2: EM
Previous: Revision of Lagrange Multipliers
The EM Theorem:
Let X and Y be random variables with distributions P(X) and P(Y)
respectively under some model
.
Let P'(X) and P'(Y) be their
corresponding distributions under a different model
.
Then
![\begin{displaymath}
\left[ \sum\limits_X P(X\vert Y)\log \frac{P'(X,Y)}{P(X,Y)} > 0 \right]
\Rightarrow P'(Y) > P(Y)
\end{displaymath}](img215.gif) |
(15) |
Interpretation: If we find a model
for which the first
inequality holds, then the observed data Y will be more probable under
than under
.
Proof.
Since
,
where the last step follows from Jensen's inequality proved earlier.
If this quantity is at least zero, then so is
and consequently
P'(Y) - P(Y). Q.E.D.
Anand Venkataraman
1999-09-16