Fitting a mixture of Gaussians
.95
.05
.05
.95
    The EM algorithm alternates
between two steps:
E-step: Compute the posterior
probability that each Gaussian
generates each datapoint.
M-step: Assuming that the data
really was generated this way,
change the parameters of
each Gaussian to maximize
the probability that it would
generate the data it is
currently responsible for.
.5
.5
.5
.5