Computing the new mixing proportions
Each Gaussian gets a certain amount of posterior probability for each datapoint.
The optimal mixing proportion to use (given these posterior probabilities) is just the fraction of the data that the Gaussian gets responsibility for.
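$$\pi_i^{\text{new}} = \frac{1}{N} \sum_{c=1}^{N} p(i \mid \mathbf{x}^{c})$$

where $p(i \mid \mathbf{x}^{c})$ is the posterior for Gaussian $i$, $\mathbf{x}^{c}$ is the data for training case $c$, and $N$ is the number of training cases.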
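
As a rough illustration of this update, the sketch below averages each Gaussian's posterior probabilities over the training cases. It assumes the posteriors have already been computed and are stored one row per training case; the function name `update_mixing_proportions` and the example numbers are hypothetical, chosen only for illustration.

```python
def update_mixing_proportions(responsibilities):
    """Return the new mixing proportion for each Gaussian.

    responsibilities[c][i] is assumed to hold the posterior probability
    of Gaussian i for training case c, so each row sums to 1.
    """
    n_cases = len(responsibilities)
    n_gaussians = len(responsibilities[0])
    # Fraction of the data that each Gaussian takes responsibility for.
    return [
        sum(row[i] for row in responsibilities) / n_cases
        for i in range(n_gaussians)
    ]


# Hypothetical posteriors for 4 training cases and 2 Gaussians.
posteriors = [
    [0.9, 0.1],
    [0.8, 0.2],
    [0.3, 0.7],
    [0.4, 0.6],
]
new_pi = update_mixing_proportions(posteriors)
print(new_pi)  # approximately [0.6, 0.4]
```

Because each row of posteriors sums to 1, the resulting mixing proportions also sum to 1 without any extra normalization.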