How many components does a mixture
need?
Suppose we want the state of the latent
variables to impose about 330 bits of constraint
on the visible variables.
In a mixture, the latent state consists of a
choice of ONE of the components of the
mixture.
A choice among K components conveys at most log2(K) bits,
so we need 2^330 (roughly 10^99) components.
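The count follows from the fact that picking one of K alternatives carries at most log2(K) bits, so 330 bits of constraint needs K = 2^330. A minimal sketch of that arithmetic is below; the function names `components_needed` and `bits_from_components` are illustrative, not from the lecture.

```python
import math


def components_needed(bits: int) -> int:
    """Number of mixture components K whose single choice carries `bits` bits.

    Choosing one of K components conveys at most log2(K) bits,
    so K = 2**bits.
    """
    return 2 ** bits


def bits_from_components(k: int) -> float:
    """Maximum bits of constraint a choice among k components can impose."""
    return math.log2(k)


k = components_needed(330)
print(len(str(k)))              # number of decimal digits in 2**330
print(bits_from_components(k))  # recovers the bit count
```

Python integers have arbitrary precision, so 2**330 is computed exactly; it has 100 decimal digits, which is why the count is usually quoted as roughly 10^99.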
In a later lecture we will see how to fit mixture
models with this many components in a few
minutes.
This involves a lot of parameter sharing!