How to combine simple density models
mixing
proportion
Suppose we want to build a model of a
complicated data distribution by combining
several simple models. What combination
rule should we use?
Mixture models take a weighted sum of the
distributions
Easy to learn
The combination is always vaguer than
the individual distributions.
Products of Experts multiply the distributions
together and renormalize.
The product is much sharper than the
individual distributions.
A nasty normalization term  is needed to
convert the product of the individual
densities into a combined density.