Another use of contrastive divergence
Contrastive divergence (CD) is an efficient way
to learn Restricted Boltzmann Machines.
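
As a reminder of what CD does in the RBM case, here is a minimal sketch of a single CD-1 update for a small binary RBM, written in plain Python. The function names (`cd1_update`, `hidden_probs`, and so on), the list-of-lists weight layout, and the learning rate are assumptions made for illustration; they do not come from the slides. The sketch uses hidden probabilities rather than sampled hidden states in the statistics, which is one common choice among several.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def sample_bernoulli(probs, rng):
    # Draw a binary state for each unit from its "on" probability.
    return [1.0 if rng.random() < p else 0.0 for p in probs]


def hidden_probs(v, W, b_hid):
    # p(h_j = 1 | v) = sigmoid(b_j + sum_i v_i * W[i][j])
    return [sigmoid(b_hid[j] + sum(v[i] * W[i][j] for i in range(len(v))))
            for j in range(len(b_hid))]


def visible_probs(h, W, b_vis):
    # p(v_i = 1 | h) = sigmoid(a_i + sum_j h_j * W[i][j])
    return [sigmoid(b_vis[i] + sum(h[j] * W[i][j] for j in range(len(h))))
            for i in range(len(b_vis))]


def cd1_update(v0, W, b_vis, b_hid, lr, rng):
    """Apply one CD-1 update in place for a single binary vector v0.

    Returns the reconstruction probabilities for the visible units.
    """
    # Positive phase: hidden units driven by the data.
    ph0 = hidden_probs(v0, W, b_hid)
    h0 = sample_bernoulli(ph0, rng)

    # Negative phase: one step of alternating Gibbs sampling.
    pv1 = visible_probs(h0, W, b_vis)
    v1 = sample_bernoulli(pv1, rng)
    ph1 = hidden_probs(v1, W, b_hid)

    # Move parameters toward the data statistics and away from
    # the reconstruction statistics.
    for i in range(len(v0)):
        for j in range(len(ph0)):
            W[i][j] += lr * (v0[i] * ph0[j] - v1[i] * ph1[j])
    for i in range(len(v0)):
        b_vis[i] += lr * (v0[i] - v1[i])
    for j in range(len(ph0)):
        b_hid[j] += lr * (ph0[j] - ph1[j])
    return pv1


if __name__ == "__main__":
    rng = random.Random(0)
    n_vis, n_hid = 4, 2
    W = [[0.0] * n_hid for _ in range(n_vis)]
    b_vis, b_hid = [0.0] * n_vis, [0.0] * n_hid
    data = [[1.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 1.0]]
    for _ in range(200):
        for v in data:
            cd1_update(v, W, b_vis, b_hid, 0.1, rng)
    print(visible_probs(hidden_probs(data[0], W, b_hid), W, b_vis))
```

The only model-specific parts are the two conditional distributions and the statistics in the update, so the same "data minus reconstruction" pattern is what carries over when the energy function changes.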
But it can also be used to learn other types
of energy-based models that have multiple
hidden layers.
Methods very similar to CD have been used for
learning non-probabilistic energy-based models
(LeCun, Hertzmann).