Two types of density model with hidden units
      Stochastic generative model
using directed acyclic graph
(e.g. Bayes Net)
Generation from model is easy
Inference is generally hard
Learning is easy after inference
    Energy-based models that
associate an energy with each
joint configuration
Generation from model is hard
Inference is generally hard
Learning requires a negative
phase that is even harder
than inference
This comparison looks bad for energy-based models