Backpropagation can compute the gradient
that Hybrid Monte Carlo needs
1.
Do a forward pass
computing hidden
activities.
2.
Do a backward pass all
the way to the data to
compute the derivative
of the global energy w.r.t
each component of the
data vector.
works with any smooth
non-linearity
E
k
k
E
j
j
data