Backpropagation can compute the gradient that Hybrid Monte Carlo needs
1. Do a forward pass to compute the hidden activities.
2. Do a backward pass all the way to the data to compute the derivative of the global energy w.r.t. each component of the data vector.
This works with any smooth non-linearity.
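The two passes can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the lecture's own model: the two hidden layers are called j and k, the non-linearity is tanh, and each hidden unit is assumed to contribute an energy of half its squared activity. The function names, weights, and data vector are all hypothetical.

```python
import math

def forward(x, W1, W2):
    """Forward pass: hidden activities and the global energy."""
    # Layer j: tanh of weighted sums of the data (any smooth non-linearity would do).
    yj = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W1]
    # Layer k: tanh of weighted sums of layer j.
    yk = [math.tanh(sum(w * yi for w, yi in zip(row, yj))) for row in W2]
    # ASSUMPTION: each hidden unit contributes 0.5 * activity^2 to the global energy.
    energy = 0.5 * sum(y * y for y in yj) + 0.5 * sum(y * y for y in yk)
    return yj, yk, energy

def energy_grad_wrt_data(x, W1, W2):
    """Backward pass all the way to the data: returns (E, dE/dx)."""
    yj, yk, energy = forward(x, W1, W2)
    # dE/dz_k, using dE/dy_k = y_k and tanh'(z) = 1 - tanh(z)^2.
    dz_k = [y * (1.0 - y * y) for y in yk]
    # dE/dy_j: the unit's own energy term plus what flows back from layer k.
    dy_j = [yj[j] + sum(W2[k][j] * dz_k[k] for k in range(len(yk)))
            for j in range(len(yj))]
    dz_j = [d * (1.0 - y * y) for d, y in zip(dy_j, yj)]
    # Final step: derivative w.r.t. each component of the data vector.
    dx = [sum(W1[j][i] * dz_j[j] for j in range(len(yj)))
          for i in range(len(x))]
    return energy, dx

# Hypothetical weights and data vector, chosen only for illustration.
W1 = [[0.2, -0.4, 0.1], [0.7, 0.3, -0.5]]   # data (3) -> layer j (2)
W2 = [[0.6, -0.2], [-0.3, 0.8]]             # layer j (2) -> layer k (2)
x = [0.5, -1.0, 0.25]

energy, grad = energy_grad_wrt_data(x, W1, W2)
```

The vector `grad` is the derivative of the energy with respect to the data, which is the quantity a Hybrid Monte Carlo sampler would use when updating the momentum of a simulated particle moving over the data space.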
[Figure: network diagram with layers labeled, from top to bottom, k (energy Ek), j (energy Ej), and data.]