Back-fitting
After we have learned all the layers greedily, the
weights in the lower layers will no longer be
optimal, because each layer was learned without
regard to the layers above it.
The weights in the lower layers can be fine-tuned
in several ways.
For the generative model that comes next, the
fine-tuning involves a complicated and slow
stochastic learning procedure.
If our ultimate goal is discrimination, we can
use backpropagation for the fine-tuning.
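
The discriminative case can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the layer sizes, the toy OR data, the learning rate, and the helper names (rand_matrix, forward, mean_loss, backprop_step) are hypothetical, and the "greedily learned" weights are random placeholders standing in for weights that a real layer-wise learning pass would supply. The point is only that backpropagation adjusts the lower layers as well as the new output layer.

```python
import math
import random

random.seed(0)

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def rand_matrix(rows, cols):
    return [[random.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def forward(weights, x):
    """Return the activations of every layer, starting with the input."""
    acts = [x]
    for W in weights:
        prev = acts[-1]
        acts.append([sigmoid(sum(w * a for w, a in zip(row, prev))) for row in W])
    return acts

def mean_loss(weights, data):
    """Mean cross-entropy of the single sigmoid output unit."""
    total = 0.0
    for x, t in data:
        y = forward(weights, x)[-1][0]
        y = min(max(y, 1e-12), 1.0 - 1e-12)  # guard against log(0)
        total -= t * math.log(y) + (1 - t) * math.log(1 - y)
    return total / len(data)

def backprop_step(weights, x, t, lr):
    """One gradient step on all layers, the lower ones included."""
    acts = forward(weights, x)
    # Sigmoid output with cross-entropy loss: dLoss/dz = y - t.
    delta = [acts[-1][0] - t]
    for k in reversed(range(len(weights))):
        W, inp = weights[k], acts[k]
        below = None
        if k > 0:
            # Error signal for the layer below, computed before W is updated.
            below = [
                sum(W[j][i] * delta[j] for j in range(len(W))) * inp[i] * (1.0 - inp[i])
                for i in range(len(inp))
            ]
        for j in range(len(W)):
            for i in range(len(inp)):
                W[j][i] -= lr * delta[j] * inp[i]
        delta = below

# Toy task (assumed): logical OR, with a constant 1.0 input acting as a bias.
data = [
    ([0.0, 0.0, 1.0], 0),
    ([0.0, 1.0, 1.0], 1),
    ([1.0, 0.0, 1.0], 1),
    ([1.0, 1.0, 1.0], 1),
]

# Placeholder for the greedily learned stack (random here, not actually pre-trained).
weights = [rand_matrix(4, 3), rand_matrix(4, 4)]
# New output layer added on top for discrimination.
weights.append(rand_matrix(1, 4))

before = mean_loss(weights, data)
for epoch in range(2000):
    for x, t in data:
        backprop_step(weights, x, t, lr=0.5)
after = mean_loss(weights, data)

print("loss before fine-tuning:", before)
print("loss after fine-tuning: ", after)
```

In practice the starting weights would come from the greedy layer-wise learning, so backpropagation only needs to adjust an already useful set of features rather than discover them from scratch.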