Learning multiple layers of features greedily

Back-fitting

•

After we have learned all the layers greedily, the

weights in the lower layers will no longer be

optimal.

•

The weights in the lower layers can be fine-

tuned in several ways.

–

For the generative model that comes next, the

fine-tuning involves a complicated and slow

stochastic learning procedure.

–

If our ultimate goal is discrimination, we can

use backpropagation for the fine-tuning.