• After we have learned all the layers greedily, the weights in the lower layers will no longer be optimal.
• The weights in the lower layers can be fine-tuned in several ways.
  – For the generative model that comes next, the fine-tuning involves a complicated and slow stochastic learning procedure.
  – If our ultimate goal is discrimination, we can use backpropagation for the fine-tuning.
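
The discriminative option can be illustrated with a minimal NumPy sketch. It is not the procedure from these slides, only an illustration under stated assumptions: no greedy pretraining is run, so the "pretrained" lower-layer weights are random stand-ins; the layer sizes, learning rate, toy data, and the function names `forward` and `finetune_step` are all hypothetical. The point it shows is that backpropagation updates every layer, including the lower ones, once a label layer is placed on top.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def forward(x, weights, biases):
    """Return the activations of every layer; the top layer is a softmax."""
    acts = [x]
    for W, b in zip(weights[:-1], biases[:-1]):
        acts.append(sigmoid(acts[-1] @ W + b))
    logits = acts[-1] @ weights[-1] + biases[-1]
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(logits)
    acts.append(e / e.sum(axis=1, keepdims=True))
    return acts

def cross_entropy(probs, y_onehot):
    return -np.mean(np.sum(y_onehot * np.log(probs + 1e-12), axis=1))

def finetune_step(x, y_onehot, weights, biases, lr=0.5):
    """One backpropagation step that updates ALL layers in place.

    Returns the loss measured before the update.
    """
    acts = forward(x, weights, biases)
    delta = (acts[-1] - y_onehot) / x.shape[0]  # gradient w.r.t. output logits
    for i in reversed(range(len(weights))):
        grad_W = acts[i].T @ delta
        grad_b = delta.sum(axis=0)
        if i > 0:
            # Push the error through the sigmoid layer below,
            # using the weights as they were before this update.
            delta = (delta @ weights[i].T) * acts[i] * (1.0 - acts[i])
        weights[i] -= lr * grad_W
        biases[i] -= lr * grad_b
    return cross_entropy(acts[-1], y_onehot)

rng = np.random.default_rng(0)

# ASSUMPTION: random stand-ins for weights that greedy layer-wise
# learning would normally provide (4 inputs, two hidden layers of 8 units).
sizes = [4, 8, 8]
weights = [rng.normal(0, 0.1, (a, b)) for a, b in zip(sizes[:-1], sizes[1:])]
biases = [np.zeros(b) for b in sizes[1:]]

# New output layer for a two-class discrimination task.
weights.append(rng.normal(0, 0.1, (sizes[-1], 2)))
biases.append(np.zeros(2))

# Toy labelled data (hypothetical): the class depends on the first two inputs.
x = rng.normal(size=(64, 4))
labels = (x[:, 0] + x[:, 1] > 0).astype(int)
y = np.eye(2)[labels]

lower_before = weights[0].copy()
losses = [finetune_step(x, y, weights, biases) for _ in range(200)]
```

Comparing `losses[0]` with `losses[-1]`, and `lower_before` with `weights[0]`, shows whether the loss fell and whether the lowest layer moved during fine-tuning.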