 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
|
Train an
auto-encoder using 30
|
|
|
logistic units
for the code layer.
|
|
|
During the
fine-tuning stage,
|
|
|
add noise to the
inputs to the
|
|
|
code units.
|
|
|
|
The
noise vector for each
|
|
|
training
case is fixed. So we
|
|
|
still
get a deterministic
|
|
|
gradient.
|
|
|
|
The
noise forces their
|
|
|
activities to become bimodal
|
|
in
order to resist the effects
|
|
|
of
the noise.
|
|
|
|
Then
we simply round the
|
|
|
activities
of the 30 code units
|
|
to
1 or 0.
|
|