 |
 |
 |
 |
 |
 |
 |
The network
learns a density model for
|
|
unlabeled digit
images. When we generate
|
|
from the model we
often get things that look
|
like real digits
of all classes.
|
|
|
But do the hidden
features really help with
|
|
digit discrimination?
|
|
|
Add 10 softmaxed
units to the top and do
|
|
backpropagation.
This gets 1.15% errors.
|
|