 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
 |
| • |
If we really
need distributed representations (which we
|
|
|
nearly always
do), we can make inference much simpler
|
|
|
by using three
tricks:
|
|
|
|
– |
Use
an RBM for the interactions between hidden and
|
|
|
visible
variables. This ensures that the main source of
|
|
information
wants the posterior to be factorial.
|
|
|
|
– |
Include
short-range temporal information in each
|
|
|
time-slice
by concatenating several frames into one
|
|
|
visible
vector.
|
|
|
|
– |
Treat
the hidden variables in the previous time slice
|
|
|
as
additional fixed inputs (no smoothing).
|
|