Time series models
If we really need distributed representations (which we
nearly always do), we can make inference much simpler
by using three tricks:
Use an RBM for the interactions between hidden and
visible variables. This ensures that the main source of
information wants the posterior to be factorial.
Include short-range temporal information in each
time-slice by concatenating several frames into one
visible vector.
Treat the hidden variables in the previous time slice
as additional fixed inputs (no smoothing).