Recursive Restricted Boltzmann Machines
Is learning a model of the hidden activities just a hack?
It does not seem as if we are learning a proper
multilayer model because the lower weights do not
depend on the higher ones.
Can we treat the hidden layers of the whole stack of
RBM’s as part of one big generative model rather than a
model plus a model of a model etc.?
If it is one big model, it definitely is not a Boltzmann
machine. The first hidden layer has two sets of
weights (above and below) which would make the
hidden activities very different from the activities of
those units in either of the RBM’s.