Learning multiple layers of features greedily

Recursive Restricted Boltzmann Machines

•

Is learning a model of the hidden activities just a hack?

–

It does not seem as if we are learning a proper

multilayer model because the lower weights do not

depend on the higher ones.

•

Can we treat the hidden layers of the whole stack of

RBM’s as part of one big generative model rather than a

model plus a model of a model etc.?

–

If it is one big model, it definitely is not a Boltzmann

machine. The first hidden layer has two sets of

weights (above and below) which would make the

hidden activities very different from the activities of

those units in either of the RBM’s.