Learning multiple layers of features greedily

Recursive Restricted Boltzmann Machines


•	First learn a layer of
	hidden features.

•	Then treat the feature
	activations as data
	and learn a second
	layer of hidden
	features.

•	And so on for as
	many hidden layers
	as we want.

second layer of features

RBM2


data is activities of
first layer of features

first layer of features

RBM1

data