NIPS 2007 Tutorial on Deep Belief Nets

etc.


•	First learn with all the weights tied

		–	This is exactly equivalent to
			learning an RBM

		–	Contrastive divergence learning
			is equivalent to ignoring the small
			derivatives contributed by the tied
			weights between deeper layers.