Multimodal Learning with Deep Boltzmann Machines

[paper] [supplementary material] [poster] [video] [presentation]
Extended JMLR version [paper][bibtex]

Code

Code for training deep models: Deepnet

These were made by taking a multimodal query and reconstructing it after running mean-field inference in the model.
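
As a rough illustration of reconstruction after mean-field inference, here is a minimal NumPy sketch. It is not the Deepnet implementation: it assumes a single pathway with binary units and two hidden layers, and the names `mean_field_reconstruct`, `W1`, `W2`, `a`, `b1`, `b2` and `n_iters` are hypothetical. A multimodal model would presumably apply the same kind of update within each modality's pathway and at the layer joining them.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def mean_field_reconstruct(v, W1, W2, a, b1, b2, n_iters=10):
    """Clamp the visible units to v, run mean-field updates on the two
    hidden layers, then reconstruct the visible units from layer 1.

    Assumes binary units throughout and n_iters >= 1 (illustrative only).
    """
    # Start the top layer's mean-field parameters at 0.5 (uninformative).
    mu2 = np.full((v.shape[0], W2.shape[1]), 0.5)
    for _ in range(n_iters):
        # Layer 1 receives input from the visibles below and layer 2 above.
        mu1 = sigmoid(v @ W1 + mu2 @ W2.T + b1)
        # Layer 2 receives input only from layer 1.
        mu2 = sigmoid(mu1 @ W2 + b2)
    # Reconstruction: expected visible activations given layer 1.
    return sigmoid(mu1 @ W1.T + a)

# Toy usage with random parameters (not trained weights).
rng = np.random.default_rng(0)
n_vis, n_h1, n_h2 = 8, 5, 3
W1 = 0.1 * rng.standard_normal((n_vis, n_h1))
W2 = 0.1 * rng.standard_normal((n_h1, n_h2))
a, b1, b2 = np.zeros(n_vis), np.zeros(n_h1), np.zeros(n_h2)
query = rng.integers(0, 2, size=(4, n_vis)).astype(float)
mf_recon = mean_field_reconstruct(query, W1, W2, a, b1, b2)
```

With trained weights, `mf_recon` would hold the model's expected value for each visible unit given the query; here the weights are random, so only the shapes and value range are meaningful.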

These reconstructions were made by going up and down the stack of RBMs used for pretraining the DBM.
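
The up-and-down pass through a stack of RBMs can be sketched as follows. This is a simplified illustration, not the code used for the results on this page: it assumes binary units in every layer (the image model listed below uses a Gaussian RBM at the bottom), uses deterministic activation probabilities instead of sampling, and ignores any weight scaling that DBM pretraining may apply. The function and variable names are hypothetical.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def up_down_reconstruct(v, weights, hid_biases, vis_biases):
    """Propagate v up through each RBM in the stack, then back down.

    weights[i] has shape (size of layer i, size of layer i + 1).
    hid_biases[i] and vis_biases[i] belong to the i-th RBM.
    """
    h = v
    # Upward pass: each RBM's hidden probabilities feed the next RBM.
    for W, c in zip(weights, hid_biases):
        h = sigmoid(h @ W + c)
    # Downward pass: go back through the same RBMs in reverse order.
    for W, b in zip(reversed(weights), reversed(vis_biases)):
        h = sigmoid(h @ W.T + b)
    return h

# Toy usage with random parameters (not trained weights).
rng = np.random.default_rng(0)
sizes = [6, 4, 3]
weights = [0.1 * rng.standard_normal((m, n))
           for m, n in zip(sizes[:-1], sizes[1:])]
hid_biases = [np.zeros(n) for n in sizes[1:]]
vis_biases = [np.zeros(m) for m in sizes[:-1]]
v = rng.integers(0, 2, size=(2, sizes[0])).astype(float)
stack_recon = up_down_reconstruct(v, weights, hid_biases, vis_biases)
```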
Text

Image

Gaussian RBM (Image model 1). Image RBM
2 hidden layer DBN (Binary RBM on top of Image model 1) (Image model 2). Image DBN

DBN model files can be found here or on GitHub.
Model files for DBMs