The
gated softmax
model is a very large mixture of logistic regressors with weight sharing.
This implementation uses Volodymyr Mnih's cudamat package for training on GPUs.
I have been using Python for almost all of my research
since about 2005.
Here are some slides and code-snippets from a ten-minute mini-tutorial I
gave to the machine learning group in Toronto in 2007
on switching from Matlab to Python.